IJAIEM

International journal of application or innovation in engineering
and management
ISSN:2319-4847

Abstract

SPAM DETECTION USING TEXT CLUSTERING

Yethirajam Yuganand Sree, Bomma Prashanth , Dovoor Shiva Kumar, V. ChandraPrakash

Abstract

We propose a new spam detection technique using the text clustering based on vector space model. Our method computes disjoint clusters automatically using a spherical k-means algorithm for all spam/non-spam mails and obtains centroid vectors of the clusters for extracting the cluster description. For each centroid vectors, the label(‘spam‘ or ‘non-spam‘) is assigned by calculating the number of spam email in the cluster. When new mail arrives, the cosine similarity between the new mail vector and centroid vector is calculated. Finally, the label of the most relevant cluster is assigned to the new mail. By using our method, we can extract many kinds of topics in spam/non-spam email and detect the spam email efficiently. In this paper, we describe the our spam detection system and show the result of our experiments using the Ling-Spam test collection.

IMPORTANT LINKS

Plagiarism

Check Article for

Plagiarism


UPDATES

  • call for paper:
    volume8
  • issue-1 october 2024
  • Submission date:
    22.10.2024

  • publishing date:28.10.2024

INDEXED BY: