Published: 2024-08-20

Optimization of K Value in KNN Algorithm for Spam and HAM Classification in SMS Texts

DOI: 10.35870/ijsecs.v4i2.2681

Ferryma Arba Apriansyah, Arief Hermawan, Donny Avianto

Abstract

Spam refers to the unsolicited and repetitive sending of messages to others via electronic devices without their consent. This activity, commonly known as spamming, is typically carried out by individuals referred to as spammers. SMS spam, which often originates from unknown sources, frequently contains advertisements, phishing attempts, scams, and even malware. Such spam messages can be pervasive, affecting almost all mobile phone numbers, thereby causing significant disruptions to communication by delivering irrelevant content. The persistent nature of spam messages underscores the need for effective filtering mechanisms. This study investigates the application of the K-Nearest Neighbors (KNN) algorithm for classifying SMS messages as either spam or non-spam (ham). The findings demonstrate that KNN, when optimized through various methods for determining the appropriate value of K, can achieve an impressive average accuracy of 99.16% in classifying SMS spam. This high level of accuracy indicates that KNN is a reliable method for spam detection.

Keywords

Classification ; KNN ; SMS Spam

Peer Review Process

This article has undergone a double-blind peer review process to ensure quality and impartiality.

Indexing Information

Discover where this journal is indexed at our indexing page to understand its reach and credibility.

Open Science Badges

This journal supports transparency in research and encourages authors to meet criteria for Open Science Badges by sharing data, materials, or preregistered studies.

Issue Cover

Downloads

Article Metrics

If the link doesn't work, copy the DOI or article title for manual search (API Maintenance).

Share:
Article Information

This article has been peer-reviewed and published in the International Journal Software Engineering and Computer Science (IJSECS). The content is available under the terms of the Creative Commons Attribution 4.0 International License.

  • Issue: Vol. 4 No. 2 (2024)

  • Section: Articles

  • Published: August 20, 2024

AI Research Hub

This article is indexed and available through various AI-powered research tools and citation platforms. Our AI Research Hub ensures that scholarly work is discoverable, accessible, and easily integrated into the global research ecosystem. By leveraging artificial intelligence for indexing, recommendation, and citation analysis, we enhance the visibility and impact of published research.

Semantic Scholar Scite Dimensions Connected Papers

Similar Articles

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)