Published: 2025-04-01
Sales Data Clustering Using the K-Means Algorithm to Determine Retail Product Needs
DOI: 10.35870/ijsecs.v5i1.4090
Riwan Irosucipto Manarung, Edy Widodo, Anggi Muhammad Rifai
- Riwan Irosucipto Manarung: Universitas Pelita Bangsa Bekasi
- Edy Widodo: Universitas Pelita Bangsa Bekasi
- Anggi Muhammad Rifai: Universitas Pelita Bangsa Bekasi
Abstract
Sales data is a systematic record of transactional behavior with goods or services distributed over time boundaries and furnishes primary key business metrics for evaluating and planning. Using the K-Means clustering algorithm, this research segments retail product demand by differences attributes to identify demand patterns. The iterative process of clustering ended at the fifth cycle after the division of objects in each cluster stabilized, which can serve as a sign that we arrive at an optimal solution. Results showed that the first cluster located at a centroid 94, 6 contains 100 data items belonging in a primary set and similarly fifth cluster (same centroid) had also same number of products. The automated approach of Collaboratory also differs from the manual method where there are not pre-defined cluster initial values in our preliminary setup. Despite this procedural difference, there is a remarkable concision in the results which demonstrates the strength of the method when implemented using different ingrained constructions. These results offer some refined results on product classification, which is essential to solve the problem that retail ranks may vary during inventory management and sales optimization.
Keywords
Data Mining ; Clustering Techniques ; K-Means Algorithm ; Sales Analytics ; Retail Product Demand
Article Metadata
Peer Review Process
This article has undergone a double-blind peer review process to ensure quality and impartiality.
Indexing Information
Discover where this journal is indexed at our indexing page to understand its reach and credibility.
Open Science Badges
This journal supports transparency in research and encourages authors to meet criteria for Open Science Badges by sharing data, materials, or preregistered studies.
How to Cite
Article Metrics
- Views0
- Downloads0
- Scopus Citations
- Google Scholar
- Crossref Citations
- Semantic Scholar
- DataCite Metrics
If the link doesn't work, copy the DOI or article title for manual search (API Maintenance).
Article Information
This article has been peer-reviewed and published in the International Journal Software Engineering and Computer Science (IJSECS). The content is available under the terms of the Creative Commons Attribution 4.0 International License.
-
Issue: Vol. 5 No. 1 (2025)
-
Section: Articles
-
Published: April 1, 2025
-
License: CC BY 4.0
-
Copyright: © 2025 Authors
-
DOI: 10.35870/ijsecs.v5i1.4090
AI Research Hub
This article is indexed and available through various AI-powered research tools and citation platforms. Our AI Research Hub ensures that scholarly work is discoverable, accessible, and easily integrated into the global research ecosystem. By leveraging artificial intelligence for indexing, recommendation, and citation analysis, we enhance the visibility and impact of published research.




Riwan Irosucipto Manarung
Informatics Engineering Study Program, Faculty of Engineering, Universitas Pelita Bangsa Bekasi, Bekasi Regency, West Java Province, Indonesia
Edy Widodo
Informatics Engineering Study Program, Faculty of Engineering, Universitas Pelita Bangsa Bekasi, Bekasi Regency, West Java Province, Indonesia
-
Agussalim, A. (2023). Sales product clustering using RFM calculation model and K-Means algorithm on Primskystore. Technium: Romanian Journal of Applied Sciences and Technology, 16, 176-182. https://doi.org/10.47577/technium.v16i.9978
-
Astuti, J., & Yuniarti, T. (2023). Data mining modeling in clustering car products sales data in the automotive industry in Indonesia. Jurnal Manajemen Industri dan Logistik, 7(2), 261-281. https://doi.org/10.30988/jmil.v7i2.1258
-
Basuki, A. T. A., Paramastri, Z. P., Rachmayanti, D. F., Chan, D. A., & Aldo, D. (2024). Penerapan algoritma K-Means untuk pengelompokan pola penjualan oleh-oleh khas Medan. In Proceedings of the National Conference on Electrical Engineering, Informatics, Industrial Technology, and Creative Media (Vol. 4, No. 1, pp. 1194-1200).
-
Bilgiç, E., Çakır, Ö., Kantardzic, M., Duan, Y., & Cao, G. (2021). Retail analytics: Store segmentation using rule-based purchasing behavior analysis. The International Review of Retail, Distribution and Consumer Research, 31(4), 457-480. https://doi.org/10.1080/09593969.2021.1915847
-
Darmi, Y. D., & Setiawan, A. (2017). Penerapan metode clustering K-Means dalam pengelompokan penjualan produk. Jurnal Media Infotama, 12(2). https://doi.org/10.37676/jmi.v12i2.418
-
Dolega, L., Reynolds, J., Singleton, A., & Pavlis, M. (2019). Beyond retail: New ways of classifying UK shopping and consumption spaces. Environment and Planning B: Urban Analytics and City Science, 48(1), 132-150. https://doi.org/10.1177/2399808319840666
-
Firdausi, F., Afrianto, E., & Wiranto, F. (2023). Pastry sales data clustering with K-Means clustering approach for product grouping. Inside, 1(1), 36-41. https://doi.org/10.31967/inside.v1i1.887
-
Firman, F., Wulandari, N., & Irawan, A. (2020). Perancangan sistem informasi jurnal mengajar dosen berbasis web pada Fakultas Keguruan dan Ilmu Pendidikan Universitas Pendidikan Muhammadiyah Sorong. Jurnal Petisi (Pendidikan Teknologi Informasi), 1(1), 33-43. https://doi.org/10.36232/jurnalpetisi.v1i1.386
-
-
Lestari, S., Nazila, R., Ulhar, L. A., & Zidane, M. (2024). Implementasi data mining dalam menentukan penjualan alat perabot dengan menggunakan metode K-Means clustering pada PT. XYZ. Jurnal Sistem Informasi dan Ilmu Komputer, 2(1), 271-278. https://doi.org/10.59581/jusiik-widyakarya.v2i1.2824
-
-
-
-
-
Rosidi, R. P. M., & Setiawan, K. (2024). Implementasi algoritma Naïve Bayes terhadap data penjualan untuk mengetahui pola pembelian konsumen pada kantin. Jurnal Indonesia: Manajemen Informatika dan Komunikasi, 5(1), 120-126. https://doi.org/10.35870/jimik.v5i1.407
-
Setiawan, M., & Rino, R. (2022). Design and application of K-Means method to predict sales at Arya Elektrik stores. Bit-Tech, 5(2), 67-76. https://doi.org/10.32877/bt.v5i2.562
-
-
-
-
Widodo, E., & Hadikristanto, W. (2023). Pengelompokan untuk penjualan obat dengan menggunakan algoritma K-Means. Bulletin of Information Technology (BIT), 4(3), 408-413. https://doi.org/10.47065/bit.v4i3.848
-
Yahya, A., & Kurniawan, R. (2025). Implementation of K-Means algorithm for clustering sales data based on sales patterns. Malcom, 5(1), 350-358. https://journal.irpi.or.id/index.php/malcom.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Copyright and Licensing Agreement
Authors who publish with this journal agree to the following terms:
1. Copyright Retention and Open Access License
- Authors retain full copyright of their work
- Authors grant the journal right of first publication under the Creative Commons Attribution 4.0 International License (CC BY 4.0)
- This license allows unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited
2. Rights Granted Under CC BY 4.0
Under this license, readers are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, including commercial use
- No additional restrictions — the licensor cannot revoke these freedoms as long as license terms are followed
3. Attribution Requirements
All uses must include:
- Proper citation of the original work
- Link to the Creative Commons license
- Indication if changes were made to the original work
- No suggestion that the licensor endorses the user or their use
4. Additional Distribution Rights
Authors may:
- Deposit the published version in institutional repositories
- Share through academic social networks
- Include in books, monographs, or other publications
- Post on personal or institutional websites
Requirement: All additional distributions must maintain the CC BY 4.0 license and proper attribution.
5. Self-Archiving and Pre-Print Sharing
Authors are encouraged to:
- Share pre-prints and post-prints online
- Deposit in subject-specific repositories (e.g., arXiv, bioRxiv)
- Engage in scholarly communication throughout the publication process
6. Open Access Commitment
This journal provides immediate open access to all content, supporting the global exchange of knowledge without financial, legal, or technical barriers.