Analisis Sentimen Complain dan Bukan Complain pada Twitter Telkomsel dengan SMOTE dan Naïve Bayes

Main Article Content

Budi Kurniawan
Achmad Suwarisman
Iis Afriyanti
Aditya Wahyudi
Dedi Dwi Saputra

Abstract

This analysis aims to find out the public sentiment towards Telkomsel posted on Indonesian twitter, which makes market research on public opinion very useful. The dataset was taken from Twitter social media in a query Indonesian by crawling method using the RapidMiner application and the result of crawling the data set there were 1000 tweets with sentiment complaints and not complaints. Therefore, from 1000 tweets, preprocessing will be carried out with the SMOTE Upsampling and Naivebayes methods as well as several filtering such as transform case, tokenize, tokenize (by length) stemming filters and stopwords so that the data can stay in words and there is a balance in the sentiment on the dataset. It can be concluded that in the classification of sentiment there is a balance between complaints and non-complaints as many as 581. Where the accuracy rating level is 81.58%, the precision assessment is 86.82% and the recall assessment is 74.87 and the resulting AUC is 0.803.

Downloads

Download data is not yet available.

Article Details

How to Cite
Kurniawan, B., Suwarisman, A., Afriyanti, I., Wahyudi, A., & Saputra, D. D. (2023). Analisis Sentimen Complain dan Bukan Complain pada Twitter Telkomsel dengan SMOTE dan Naïve Bayes. Jurnal JTIK (Jurnal Teknologi Informasi Dan Komunikasi), 7(1), 106–113. https://doi.org/10.35870/jtik.v7i1.691
Section
Computer & Communication Science
Author Biographies

Budi Kurniawan, Universitas Nusamandiri

Program Studi Sistem Informasi, Fakultas Teknologi Informasi, Universitas Nusamandiri

Achmad Suwarisman, Universitas Nusamandiri

Program Studi Sistem Informasi, Fakultas Teknologi Informasi, Universitas Nusamandiri

Iis Afriyanti, Universitas Nusamandiri

Program Studi Sistem Informasi, Fakultas Teknologi Informasi, Universitas Nusamandiri

Aditya Wahyudi, Universitas Nusamandiri

Program Studi Sistem Informasi, Fakultas Teknologi Informasi, Universitas Nusamandiri

Dedi Dwi Saputra, Universitas Nusamandiri

Program Studi Sistem Informasi, Fakultas Teknologi Informasi, Universitas Nusamandiri

References

Suryono, S., Utami, E. and Luthfi, E.T., 2018. Analisis Sentiment Pada Twitter Dengan Menggunakan Metode Naïve Bayes Classifier. Seminar Nasional GEOTIK 2018.

Cambria, E., Schuller, B., Xia, Y. and Havasi, C., 2013. New avenues in opinion mining and sentiment analysis. IEEE Intelligent systems, 28(2), pp.15-21. DOI: 10.1109/MIS.2013.30.

Liu, B., 2012. Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1), pp.1-167.

Deepa, N., Priya, J.S. and Devi, T., 2022. Towards applying internet of things and machine learning for the risk prediction of COVID-19 in pandemic situation using Naive Bayes classifier for improving accuracy. Materials Today: Proceedings. DOI: https://doi.org/10.1016/j.matpr.2022.03.345.

Naraswati, N.P.G., Nooraeni, R., Rosmilda, D.C., Desinta, D., Khairi, F. and Damaiyanti, R., 2021. Analisis Sentimen Publik dari Twitter Tentang Kebijakan Penanganan Covid-19 di Indonesia dengan Naive Bayes Classification. Sistemasi: Jurnal Sistem Informasi, 10(1), pp.222-238. DOI: https://doi.org/10.32520/stmsi.v10i1.1179.

Sari, A. C. et al. 2019. Komunikasi dan media sosial. Reserachgate.net.

Statistic Brain. 2013. Twitter statistics. Available at: http://www.statisticbrain.com/ twitter-statistics.

Olofinlua, T. 2019. Twitter: social communication in the twitter age, Information, Communication & Society, 22(13), pp. 2037–2038. DOI: 10.1080/1369118x.2019.1620824.

Vu, D.H., 2022. Privacy-preserving Naive Bayes classification in semi-fully distributed data model. Computers & Security, 115, p.102630. DOI: https://doi.org/10.1016/j.cose.2022.102630.

Chawla, N.V., Bowyer, K.W., Hall, L.O. and Kegelmeyer, W.P., 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, pp.321-357. DOI: https://doi.org/10.1613/jair.953.

Zhang, A., Yu, H., Zhou, S., Huan, Z. and Yang, X., 2022. Instance weighted SMOTE by indirectly exploring the data distribution. Knowledge-Based Systems, 249, p.108919. doi: https://doi.org/10.1016/j.knosys.2022.108919

Anandarajan, M., Hill, C. and Nolan, T. 2019 Text Preprocessing BT - Practical Text Analytics: Maximizing the Value of Text Data, in Anandarajan, M., Hill, C., and Nolan, T. (eds). Cham: Springer International Publishing, pp. 45–59. DOI: 10.1007/978-3-319-95663-3_4.

Samuel, N. 2010. Naive Bayes Classifier dan Penggunaannya pada Klasifikasi Dokumen. Bandung: Institut Teknologi Bandung.

Bustami, B., 2013. Penerapan algoritma Naive Bayes untuk mengklasifikasi data nasabah asuransi. TECHSI-Jurnal Teknik Informatika, 5(2). DOI: 10.26555/jifo.v8i1.a2086.

Singh, M., Bhatt, M.W., Bedi, H.S. and Mishra, U., 2020. Performance of bernoulli’s naive bayes classifier in the detection of fake news. Materials Today: Proceedings. DOI: https://doi.org/10.1016/j.matpr.2020.10.896.

Keller, K.L. and Lehmann, D.R., 2006. Brands and branding: Research findings and future priorities. Marketing science, 25(6), pp.740-759. DOI: https://doi.org/10.1287/mksc.1050.0153.

Tanesab, F.I., Sembiring, I. and Purnomo, H.D., 2017. Sentiment analysis model based on Youtube comment using support vector machine. International Journal of Computer Science and Software Engineering, 6(8), p.180.

Gunawan, B., Sastypratiwi, H. and Pratama, E.E., 2018. Sistem Analisis Sentimen pada Ulasan Produk Menggunakan Metode Naive Bayes. JEPIN (Jurnal Edukasi dan Penelitian Informatika), 4(2), pp.113-118. DOI: 10.26418/jp.v4i2.27526.

Sembodo, J.E., Setiawan, E.B. and Baizal, Z.A., 2016, October. Data Crawling Otomatis pada Twitter. In Indonesian Symposium on Computing (Indo-SC) (pp. 11-16). DOI: 10.21108/indosc.2016.111.

Kotsiantis, S.B., Kanellopoulos, D. and Pintelas, P.E., 2006. Data preprocessing for supervised leaning. International journal of computer science, 1(2), pp.111-117.