Analisis Sentimen Complain dan Bukan Complain pada Twitter Telkomsel dengan SMOTE dan Naïve Bayes
Main Article Content
Abstract
This analysis aims to find out the public sentiment towards Telkomsel posted on Indonesian twitter, which makes market research on public opinion very useful. The dataset was taken from Twitter social media in a query Indonesian by crawling method using the RapidMiner application and the result of crawling the data set there were 1000 tweets with sentiment complaints and not complaints. Therefore, from 1000 tweets, preprocessing will be carried out with the SMOTE Upsampling and Naivebayes methods as well as several filtering such as transform case, tokenize, tokenize (by length) stemming filters and stopwords so that the data can stay in words and there is a balance in the sentiment on the dataset. It can be concluded that in the classification of sentiment there is a balance between complaints and non-complaints as many as 581. Where the accuracy rating level is 81.58%, the precision assessment is 86.82% and the recall assessment is 74.87 and the resulting AUC is 0.803.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to JTIK journal and Research Division, KITA Institute as the publisher of the journal. Copyright encompasses rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations.
JTIK journal and Research Division, KITA Institute and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in JTIK journal are the sole and exclusive responsibility of their respective authors and advertisers.
The Copyright Transfer Form can be downloaded here: [Copyright Transfer Form JTIK]. The copyright form should be signed originally and send to the Editorial Office in the form of original mail, scanned document or fax :
Muhammad Wali (Editor-in-Chief)
Editorial Office of Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi)
Research Division, KITA Institute
Teuku Nyak Arief Street Nomor : 7b, Lamnyong, Lamgugop, Kota Banda Aceh
Telp./Fax: 0651-8070141
Email: jtik@lembagakita.org - journal@lembagakita.org
References
Suryono, S., Utami, E. and Luthfi, E.T., 2018. Analisis Sentiment Pada Twitter Dengan Menggunakan Metode Naïve Bayes Classifier. Seminar Nasional GEOTIK 2018.
Cambria, E., Schuller, B., Xia, Y. and Havasi, C., 2013. New avenues in opinion mining and sentiment analysis. IEEE Intelligent systems, 28(2), pp.15-21. DOI: 10.1109/MIS.2013.30.
Liu, B., 2012. Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1), pp.1-167.
Deepa, N., Priya, J.S. and Devi, T., 2022. Towards applying internet of things and machine learning for the risk prediction of COVID-19 in pandemic situation using Naive Bayes classifier for improving accuracy. Materials Today: Proceedings. DOI: https://doi.org/10.1016/j.matpr.2022.03.345.
Naraswati, N.P.G., Nooraeni, R., Rosmilda, D.C., Desinta, D., Khairi, F. and Damaiyanti, R., 2021. Analisis Sentimen Publik dari Twitter Tentang Kebijakan Penanganan Covid-19 di Indonesia dengan Naive Bayes Classification. Sistemasi: Jurnal Sistem Informasi, 10(1), pp.222-238. DOI: https://doi.org/10.32520/stmsi.v10i1.1179.
Sari, A. C. et al. 2019. Komunikasi dan media sosial. Reserachgate.net.
Statistic Brain. 2013. Twitter statistics. Available at: http://www.statisticbrain.com/ twitter-statistics.
Olofinlua, T. 2019. Twitter: social communication in the twitter age, Information, Communication & Society, 22(13), pp. 2037–2038. DOI: 10.1080/1369118x.2019.1620824.
Vu, D.H., 2022. Privacy-preserving Naive Bayes classification in semi-fully distributed data model. Computers & Security, 115, p.102630. DOI: https://doi.org/10.1016/j.cose.2022.102630.
Chawla, N.V., Bowyer, K.W., Hall, L.O. and Kegelmeyer, W.P., 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, pp.321-357. DOI: https://doi.org/10.1613/jair.953.
Zhang, A., Yu, H., Zhou, S., Huan, Z. and Yang, X., 2022. Instance weighted SMOTE by indirectly exploring the data distribution. Knowledge-Based Systems, 249, p.108919. doi: https://doi.org/10.1016/j.knosys.2022.108919
Anandarajan, M., Hill, C. and Nolan, T. 2019 Text Preprocessing BT - Practical Text Analytics: Maximizing the Value of Text Data, in Anandarajan, M., Hill, C., and Nolan, T. (eds). Cham: Springer International Publishing, pp. 45–59. DOI: 10.1007/978-3-319-95663-3_4.
Samuel, N. 2010. Naive Bayes Classifier dan Penggunaannya pada Klasifikasi Dokumen. Bandung: Institut Teknologi Bandung.
Bustami, B., 2013. Penerapan algoritma Naive Bayes untuk mengklasifikasi data nasabah asuransi. TECHSI-Jurnal Teknik Informatika, 5(2). DOI: 10.26555/jifo.v8i1.a2086.
Singh, M., Bhatt, M.W., Bedi, H.S. and Mishra, U., 2020. Performance of bernoulli’s naive bayes classifier in the detection of fake news. Materials Today: Proceedings. DOI: https://doi.org/10.1016/j.matpr.2020.10.896.
Keller, K.L. and Lehmann, D.R., 2006. Brands and branding: Research findings and future priorities. Marketing science, 25(6), pp.740-759. DOI: https://doi.org/10.1287/mksc.1050.0153.
Tanesab, F.I., Sembiring, I. and Purnomo, H.D., 2017. Sentiment analysis model based on Youtube comment using support vector machine. International Journal of Computer Science and Software Engineering, 6(8), p.180.
Gunawan, B., Sastypratiwi, H. and Pratama, E.E., 2018. Sistem Analisis Sentimen pada Ulasan Produk Menggunakan Metode Naive Bayes. JEPIN (Jurnal Edukasi dan Penelitian Informatika), 4(2), pp.113-118. DOI: 10.26418/jp.v4i2.27526.
Sembodo, J.E., Setiawan, E.B. and Baizal, Z.A., 2016, October. Data Crawling Otomatis pada Twitter. In Indonesian Symposium on Computing (Indo-SC) (pp. 11-16). DOI: 10.21108/indosc.2016.111.
Kotsiantis, S.B., Kanellopoulos, D. and Pintelas, P.E., 2006. Data preprocessing for supervised leaning. International journal of computer science, 1(2), pp.111-117.