Evaluating the Effectiveness of Sentiment Classification Models for Analyzing Public Opinion on Environmental Policy Using Indonesian-Language Social Media Data
DOI:
https://doi.org/10.62383/polygon.v2i2.951
Keywords:
Environmental Policy, Long Short-Term Memory, Natural Language Processing, Public Opinion Analysis, Sentiment Classification
Abstract
Background: Public opinion analysis has become increasingly important in the digital era, as social media platforms generate large volumes of text reflecting public perceptions of environmental policies. Advances in Natural Language Processing (NLP) and machine learning enable systematic sentiment classification to support data-driven decision-making. Objective: This study evaluates the effectiveness of several sentiment classification models in analyzing Indonesian-language social media data related to environmental policies. Method: The research employed a text mining pipeline comprising data crawling, preprocessing (case folding, tokenization, stopword removal, and stemming), and vectorization using TF-IDF. Three classification models, Logistic Regression, Support Vector Machine (SVM), and Long Short-Term Memory (LSTM), were trained and evaluated using accuracy and F1-score. Results: LSTM achieved the highest performance, with 91.7% accuracy and a 91.2% F1-score, outperforming SVM (88.5%) and Logistic Regression (84.2%). Sentiment distribution analysis shows that public opinion is dominated by positive sentiment (47.5%), followed by neutral (32.0%) and negative (20.5%). Conclusion: The results indicate that deep learning-based models provide more robust contextual understanding and more reliable sentiment mapping for environmental policy analysis.
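The preprocessing, TF-IDF, and evaluation steps named in the abstract can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the stopword list is a hypothetical placeholder, stemming is omitted, the TF-IDF weighting (term frequency times ln(N/df)) is one common variant, and the abstract does not say which F1 averaging was used, so macro averaging is assumed here.

```python
import math
import re
from collections import Counter

# Hypothetical, tiny stopword list for illustration only;
# the study's actual stopword list is not given in the abstract.
STOPWORDS = {"yang", "dan", "di", "ini", "itu", "untuk"}

def preprocess(text):
    """Case folding, tokenization, and stopword removal (stemming omitted)."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]

def tfidf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: (count / document length) * ln(N / document frequency).
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty documents
        vectors.append(
            {t: (c / total) * math.log(n / df[t]) for t, c in counts.items()}
        )
    return vectors

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (macro averaging is an assumption)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

In practice, the resulting vectors would be passed to a classifier such as Logistic Regression or SVM, and `accuracy` and `macro_f1` would be computed on a held-out test split.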
Copyright (c) 2024 Polygon : Jurnal Ilmu Komputer dan Ilmu Pengetahuan Alam

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



