Evaluasi Efektivitas Model Klasifikasi Sentimen untuk Analisis Opini Publik terhadap Kebijakan Lingkungan Berdasarkan Data Media Sosial Berbahasa Indonesia

Dada Suhaida; Adisti Primi Wulan; Rosanti Rosanti; Dianna Dianna

doi:10.62383/polygon.v2i2.951

Authors

Dada Suhaida Universitas PGRI Pontianak
Adisti Primi Wulan Universitas PGRI Pontianak
Rosanti Rosanti Universitas PGRI Pontianak
Dianna Dianna Poltekes Kemenkes Pontianak

DOI:

https://doi.org/10.62383/polygon.v2i2.951

Keywords:

Environmental Policy, Long Short-Term Memory, Natural Language Processing, Public Opinion Analysis, Sentiment Classification

Abstract

Background: Public opinion analysis has become increasingly important in the digital era, where social media platforms generate large-scale textual data reflecting public perceptions toward environmental policies. Advances in Natural language processing (NLP) and machine learning enable systematic sentiment classification to support data-driven decision-making. Objective: This study aims to evaluate the effectiveness of several sentiment classification models in analyzing Indonesian-language social media data related to environmental policies. Method: The research employed a text mining pipeline including data crawling, preprocessing (case folding, tokenization, stopword removal, and stemming), and vectorization using TF-IDF. Three classification models Logistic Regression, Support Vector Machine (SVM), and Long Short-Term Memory (LSTM) were trained and evaluated using accuracy and F1-score metrics. Results: Experimental findings indicate that LSTM achieved the highest performance with 91.7% accuracy and 91.2% F1-score, outperforming SVM (88.5%) and Logistic Regression (84.2%). Sentiment distribution analysis shows that public opinion is dominated by positive sentiment (47.5%), followed by neutral (32.0%) and negative (20.5%). Overall: The results demonstrate that deep learning-based models provide more robust contextual understanding and more reliable sentiment mapping for environmental policy analysis.

Downloads

Download data is not yet available.

References

Abdiansah, A., Yusliani, N., Fathoni, F., Nizar, M. F., Salsabella, A., & Davi, A. A. (2024). IDSpider: Indonesian Standard Dataset for Text-to-SQL. Proceedings of the 2024 9th International Conference on Informatics and Computing (ICIC 2024). https://doi.org/10.1109/ICIC64337.2024.10956918

Alameri, S. A., & Mohd, M. (2021). Comparison of Fake News Detection Using Machine learning and Deep learning Techniques. Proceedings of the 2021 3rd International Cyber Resilience Conference (CRC 2021), 9392458. https://doi.org/10.1109/CRC50527.2021.9392458

Astuti, L. W., Sari, Y., & Suprapto. (2023). Code-mixed Sentiment Analysis Using Transformer for Twitter Social Media Data. International Journal of Advanced Computer Science and Applications, 14(10), 498–504. https://doi.org/10.14569/IJACSA.2023.0141053

Balan, S., Conlon, S., & Reithel, B. (2024). Text Analysis on Green Supply Chain Practices of Electronic Companies. International Journal of Decision Support System Technology, 16(1). https://doi.org/10.4018/IJDSST.358950

Cahyawijaya, S., Lovenia, H., Koto, F., Adhista, D., Dave, E., Oktavianti, S., Akbar, S. M., Lee, J., Shadieq, N., Cenggoro, T. W., Linuwih, H. W., Wilie, B., Muridan, G. P., Winata, G. I., Moeljadi, D., Aji, A. F., Purwarianti, A., & Fung, P. (2023). NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-resource Languages. Proceedings of the 13th International Joint Conference on Natural language processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP–AACL 2023), 1, 921–945. https://doi.org/10.18653/v1/2023.ijcnlp-main.60

Chai, C. P. (2023). Comparison of Text Preprocessing Methods. Natural Language Engineering, 29(3), 509–553. https://doi.org/10.1017/S1351324922000213

Chen, C., & Hu, X. (2024). The Research on an Online Review Sentiment Analysis Model Based on Improved RoBERTa. Proceedings of the 2024 3rd International Conference on Electronics and Information Technology (EIT 2024), 624–627. https://doi.org/10.1109/EIT63098.2024.10762224

Eryigit, G. (2014). ITU Turkish NLP Web Service. Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), 1–4. https://doi.org/10.3115/v1/E14-2001

Fauzan, R., Labib, M. I. A., Johannis, J. O. T., Herlinawati, Noor, S., & Saifulah. (2022). Semantic Similarity of Indonesian Sentences Using Natural language processing and Cosine Similarity. Proceedings of the 2022 4th International Conference on Cybernetics and Intelligent System (ICORIS 2022). https://doi.org/10.1109/ICORIS56080.2022.10031439

Freeda, A., Anju, A., Venket, K., Dhaya, K., Kanthavel, R., & Vijay, F. (2024). Sentiment Analysis and Text Mining in Environmental Sustainability and Climate Change. In Text Mining and Sentiment Analysis in Climate Change and Environmental Sustainability (pp. 367–384). IGI Global. https://doi.org/10.4018/979-8-3693-7230-2.ch020

Giabbanelli, P. J., Adams, J., & Sai Pillutla, V. (2016). Feasibility and Framing of Interventions Based on Public Support: Leveraging Text Analytics for Policymakers. Lecture Notes in Computer Science, 9742, 188–200. https://doi.org/10.1007/978-3-319-39910-2_18

Gupta, S., & Arora, B. (2022). Stemming Techniques on English Language and Devanagari Script: A Review. Lecture Notes in Electrical Engineering, 832, 541–550. https://doi.org/10.1007/978-981-16-8248-3_45

Hadiprakoso, R. B., Setiawan, H., Yasa, R. N., & Girinoto. (2023). Text Preprocessing for Optimal Accuracy in Indonesian Sentiment Analysis Using a Deep learning Model with Word Embedding. AIP Conference Proceedings, 2680(1), 20050. https://doi.org/10.1063/5.0126116

Jiang, H., Qiang, M., & Lin, P. (2016). Assessment of Online Public Opinions on Large Infrastructure Projects: A Case Study of the Three Gorges Project in China. Environmental Impact Assessment Review, 61, 38–51. https://doi.org/10.1016/j.eiar.2016.06.004

Jiang, S., Li, S., Fu, S., & Lin, N. (2020). An Overview of Natural language processing for Indonesian and Malay. Pattern Recognition and Artificial Intelligence, 33(6), 530–541. https://doi.org/10.16451/j.cnki.issn1003-6059.202006006

Kusumawati, R., D’Arofah, A., & Pramana, P. A. (2019). Comparison Performance of Naive Bayes Classifier and Support Vector Machine Algorithm for Twitter’s Classification of Tokopedia Services. Journal of Physics: Conference Series, 1320(1), 12016. https://doi.org/10.1088/1742-6596/1320/1/012016

Literature Review on Public Opinion Identification and Analysis in Emergencies. (2021). Documentation, Information and Knowledge, 38(1), 93–102. https://doi.org/10.13366/j.dik.2021.01.093

Mahdi, Z. M., Istiqomah, R. F., Alfarelzi, A., Astuti, S., Asror, I., & Mayasari, R. (2024). Text Classification Using NLP by Comparing LSTM and Machine learning Method. Proceedings of the 2024 10th International Conference on Wireless and Telematics (ICWT 2024). https://doi.org/10.1109/ICWT62080.2024.10674679

Omar, A., & Hamouda, W. I. (2021). A Sentiment Analysis of Egypt’s New Real Estate Registration Law on Facebook. International Journal of Advanced Computer Science and Applications, 12(4), 656–663. https://doi.org/10.14569/IJACSA.2021.0120481

Pekar, V., Najafi, H., Binner, J. M., Swanson, R., Rickard, C., & Fry, J. (2022). Voting Intentions on Social Media and Political Opinion Polls. Government Information Quarterly, 39(4), 101658. https://doi.org/10.1016/j.giq.2021.101658

Qi, J., Liu, X., Yuan, M., & Gu, H. (2022). Design and Implementation of Weibo Public Opinion Analysis System. Lecture Notes in Electrical Engineering, 961, 1185–1195. https://doi.org/10.1007/978-981-19-6901-0_124

Rahmadi, N., Sudirman, S., Prastyo, A. B., Iswantoro, M. A., Utami, E., & Yaqin, A. (2023). Exploring the Boundless Potential of Deep learning in Gender Prediction from Indonesian Names. Proceedings of the 2023 6th International Conference on Vocational Education and Electrical Engineering (ICVEE 2023), 1–6. https://doi.org/10.1109/ICVEE59738.2023.10348281

Raj, H., Weihong, Y., Banbhrani, S. K., & Dino, S. P. (2018). LSTM-Based Short Message Service (SMS) Modeling for Spam Classification. Proceedings of the ACM International Conference Proceeding Series, 76–80. https://doi.org/10.1145/3231884.3231895

Ramdhani, M. A., Maylawati, D. S., & Mantoro, T. (2020). Indonesian News Classification Using Convolutional Neural Network. Indonesian Journal of Electrical Engineering and Computer Science, 19(2), 1000–1009. https://doi.org/10.11591/ijeecs.v19.i2.pp1000-1009

Rianto, Mutiara, A. B., Wibowo, E. P., & Santosa, P. I. (2021). Improving Stemming Techniques for Non-Formal Indonesian Sentences Using Incorbiz. ICIC Express Letters, 15(1), 67–74. https://doi.org/10.24507/icicel.15.01.67

Santhiya, P., Kogilavani, S. V, & Malliga, S. (2021). Sentiment Analysis Classifiers for Polarity Detection in Social Media Text: A Comparative Study. Proceedings of the 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA 2021), 1407–1411. https://doi.org/10.1109/ICECA52323.2021.9676111

Sumathi, N., & Sheela, T. (2017). An Efficient Sentiment Analysis by Using Hybrid Naive Bayes and SVM Approach in Banking Institutions. International Journal of Civil Engineering and Technology, 8(12), 373–391.

Thummala, G. R., & Baskar, R. (2024). Comparison of SVM-Based Heart Disease Prediction with Naive Bayes-Based Prediction on Accuracy. AIP Conference Proceedings, 2853(1), 20180. https://doi.org/10.1063/5.0198178

Tyagi, A., Jain, V. K., & Kumar, V. (2024). Benchmark Text Preprocessing Techniques in Natural language processing. Proceedings of the 2024 4th International Conference on Innovative Sustainable Computational Technologies (CISCT 2024). https://doi.org/10.1109/CISCT62494.2024.11134188

Wang, Y., Huang, X., Li, B., Liu, X., Ma, Y., & Huang, X. (2023). Spreading Mechanism of Weibo Public Opinion Phonetic Representation Based on the Epidemic Model. International Journal of Speech Technology, 26(1), 11–21. https://doi.org/10.1007/s10772-020-09790-z

Wang, Y., Li, H., Zuo, J., & Wang, Z. (2019). Evolution of Online Public Opinions on Social Impact Induced by NIMBY Facility. Environmental Impact Assessment Review, 78, 106290. https://doi.org/10.1016/j.eiar.2019.106290

Wilie, B., Vincentio, K., Winata, G. I., Cahyawijaya, S., Li, X., Lim, Z. Y., Soleman, S., Mahendra, R., Fung, P., Bahar, S., & Purwarianti, A. (2020). IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding. Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural language processing (AACL–IJCNLP 2020), 843–857. https://doi.org/10.18653/v1/2020.aacl-main.85

Winata, G. I., Aji, A. F., Cahyawijaya, S., Mahendra, R., Koto, F., Romadhony, A., Kurniawan, K., Moeljadi, D., Prasojo, R. E., Fung, P., Baldwin, T., Lau, J. H., Sennrich, R., & Ruder, S. (2023). NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), 815–834. https://doi.org/10.18653/v1/2023.eacl-main.57

Yuyun, Latief, A. D., Sampurno, T., Hazriani, Arisha, A. O., & Mushaf. (2023). Next Sentence Prediction: The Impact of Preprocessing Techniques in Deep learning. Proceedings of the 2023 10th International Conference on Computer, Control, Informatics and Its Applications (IC3INA 2023), 274–278. https://doi.org/10.1109/IC3INA60834.2023.10285805

Evaluasi Efektivitas Model Klasifikasi Sentimen untuk Analisis Opini Publik terhadap Kebijakan Lingkungan Berdasarkan Data Media Sosial Berbahasa Indonesia

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Menu New