Indexing metadata

Enhancing Sentiment and Emotion Classification with LSTM-Based Semi-Supervised Learning


 
Dublin Core PKP Metadata Items Metadata for this Document
 
1. Title Title of document Enhancing Sentiment and Emotion Classification with LSTM-Based Semi-Supervised Learning
 
2. Creator Author's name, affiliation, country Rochmat Husaini; Universitas Pembangunan Nasional Yogyakarta; Indonesia
 
2. Creator Author's name, affiliation, country Nur Heri Cahyana; Universitas Pembangunan Nasional Yogyakarta; Indonesia
 
2. Creator Author's name, affiliation, country Wisnalmawati Wisnalmawati; Universitas Pembangunan Nasional Yogyakarta; Indonesia
 
2. Creator Author's name, affiliation, country Tri Mardiana; Universitas Pembangunan Nasional Yogyakarta; Indonesia
 
2. Creator Author's name, affiliation, country Yuli Fauziah; Universitas Pembangunan Nasional Yogyakarta; Indonesia
 
3. Subject Discipline(s) Artificial Intelligence
 
3. Subject Keyword(s) Semi-supervised Learning; LSTM; Sentiment Analysis
 
4. Description Abstract

Sentiment analysis increasingly relies on semi-supervised learning (SSL) models because they make efficient use of large amounts of unlabeled data. This study used four Indonesian datasets: Sentiment Ridife (sentiment classification), Emotion IndoNLU (emotion classification), Sentiment IndoNLU (sentiment classification), and Hate Speech (offensive content detection). An LSTM model was trained on labeled data and then used to generate pseudo-labels for unlabeled data over three iterations. The quality of the pseudo-labels was evaluated using Random Forest, Logistic Regression, and Support Vector Machine (SVM) classifiers. The LSTM model's effectiveness varied across datasets. On the Sentiment Ridife dataset, LSTM achieved an accuracy of 70.23%, slightly lower than Random Forest but higher than Logistic Regression and SVM. On the Sentiment IndoNLU dataset, LSTM's accuracy was 86.12%, a strong result but slightly below Random Forest and Logistic Regression. On the Emotion IndoNLU dataset, all models performed similarly, while on the Hate Speech dataset LSTM performed well, with an accuracy of 86.49%. The results indicate that LSTM-based SSL can effectively generate pseudo-labels and enhance model performance, but that its effectiveness depends on the dataset and task. This study underscores the need for further research into optimizing pseudo-labeling techniques and exploring advanced NLP models to improve sentiment and emotion analysis in diverse languages.
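
The iterative pseudo-labeling procedure summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a small bag-of-words naive Bayes classifier stands in for the LSTM so that the example runs with the Python standard library alone, and the confidence threshold of 0.6, the class and function names, and the toy English sentences are all assumptions made for illustration. Only the three-iteration loop (train on labeled data, label the unlabeled pool, retrain) reflects what the abstract describes.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesStandIn:
    """Bag-of-words stand-in for the LSTM (assumption); any model
    exposing fit/predict_proba could be dropped into the loop below."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)
        self.class_counts = Counter(labels)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict_proba(self, text):
        total_docs = sum(self.class_counts.values())
        log_scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)
            for word in text.lower().split():
                if word in self.vocab:  # unseen words are ignored
                    score += math.log((counts[word] + 1) / denom)
            log_scores[label] = score
        # Normalize log scores into probabilities (softmax).
        peak = max(log_scores.values())
        exp = {k: math.exp(v - peak) for k, v in log_scores.items()}
        norm = sum(exp.values())
        return {k: v / norm for k, v in exp.items()}


def pseudo_label(model, labeled, unlabeled, iterations=3, threshold=0.6):
    """Self-training: fit, pseudo-label confident texts, repeat.

    The threshold is a hypothetical choice; the abstract does not say
    how (or whether) low-confidence predictions were filtered.
    """
    train = list(labeled)
    pool = list(unlabeled)
    for _ in range(iterations):
        texts, labels = zip(*train)
        model.fit(texts, labels)
        remaining = []
        for text in pool:
            probs = model.predict_proba(text)
            label, confidence = max(probs.items(), key=lambda kv: kv[1])
            if confidence >= threshold:
                train.append((text, label))  # accept the pseudo-label
            else:
                remaining.append(text)  # retry in the next iteration
        pool = remaining
    # Final fit so the model reflects every accepted pseudo-label.
    texts, labels = zip(*train)
    model.fit(texts, labels)
    return train, pool


# Toy data, invented for illustration only.
labeled = [("good great film", "pos"), ("bad awful film", "neg")]
unlabeled = ["great good story", "awful bad story", "story"]
train, pool = pseudo_label(NaiveBayesStandIn(), labeled, unlabeled)
```

In this sketch, `train` ends up holding the original labeled pairs plus any confidently pseudo-labeled texts, and `pool` holds the texts the model never labeled with enough confidence. The expanded `train` set is the kind of data that a downstream classifier, such as the Random Forest, Logistic Regression, or SVM models named in the abstract, could then be trained and evaluated on.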

 
5. Publisher Organizing agency, location Institut Teknologi Dirgantara Adisutjipto
 
6. Contributor Sponsor(s)
 
7. Date (YYYY-MM-DD) 2025-06-13
 
8. Type Status & genre Peer-reviewed Article
 
8. Type Type
 
9. Format File format PDF, Check Plagiarism PDF
 
10. Identifier Uniform Resource Identifier https://ejournals.itda.ac.id/index.php/compiler/article/view/2965
 
10. Identifier Digital Object Identifier http://dx.doi.org/10.28989/compiler.v14i1.2965
 
11. Source Title; vol., no. (year) Compiler; Vol 14, No 1 (2025): May
 
12. Language English=en en
 
13. Relation Supp. Files check plagiarism (1015KB)
 
14. Coverage Geo-spatial location, chronological period, research sample (gender, age, etc.)
 
15. Rights Copyright and permissions Copyright (c) 2025 Compiler