GenSent: Improving Sentiment Analysis Using Genetic Algorithm-Based Ensemble Optimization

dc.contributor.authorAziz, Roza Hikmat Hama
dc.contributor.authorDimilier, Nazife
dc.date.accessioned2026-02-06T18:27:00Z
dc.date.issued2025
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractSocial media platforms are currently the primary medium of all types of communication from personal interactions, and opinion sharing to the dissemination of important international news. However, the ever-increasing amount of user-generated textual information coupled with the dynamic nature of the language, subtle or hidden nuances in expressions used, and contextual dependencies in text, renders timely and accurate sentiment analysis increasingly challenging. Sentiment analysis is an important task in its own right and is also used as the first step of many other classification tasks such as hate speech and misinformation detection. A significant portion of research on sentiment analysis and opinion mining has concentrated on categorizing social media content into three classifications: positive, negative, or neutral. However, despite their importance across numerous practical domains, the classification of extreme opinions, such as highly negative and highly positive sentiments, has only recently gained attention. To address this gap, we propose a framework, GenSent, a novel genetic algorithm-based optimization framework for sentiment classification. Unlike traditional methods that are often tailored to specific datasets, GenSent provides a versatile framework applicable to diverse sentiment analysis tasks from binary, ternary, and fine-grained 5-point scale classification that represents extreme sentiments as well. Through the use of a diverse pool of classifiers including support vector machines, Na & iuml;ve Bayes, Logistic Regression, Decision Trees, Random Forests, and Stochastic Gradient Descent Algorithms, GenSent effectively builds a robust ensemble without any intervention. The framework is evaluated using binary, ternary, and fine-grained sentiment analysis datasets, namely, SemEval-2017 (Sentiment Analysis in Twitter) task (4A, 4B, and 4C) and Stanford Sentiment Treebank (SST-2 and SST-5). The performance of the proposed framework is compared with other existing well-known methods in the field using the same datasets. Comparative results demonstrate that GenSent outperforms existing methods, achieving significant improvements in sentiment classification across various metrics while reducing the computational complexity.
dc.identifier.doi10.2339/politeknik.1705902
dc.identifier.issn1302-0900
dc.identifier.issn2147-9429
dc.identifier.orcid0000-0002-8861-4132
dc.identifier.scopusqualityN/A
dc.identifier.urihttps://doi.org/10.2339/politeknik.1705902
dc.identifier.urihttps://hdl.handle.net/11129/10752
dc.identifier.wosWOS:001603737500001
dc.identifier.wosqualityQ4
dc.indekslendigikaynakWeb of Science
dc.language.isoen
dc.publisherGazi Univ
dc.relation.ispartofJournal of Polytechnic-Politeknik Dergisi
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260204
dc.subjectsentiment analysis
dc.subjectmachine learning
dc.subjectoptimized ensemble classifier
dc.subjectgenetic optimization
dc.subjectmeta-heuristic algorithms
dc.titleGenSent: Improving Sentiment Analysis Using Genetic Algorithm-Based Ensemble Optimization
dc.typeArticle

Files