Publication:
Evolutionary multiobjective feature selection for sentiment analysis

dc.contributor.coauthorPelin Angın
dc.contributor.coauthorDeniz, Ayça
dc.contributor.departmentDepartment of International Relations
dc.contributor.kuauthorAngın, Merih
dc.contributor.kuprofileFaculty Member
dc.contributor.otherDepartment of International Relations
dc.contributor.schoolcollegeinstituteCollege of Administrative Sciences and Economics
dc.contributor.yokid308500
dc.date.accessioned2024-11-09T13:56:18Z
dc.date.issued2021
dc.description.abstractSentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
dc.description.fulltextYES
dc.description.indexedbyWoS
dc.description.indexedbyScopus
dc.description.openaccessYES
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipScientific and Technological Research Council of Turkey (TÜBİTAK)
dc.description.sponsorship2232 International Fellowship for Outstanding Researchers Program
dc.description.versionPublisher version
dc.description.volume9
dc.formatpdf
dc.identifier.doi10.1109/ACCESS.2021.3118961
dc.identifier.embargoNO
dc.identifier.filenameinventorynoIR03261
dc.identifier.issn2169-3536
dc.identifier.linkhttps://doi.org/10.1109/ACCESS.2021.3118961
dc.identifier.quartileQ2
dc.identifier.scopus2-s2.0-85117131191
dc.identifier.urihttps://hdl.handle.net/20.500.14288/4054
dc.identifier.wos711712600001
dc.keywordsBinary classification
dc.keywordsEvolutionary computation
dc.keywordsFeature selection
dc.keywordsMultiobjective optimization
dc.keywordsSentiment analysis
dc.languageEnglish
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.grantno118C309
dc.relation.urihttp://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/10044
dc.sourceIEEE Access
dc.subjectComputer science
dc.subjectEngineering
dc.subjectTelecommunications
dc.subjectInformation systems
dc.titleEvolutionary multiobjective feature selection for sentiment analysis
dc.typeJournal Article
dspace.entity.typePublication
local.contributor.authorid0000-0003-0739-798X
local.contributor.kuauthorAngın, Merih
relation.isOrgUnitOfPublication9fc25a77-75a8-48c0-8878-02d9b71a9126
relation.isOrgUnitOfPublication.latestForDiscovery9fc25a77-75a8-48c0-8878-02d9b71a9126

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
10044.pdf
Size:
6.34 MB
Format:
Adobe Portable Document Format