Publication:
Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models

dc.contributor.coauthorHürriyetoğlu, Ali
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.kuauthorBarkhordar, Ehsan
dc.contributor.kuauthorTopçu, Işık Sulal
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.date.accessioned2024-12-29T09:36:46Z
dc.date.issued2024
dc.description.abstractThis study focuses on hate speech detection in Turkish and Arabic tweets using advanced BERT-based models. Performance metrics demonstrate the models' effectiveness, with the Turkish variant achieving a 71.8% F1 score and the Arabic model a 76.9% F1 score, ranking them fourth and third, respectively, in a competitive leaderboard. Performance enhancements were realized through targeted preprocessing, including emoji translation and user mention exclusion, and thoughtful data balancing approaches. Future directions include refining model accuracy and broadening language support. Our reproducible approach and detailed findings are accessible on GitHub.
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuEU
dc.description.sponsorshipThis work is supported by the European Research Council Politus Project (ID:101082050) and European Union's HORIZON projects EFRA (ID: 101093026) and ECO-Ready (ID: 101084201).
dc.identifier.isbn979-889176070-7
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85190289452
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22162
dc.keywordsF1 scores
dc.keywordsModeling accuracy
dc.keywordsPerformance enhancements
dc.keywordsPerformance metrices
dc.keywordsSpeech detection
dc.keywordsTurkishs
dc.language.isoeng
dc.publisherAssociation for Computational Linguistics (ACL)
dc.relation.grantnoEuropean Research Council Politus Project
dc.relation.ispartofCase 2024 - 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events From Text, Proceedings of the Workshop
dc.subjectSpeech recognition
dc.titleTeam curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorBarkhordar, Ehsan
local.contributor.kuauthorTopçu, Işık Sulal
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication.latestForDiscovery3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication.latestForDiscovery434c9663-2b11-4e66-9399-c863e2ebae43

Files