Publication: Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models
dc.contributor.coauthor | Hürriyetoğlu, Ali | |
dc.contributor.department | Graduate School of Sciences and Engineering | |
dc.contributor.kuauthor | Barkhordar, Ehsan | |
dc.contributor.kuauthor | Topçu, Işık Sulal | |
dc.contributor.schoolcollegeinstitute | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
dc.date.accessioned | 2024-12-29T09:36:46Z | |
dc.date.issued | 2024 | |
dc.description.abstract | This study focuses on hate speech detection in Turkish and Arabic tweets using advanced BERT-based models. Performance metrics demonstrate the models' effectiveness, with the Turkish variant achieving a 71.8% F1 score and the Arabic model a 76.9% F1 score, ranking them fourth and third, respectively, in a competitive leaderboard. Performance enhancements were realized through targeted preprocessing, including emoji translation and user mention exclusion, and thoughtful data balancing approaches. Future directions include refining model accuracy and broadening language support. Our reproducible approach and detailed findings are accessible on GitHub. | |
dc.description.indexedby | Scopus | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | EU | |
dc.description.sponsorship | This work is supported by the European Research Council Politus Project (ID:101082050) and European Union's HORIZON projects EFRA (ID: 101093026) and ECO-Ready (ID: 101084201). | |
dc.identifier.isbn | 979-889176070-7 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85190289452 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/22162 | |
dc.keywords | F1 scores | |
dc.keywords | Modeling accuracy | |
dc.keywords | Performance enhancements | |
dc.keywords | Performance metrices | |
dc.keywords | Speech detection | |
dc.keywords | Turkishs | |
dc.language.iso | eng | |
dc.publisher | Association for Computational Linguistics (ACL) | |
dc.relation.grantno | European Research Council Politus Project | |
dc.relation.ispartof | Case 2024 - 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events From Text, Proceedings of the Workshop | |
dc.subject | Speech recognition | |
dc.title | Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models | |
dc.type | Conference Proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Barkhordar, Ehsan | |
local.contributor.kuauthor | Topçu, Işık Sulal | |
local.publication.orgunit1 | GRADUATE SCHOOL OF SCIENCES AND ENGINEERING | |
local.publication.orgunit2 | Graduate School of Sciences and Engineering | |
relation.isOrgUnitOfPublication | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 3fc31c89-e803-4eb1-af6b-6258bc42c3d8 | |
relation.isParentOrgUnitOfPublication | 434c9663-2b11-4e66-9399-c863e2ebae43 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 434c9663-2b11-4e66-9399-c863e2ebae43 |