Publication:
Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models

Placeholder

School / College / Institute

Organizational Unit

Program

KU Authors

Co-Authors

Hürriyetoğlu, Ali

Publication Date

Language

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

This study focuses on hate speech detection in Turkish and Arabic tweets using advanced BERT-based models. Performance metrics demonstrate the models' effectiveness, with the Turkish variant achieving a 71.8% F1 score and the Arabic model a 76.9% F1 score, ranking them fourth and third, respectively, in a competitive leaderboard. Performance enhancements were realized through targeted preprocessing, including emoji translation and user mention exclusion, and thoughtful data balancing approaches. Future directions include refining model accuracy and broadening language support. Our reproducible approach and detailed findings are accessible on GitHub.

Source

Publisher

Association for Computational Linguistics (ACL)

Subject

Speech recognition

Citation

Has Part

Source

Case 2024 - 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events From Text, Proceedings of the Workshop

Book Series Title

Edition

DOI

item.page.datauri

Link

Rights

Copyrights Note

Endorsement

Review

Supplemented By

Referenced By

1

Views

0

Downloads