Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models

Publication:
Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models

Departments

Organizational Unit

Graduate School of Sciences and Engineering

School / College / Institute

Organizational Unit

GRADUATE SCHOOL OF SCIENCES AND ENGINEERING

Upper Org Unit

KU-Authors

Barkhordar, Ehsan

Topçu, Işık Sulal

Co-Authors

Hürriyetoğlu, Ali

Date

2024

Type

Conference Proceeding

Embargo Status

N/A

Abstract

This study focuses on hate speech detection in Turkish and Arabic tweets using advanced BERT-based models. Performance metrics demonstrate the models' effectiveness, with the Turkish variant achieving a 71.8% F1 score and the Arabic model a 76.9% F1 score, ranking them fourth and third, respectively, in a competitive leaderboard. Performance enhancements were realized through targeted preprocessing, including emoji translation and user mention exclusion, and thoughtful data balancing approaches. Future directions include refining model accuracy and broadening language support. Our reproducible approach and detailed findings are accessible on GitHub.

Publisher

Association for Computational Linguistics (ACL)

Subject

Speech recognition

Source

Case 2024 - 7th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events From Text, Proceedings of the Workshop

URI

https://hdl.handle.net/20.500.14288/22162

Rights

N/A

Collections

Publications without Fulltext

Full item page

Publication: Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models

Departments

School / College / Institute

Program

KU-Authors

KU Authors

Co-Authors

Editor & Affiliation

Compiler & Affiliation

Translator

Other Contributor

Date

Language

Type

Embargo Status

Journal Title

Journal ISSN

Volume Title

Alternative Title

Abstract

Source

Publisher

Subject

Citation

Has Part

Source

Book Series Title

Edition

DOI

URI

item.page.datauri

Link

Rights

Copyrights Note

Collections

Endorsement

Review

Supplemented By

Referenced By

Related Goal

3

Views

0

Downloads

Publication:
Team curie at HSD-2Lang 2024:hate speech detection in Turkish and Arabic tweets using BERT-based models