Research Outputs

Permanent URI for this communityhttps://hdl.handle.net/20.500.14288/2

Browse

Search Results

Now showing 1 - 1 of 1
  • Placeholder
    Publication
    KUISAIL at SemEval-2020 Task 12: BERT-CNN for offensive speech identification in social media
    (International Committee for Computational Linguistics, 2020) Department of Computer Engineering; N/A; N/A; YĆ¼ret, Deniz; Safaya, Ali; Isentemiz, Moutasem; Faculty Member; PhD Student; Master Student; Department of Computer Engineering; College of Engineering; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; 179996; N/A; N/A
    In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020. We show that combining CNN with BERT is better than using BERT on its own, and we emphasize the importance of utilizing pre-trained language models for downstream tasks. Our system, ranked 4th with macro averaged F1-Score of 0.897 in Arabic, 4th with score of 0.843 in Greek, and 3rd with score of 0.814 in Turkish. Additionally, we present ArabicBERT, a set of pre-trained transformer language models for Arabic that we share with the community.