Publication:
KUISAIL at SemEval-2020 Task 12: BERT-CNN for offensive speech identification in social media

Placeholder

Organizational Units

Program

KU Authors

Co-Authors

Advisor

Publication Date

2020

Language

English

Type

Conference proceeding

Journal Title

Journal ISSN

Volume Title

Abstract

In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020. We show that combining CNN with BERT is better than using BERT on its own, and we emphasize the importance of utilizing pre-trained language models for downstream tasks. Our system, ranked 4th with macro averaged F1-Score of 0.897 in Arabic, 4th with score of 0.843 in Greek, and 3rd with score of 0.814 in Turkish. Additionally, we present ArabicBERT, a set of pre-trained transformer language models for Arabic that we share with the community.

Description

Source:

14th International Workshops on Semantic Evaluation, SemEval 2020 - co-located 28th International Conference on Computational Linguistics, COLING 2020, Proceedings

Publisher:

International Committee for Computational Linguistics

Keywords:

Subject

Cyberbullying, Hate speech, Social networks

Citation

Endorsement

Review

Supplemented By

Referenced By

Copy Rights Note

0

Views

0

Downloads

View PlumX Details