Publication: KUISAIL at SemEval-2020 Task 12: BERT-CNN for offensive speech identification in social media
| dc.conference.date | DEC 12-13, 2020 | |
| dc.conference.location | Barcelona, Spain | |
| dc.conference.organizer | 14th International Workshops on Semantic Evaluation (SemEval) | |
| dc.contributor.department | KUIS AI (Koç University & İş Bank Artificial Intelligence Center) | |
| dc.contributor.facultymember | Yes | |
| dc.contributor.kuauthor | Isentemiz, Moutasem | |
| dc.contributor.kuauthor | Safaya, Ali | |
| dc.contributor.kuauthor | Yüret, Deniz | |
| dc.contributor.schoolcollegeinstitute | Research Center | |
| dc.date.accessioned | 2024-11-09T23:12:31Z | |
| dc.date.issued | 2020 | |
| dc.description.abstract | In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020. We show that combining CNN with BERT is better than using BERT on its own, and we emphasize the importance of utilizing pre-trained language models for downstream tasks. Our system, ranked 4th with macro averaged F1-Score of 0.897 in Arabic, 4th with score of 0.843 in Greek, and 3rd with score of 0.814 in Turkish. Additionally, we present ArabicBERT, a set of pre-trained transformer language models for Arabic that we share with the community. | |
| dc.description.fulltext | Yes | |
| dc.description.harvestedfrom | Manual | |
| dc.description.indexedby | Scopus | |
| dc.description.indexedby | WOS | |
| dc.description.openaccess | YES | |
| dc.description.peerreviewstatus | N/A | |
| dc.description.publisherscope | International | |
| dc.description.readpublish | N/A | |
| dc.description.sponsoredbyTubitakEu | EU | |
| dc.description.studentonlypublication | No | |
| dc.description.studentpublication | Yes | |
| dc.description.version | Post-print | |
| dc.identifier.embargo | No | |
| dc.identifier.filenameinventoryno | IR06867 | |
| dc.identifier.grantno | 714868 | |
| dc.identifier.isbn | 9781952148316 | |
| dc.identifier.quartile | N/A | |
| dc.identifier.scopus | 2-s2.0-85118416740 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14288/9830 | |
| dc.identifier.wos | 001361895500271 | |
| dc.keywords | Computational linguistics | |
| dc.keywords | Convolutional neural networks | |
| dc.keywords | Social networking (online) | |
| dc.keywords | Speech recognition | |
| dc.keywords | Convolutional neural network | |
| dc.keywords | Down-stream | |
| dc.keywords | F1 scores | |
| dc.keywords | Language identification | |
| dc.keywords | Language model | |
| dc.keywords | Offensive languages | |
| dc.keywords | Social media | |
| dc.keywords | Speech identification | |
| dc.keywords | Subtask | |
| dc.keywords | Turkishs | |
| dc.keywords | Semantics | |
| dc.language.iso | eng | |
| dc.publisher | International Committee for Computational Linguistics | |
| dc.relation.affiliation | Koç University | |
| dc.relation.collection | Koç University Institutional Repository | |
| dc.relation.ispartof | Proceedings of the Fourteenth Workshop on Semantic Evaluation | |
| dc.relation.openaccess | Yes | |
| dc.rights | Other | |
| dc.subject | Cyberbullying | |
| dc.subject | Hate speech | |
| dc.subject | Social networks | |
| dc.title | KUISAIL at SemEval-2020 Task 12: BERT-CNN for offensive speech identification in social media | |
| dc.type | Conference Proceeding | |
| dspace.entity.type | Publication | |
| local.contributor.kuauthor | Yüret, Deniz | |
| local.contributor.kuauthor | Safaya, Ali | |
| local.contributor.kuauthor | Isentemiz, Moutasem | |
| relation.isOrgUnitOfPublication | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 77d67233-829b-4c3a-a28f-bd97ab5c12c7 | |
| relation.isParentOrgUnitOfPublication | d437580f-9309-4ecb-864a-4af58309d287 | |
| relation.isParentOrgUnitOfPublication.latestForDiscovery | d437580f-9309-4ecb-864a-4af58309d287 |
Files
Original bundle
1 - 1 of 1
