HurtBERT: Incorporating Lexical Features with BERT for the Detection of Abusive Language
Conference paper   Open access

Anna Koufakou
Workshop on Abusive Language Online (2020)
11-16-2020

Abstract

The detection of abusive or offensive remarks in social texts has received significant attention in research. In several related shared tasks, BERT has been shown to be the state-of-the-art. In this paper, we propose to utilize lexical features derived from a hate lexicon towards improving the performance of BERT in such tasks. We explore different ways to utilize the lexical features in the form of lexicon-based encodings at the sentence level or embeddings at the word level. We provide an extensive dataset evaluation that addresses in-domain as well as cross-domain detection of abusive content to render a complete picture. Our results indicate that our proposed models combining BERT with lexical features help improve over a baseline BERT model in many of our in-domain and cross-domain experiments.
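
As a rough illustration of the sentence-level idea mentioned in the abstract, the sketch below counts lexicon hits per category and appends those counts to a sentence vector. It is only a minimal sketch under stated assumptions: the abstract does not specify how the encodings are built, so the toy lexicon, its category names, and the helper names `sentence_encoding` and `combine` are all hypothetical, and a short list of floats stands in for a real BERT sentence representation.

```python
from collections import Counter

# Hypothetical toy lexicon mapping a term to a category.
# These entries are placeholders, not taken from any real hate lexicon.
TOY_LEXICON = {
    "idiot": "insult",
    "moron": "insult",
    "pig": "animal",
    "rat": "animal",
}

# Fixed category order so that every sentence gets a vector of the same length.
CATEGORIES = sorted(set(TOY_LEXICON.values()))  # ["animal", "insult"]


def sentence_encoding(text):
    """Return one count per lexicon category for the given sentence."""
    # Lowercase, split on whitespace, and strip simple trailing/leading punctuation.
    tokens = [tok.strip(".,!?;:") for tok in text.lower().split()]
    counts = Counter(TOY_LEXICON[tok] for tok in tokens if tok in TOY_LEXICON)
    return [counts.get(category, 0) for category in CATEGORIES]


def combine(sentence_vector, lexicon_vector):
    """Concatenate a sentence vector (assumed to come from BERT) with the lexicon counts."""
    return list(sentence_vector) + list(lexicon_vector)


if __name__ == "__main__":
    encoding = sentence_encoding("You idiot, you rat!")
    # Stand-in for a BERT sentence representation (assumption, not real model output).
    fake_bert_vector = [0.1, -0.3, 0.7]
    print(CATEGORIES, encoding)
    print(combine(fake_bert_vector, encoding))
```

In a real system the combined vector would feed a classification layer; concatenation is used here only because it is the simplest way to join the two feature sets, not because the abstract confirms it.
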
url
https://aclanthology.org/2020.alw-1.5
url
https://doi.org/10.18653/v1/2020.alw-1.5
Published (Version of record) Open
