HurtBERT: Incorporating Lexical Features with BERT for the Detection of Abusive Language

Anna Koufakou

doi:10.18653/v1/2020.alw-1.5

Conference paper

Open access

Workshop on Abusive Language Online (2020–2020)

11-16-2020

DOI: https://doi.org/10.18653/v1/2020.alw-1.5

Abstract

The detection of abusive or offensive remarks in social texts has received significant attention in research. In several related shared tasks, BERT has been shown to be the state-of-the-art. In this paper, we propose to utilize lexical features derived from a hate lexicon towards improving the performance of BERT in such tasks. We explore different ways to utilize the lexical features in the form of lexicon-based encodings at the sentence level or embeddings at the word level. We provide an extensive dataset evaluation that addresses in-domain as well as cross-domain detection of abusive content to render a complete picture. Our results indicate that our proposed models combining BERT with lexical features help improve over a baseline BERT model in many of our in-domain and cross-domain experiments.
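As a rough illustration of the sentence-level, lexicon-based encoding the abstract mentions, the sketch below counts lexicon hits per category and appends those counts to a sentence representation. Everything in it is an assumption made for illustration: the toy lexicon, its two categories, the function names `lexicon_encoding` and `combine`, and the placeholder sentence vector (which stands in for a BERT output). It is not the authors' implementation, and the abstract does not state how the encoding is combined with BERT.

```python
from collections import Counter
from typing import List

# Hypothetical toy lexicon mapping a word to a category.
# This is NOT the hate lexicon used in the paper.
TOY_LEXICON = {
    "idiot": "insult",
    "stupid": "insult",
    "vermin": "animal",
    "pig": "animal",
}

# Fixed category order so every sentence gets a vector of the same shape.
CATEGORIES = sorted(set(TOY_LEXICON.values()))


def lexicon_encoding(sentence: str) -> List[int]:
    """Return the number of lexicon hits per category for one sentence."""
    # Naive whitespace tokenization with punctuation stripping (assumption).
    tokens = [t.strip(".,!?;:\"'").lower() for t in sentence.split()]
    counts = Counter(TOY_LEXICON[t] for t in tokens if t in TOY_LEXICON)
    return [counts[c] for c in CATEGORIES]


def combine(sentence_vector: List[float], sentence: str) -> List[float]:
    """Concatenate a sentence representation with its lexicon encoding.

    `sentence_vector` is a placeholder for a model output such as a BERT
    sentence embedding; simple concatenation is an assumed design choice.
    """
    return list(sentence_vector) + [float(x) for x in lexicon_encoding(sentence)]
```

In this sketch the combined vector would then be passed to a classifier. A word-level variant would instead attach a lexicon-derived vector to each token rather than to the whole sentence.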

Files and links (2)

url

https://aclanthology.org/2020.alw-1.5

url

https://doi.org/10.18653/v1/2020.alw-1.5

Published (Version of record) Open

Details

Title: HurtBERT: Incorporating Lexical Features with BERT for the Detection of Abusive Language
Creators: Anna Koufakou - Florida Gulf Coast University, Department of Computing and Software Engineering
Contributors: Endang Wahyu Pamungkas (Corresponding Author) - University of Turin
Valerio Basile (Corresponding Author) - University of Turin
Viviana Patti (Corresponding Author) - University of Turin
Conference: Workshop on Abusive Language Online (2020–2020)
Identifiers: 99383431277506570
Academic Unit: Department of Computing and Software Engineering
Resource Type: Conference paper
