Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

Balayn, A.M.A.; Mavridis, P.; Bozzon, A.; Timmermans, B.F.L.; Szlávik, Z.

Characterising and Mitigating Aggregation-Bias in Crowdsourced Toxicity Annotations

Conference paper (2018)

Authors

A.M.A. Balayn Student, IBM Nederland

P. Mavridis Web Information Systems -

A. Bozzon Web Information Systems -

B.F.L. Timmermans IBM Nederland

Z. Szlávik IBM Nederland

Research Group

Web Information Systems () (TU Delft)

Crowdsourcing Dataset bias Machine Learning fairness Annotation aggregation

To reference this document use:

http://resolver.tudelft.nl/uuid:43f84e8d-71f7-4379-84a6-b6cac86253e5

More Info

expand_more

Published Date

2018

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Software Technology

Research Group

Web Information Systems

Abstract

Training machine learning (ML) models for natural language processing usually requires large amount of data, often acquired through crowdsourcing. The way this data is collected and aggregated can have an effect on the outputs of the trained model such as ignoring the labels which differ from the majority. In this paper we investigate how label aggregation can bias the ML results towards certain data samples and propose a methodology to highlight and mitigate this bias. Although our work is applicable to any kind of label aggregation for data subject to multiple interpretations, we focus on the effects of the bias introduced by majority voting on toxicity prediction over sentences. Our preliminary results point out that we can mitigate the majority-bias and get increased prediction accuracy for the minority opinions if we take into account the different labels from annotators when training adapted models, rather than rely on the aggregated labels.

Files

Paper7.pdf

(pdf | 0.392 Mb)

Unknown license