XCrowd

Combining Explainability and Crowdsourcing to Diagnose Models in Relation Extraction


Abstract

Relation extraction methods are currently dominated by deep neural models, which capture complex statistical patterns yet remain brittle and vulnerable to perturbations in the data and its distribution. Explainability techniques offer a means for understanding such vulnerabilities and thus represent an opportunity to mitigate future errors; however, existing methods are limited to describing what the model 'knows' and fail entirely at explaining what the model does not know. This paper presents a new method for diagnosing model predictions and detecting potential inaccuracies. Our approach breaks the problem down into two components: (i) determining the knowledge the model should possess for an accurate prediction, through human annotations, and (ii) assessing the knowledge the model actually possesses, using explainable AI (XAI) methods. We apply our method to several relation extraction tasks and conduct an empirical study leveraging human specifications of what a model should know and what it does not know. Results show that human workers can accurately specify what a model should know, despite variation across individual specifications; that the alignment between what a model actually knows and what it should know is indeed indicative of model accuracy; and that the unknowns identified through our method allow us to foresee future errors that may or may not have been observed otherwise.
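To make the underlying idea concrete, the sketch below shows one simple way the comparison between human "should-know" specifications and model attributions could be operationalized. It is not the paper's exact formulation: the alignment metric, the function name, the example sentence, the attribution values, and the top-k cutoff are all hypothetical, and the attribution scores are assumed to come from some arbitrary XAI method.

```python
# Minimal sketch (hypothetical, not the paper's method): compare crowd-specified
# "should-know" tokens with token-level attributions from an XAI method.
from typing import Dict, Set


def alignment_score(attributions: Dict[str, float],
                    should_know: Set[str],
                    top_k: int = 3) -> float:
    """Fraction of human-specified 'should-know' tokens that appear among the
    model's top-k most attributed tokens (one possible alignment metric)."""
    top_tokens = {tok for tok, _ in sorted(attributions.items(),
                                           key=lambda kv: kv[1],
                                           reverse=True)[:top_k]}
    if not should_know:
        return 1.0
    return len(should_know & top_tokens) / len(should_know)


# Hypothetical example: predicting the relation "born_in" for one sentence.
attributions = {"Ada": 0.41, "Lovelace": 0.38, "was": 0.02,
                "born": 0.09, "in": 0.03, "London": 0.07}
should_know = {"born", "London"}  # tokens annotators say the model must rely on

score = alignment_score(attributions, should_know, top_k=3)
print(f"alignment = {score:.2f}")  # low alignment -> potential "unknown", flag the prediction
```

In this toy case the model's attribution mass sits on the entity mentions rather than on the relational cue words the annotators specified, so the low alignment score would mark the prediction as a candidate error worth reviewing.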