Normalization of Long-tail Adverse Drug Reactions in Social Media

Manousogiannis, E.; Mesbah, Sepideh; Bozzon, A; Sips, Robert-Jan; Szlávik, Zoltán; Baez Santamaria, Selene

Normalization of Long-tail Adverse Drug Reactions in Social Media

Conference paper (2020)

Authors

E. Manousogiannis myTomorrows

Sepideh Mesbah Human-Centred Artificial Intelligence

A Bozzon Human-Centred Artificial Intelligence

Robert-Jan Sips myTomorrows

Zoltán Szlávik myTomorrows

Selene Baez Santamaria myTomorrows

Research Group

Human-Centred Artificial Intelligence

To reference this document use:

http://resolver.tudelft.nl/uuid:eec3715e-60ab-4ce9-be99-2d4e9061cf3a

More Info

expand_more

Published Date

2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Research Group

Human-Centred Artificial Intelligence

Abstract

The automatic mapping of Adverse Drug Reaction (ADR) reports from user-generated content to concepts in a controlled medical vocabulary provides valuable insights for monitoring public health. While state-of-the-art deep learning-based sequence classification techniques achieve impressive performance for medical concepts with large amounts of training data, they show their limit with long-tail concepts that have a low number of training samples. The above hinders their adaptability to the changes of layman’s terminology and the constant emergence of new informal medical terms. Our objective in this paper is to tackle the problem of normalizing long-tail ADR mentions in user-generated content. In this paper, we exploit the implicit semantics of rare ADRs for which we have few training samples, in order to detect the most similar class for the given ADR. The evaluation results demonstrate that our proposed approach addresses the limitations of the existing techniques when the amount of training data is limited.

Files

2020.louhi_1.6.pdf

(pdf | 0.348 Mb)

Unknown license

Download not available