Improving Long Content Question Generation with Multi-level Passage Encoding

Abstract

Generating questions that can be answered with word spans from a passage is an important natural language task with applications in education, question answering, and conversational systems. Existing question generation models often produce questions that are unrelated to the context passage and answer span. In this paper, we first analyze questions generated by a common baseline model and find that over half of the lowest-rated questions are semantically unrelated to the context passage. We then investigate how humans ask factual questions and show that, most often, they reformulate the target sentence together with information from the context passage. Based on these findings, we propose a neural question generation (QG) model with multi-level passage encoding and gated attention fusion that overcomes these shortcomings. Our experiments demonstrate that our model outperforms existing state-of-the-art seq2seq QG models.
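
For readers unfamiliar with gated attention fusion, the sketch below illustrates one common formulation: each sentence-level token state attends over passage-level states, and a learned sigmoid gate controls how much attended passage context is mixed into the token representation. The module name, layer shapes, and gating equation here are illustrative assumptions; the abstract does not specify the paper's exact architecture.

```python
import torch
import torch.nn as nn


class GatedAttentionFusion(nn.Module):
    """Hypothetical sketch: fuse sentence-level states with
    passage-level context via attention and a learned gate."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.attn = nn.Linear(hidden_size, hidden_size, bias=False)
        self.gate = nn.Linear(2 * hidden_size, hidden_size)

    def forward(self, sent_states: torch.Tensor,
                passage_states: torch.Tensor) -> torch.Tensor:
        # sent_states:    (batch, sent_len, hidden)
        # passage_states: (batch, pass_len, hidden)
        # Attention scores between sentence tokens and passage tokens.
        scores = torch.bmm(self.attn(sent_states),
                           passage_states.transpose(1, 2))
        weights = torch.softmax(scores, dim=-1)
        # Attended passage context vector for each sentence token.
        context = torch.bmm(weights, passage_states)
        # Gate decides, per dimension, how much passage context to mix in.
        g = torch.sigmoid(self.gate(torch.cat([sent_states, context], dim=-1)))
        return g * sent_states + (1.0 - g) * context
```

In this sketch the gate lets the model keep a token's sentence-level encoding where the passage adds little, and pull in passage context where it is relevant, which matches the abstract's motivation of grounding questions in both the target sentence and the surrounding passage.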

Files

Zhu2021_Chapter_ImprovingLongC... (pdf, 0.642 MB)