FeQA: Fusion and enhancement of multi-source knowledge on question answering
In the question answering task, we usually need to reason the answer according to the question. Question answering tasks mostly use the pretrained language model to obtain the semantic embedding of questions and choices to predict answers, while the pretrained language model cannot accurately repres...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
Elsevier Ltd
2023
|
Subjects: | |
Online Access: | http://eprints.utm.my/107040/ http://dx.doi.org/10.1016/j.eswa.2023.120286 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In the question answering task, we usually need to reason the answer according to the question. Question answering tasks mostly use the pretrained language model to obtain the semantic embedding of questions and choices to predict answers, while the pretrained language model cannot accurately represent the potential relationship between entities in the question. Therefore, researchers introduce knowledge graph to realize the reasoning from question entity to answer entity. However, the limitation of knowledge graph lies in the lack of background information of entities, which may lead to wrong reasoning. To solve the above problems, a new question answering system model FeQA is proposed, which adopts large-scale pretrained language model and knowledge graph. The former uses dual-attention mechanism to enhance the semantics of questions by using Wiktionary and other question answering datasets, while the latter uses graph neural network to infer entities. During the interaction of two modal knowledge, the former provides the basis for the reasoning of nodes in the latter, and the latter provides structured knowledge for the former. After several reasoning iterations, the final answer is obtained by using the knowledge of the two modes. The experimental results on the CommonsenseQA and OpenBookQA datasets show that the performance of this model is better than that of the baseline models. Ablation experiments show that the components and knowledge sources included in this model play an important role in the effect of question and answering task. Extended experiments show the model has good application capability. |
---|