Semantic-based question answering framework for fuzzy factoid answer from Thai texts

Text is an important source of human knowledge. A question-answering system retrieves facts from a knowledge source and returns an answer to the user. Translating text into a knowledge base is a challenging and complicated process. Thai text can be a form of character stream w...
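
The full abstract in this record describes a final step in which answers are extracted from semantic frames using fuzzy matching. The following is a minimal sketch of that idea, not the thesis's implementation: the frame slots (`predicate`, `agent`, `object`, `quantity`), the English sample data, the `answer` function, and the 0.8 threshold are all hypothetical, and the standard library's `difflib.SequenceMatcher` stands in for whatever similarity measure the author actually used.

```python
from difflib import SequenceMatcher

# Hypothetical semantic frames standing in for the knowledge base.
# The slot names and English values are illustrative assumptions only.
FRAMES = [
    {"predicate": "have", "agent": "school", "object": "students", "quantity": "1200"},
    {"predicate": "grow", "agent": "farmer", "object": "jasmine rice", "quantity": None},
]


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def answer(question_frame, frames, slot, threshold=0.8):
    """Return the requested slot of the best fuzzy-matching frame, or None.

    question_frame holds the slots the question specifies; each stored
    frame is scored by the mean similarity over those slots.
    """
    best_score, best_frame = 0.0, None
    for frame in frames:
        scores = [
            similarity(value, frame.get(key) or "")
            for key, value in question_frame.items()
        ]
        score = sum(scores) / len(scores)
        if score > best_score:
            best_score, best_frame = score, frame
    if best_frame is None or best_score < threshold:
        return None  # no frame is close enough to the question
    return best_frame.get(slot)


# A "How many" question whose wording only approximately matches the stored fact.
question = {"predicate": "have", "agent": "schol", "object": "student"}
print(answer(question, FRAMES, "quantity"))
```

Scoring by mean slot similarity lets a misspelled or inflected term still retrieve the intended frame, while the threshold keeps unrelated questions from returning an answer.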

Bibliographic Details
Main Author: Kongwan, Authapon
Format: Thesis
Language: English
Published: 2024
Subjects: P Philology. Linguistics
Online Access: https://etd.uum.edu.my/11490/1/depositpermission.pdf
https://etd.uum.edu.my/11490/2/s900995_01.pdf
https://etd.uum.edu.my/11490/
id my.uum.etd.11490
record_format eprints
spelling my.uum.etd.11490 2025-01-06T04:15:18Z https://etd.uum.edu.my/11490/ Semantic-based question answering framework for fuzzy factoid answer from Thai texts Kongwan, Authapon P Philology. Linguistics 2024 Thesis NonPeerReviewed text en https://etd.uum.edu.my/11490/1/depositpermission.pdf text en https://etd.uum.edu.my/11490/2/s900995_01.pdf Kongwan, Authapon (2024) Semantic-based question answering framework for fuzzy factoid answer from Thai texts. Doctoral thesis, Universiti Utara Malaysia.
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Electronic Theses
url_provider http://etd.uum.edu.my/
language English
topic P Philology. Linguistics
spellingShingle P Philology. Linguistics
Kongwan, Authapon
Semantic-based question answering framework for fuzzy factoid answer from Thai texts
description Text is an important source of human knowledge. A question-answering system retrieves facts from a knowledge source and returns an answer to the user. Translating text into a knowledge base is a challenging and complicated process. Thai text can take the form of a continuous character stream, written without punctuation or markers to separate words or sentences within a paragraph. This research aims to develop a semantic-based question-answering framework that handles fuzzy factoids and uses Thai text as its knowledge source. In building a Thai question-answering system, Thai morphological analysis is an important component for processing Thai text. Ellipsis and anaphora resolution is also needed to construct complete facts from Thai text. The Thai semantic parser is the core component: it constructs the knowledge base by extracting facts from Thai text into a semantic frame structure. The methodology of this research is divided into four steps. The first is to build accurate Thai morphological analysis, comprising Thai word segmentation and Thai EDU segmentation. The second is to develop ellipsis and anaphora resolution for Thai text so that each segmented Thai EDU expresses a complete fact. The third is to develop the semantic parser, which builds the knowledge base by transforming Thai text into a semantic frame representation. The fourth is to develop answer extraction for the question-answering system, using fuzzy matching to handle fuzzy factoids. With this pipeline, the semantic-based question-answering system achieves high precision and recall, 0.9892 and 0.9484 respectively. In conclusion, anaphora and ellipsis resolution are crucial for achieving precise semantic construction, while fuzzy matching significantly enhances answer extraction recall. Together, these components are essential for building robust "What" and "How many" question-answering systems.
format Thesis
author Kongwan, Authapon
title Semantic-based question answering framework for fuzzy factoid answer from Thai texts
publishDate 2024
url https://etd.uum.edu.my/11490/1/depositpermission.pdf
https://etd.uum.edu.my/11490/2/s900995_01.pdf
https://etd.uum.edu.my/11490/