Query disambiguation approach using triple-filter

In order to effectively deal with structured information, some technical skills are needed. However, most users do not possess these skills. Natural Language Interfaces (NLIs) are therefore built to provide everyday users who lack the needed technical knowledge, with some means of gaining access...

Full description

Saved in:
Bibliographic Details
Main Author: Agaie, Aliyu Isah
Format: Thesis
Language:English
Published: 2017
Online Access:http://psasir.upm.edu.my/id/eprint/68753/1/FSKTM%202018%2011%20-%20IR.pdf
http://psasir.upm.edu.my/id/eprint/68753/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.upm.eprints.68753
record_format eprints
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description In order to effectively deal with structured information, some technical skills are needed. However, most users do not possess these skills. Natural Language Interfaces (NLIs) are therefore built to provide everyday users who lack the needed technical knowledge, with some means of gaining access to the information stored in knowledge bases. They are designed to deal with the natural language articulation of what the user wants, and then transform it into a computer language that specifies how to accomplish it. Each NLI system also has a scope and limitation which everyday users are unaware of. It is therefore understandable that the users may not see errors in their queries or even know how to write appropriate queries (according to the system’s limitation and scope) in order to retrieve the correct information. The effective retrieval of any piece of information depends wholly on the correct mapping of queries made in natural language to machine understandable form. However, most of the existing NLIs are lacking in terms of being able to provide support for users to formulate their queries. Once queries are wrongly formulated, there is tendency for the system to retrieve wrong answers. As a result, a user may have to reformulate his query severally before the required answer is retrieved (if at all). In this thesis therefore, it is proposed that the best way to formulate appropriate queries is by guiding the user through the query writing process and helping the user to resolve ambiguities by providing suggestions that are easy to understand. The proposed approach, referred to as triple-filter query disambiguation approach, has been implemented into a prototype as a proof of concept. The prototype, referred to as QuFA (Query Formulation Assistant), is intended to serve as an upper layer for NLIs. It is equipped with an authoring service that guides the user to write his query, a disambiguation module that resolves ambiguities in order to ascertain the user’s intention and finally, a query rewriting module that transforms the user’s input query into an intermediate query that will suit the underlying search system’s perspective of the user’s question. Extensive experimental evaluations were conducted in order to validate the proposed approach, using the developed prototype. The proposed triple-filter query disambiguation approach was directly compared with the approach in FREyA (Feedback, Refinement and Extended vocabulary Aggregation) that also provides support to users when formulating queries. The evaluation was based on the usability and performance of the approaches. In terms of usability, the results show that the proposed approach has the potential of being more acceptable in the field; and in terms of effectiveness, it also shows a high performance based on precision and recall. The proposed approach helps users to conceive and articulate more effective queries, and facilitates information search activities. The main contributions of this research work include the introduction of an approach that enables users without knowledge of formal computer languages to formulate useful queries while effectively expressing themselves using natural language. The approach utilizes the effectiveness of human-computer dialogue to effectively retrieve desired information from ontologies. The proposed triple-filter disambiguation approach inculcates a learning mechanism that continues to automatically learn from user queries and continuously improves its performance capability. The triple-filter disambiguation algorithm was also developed, along with two documents (terms equivalence catalogue (TEC) and the enhanced concepts store (ECS)) that represent the thesaurus and the lexicon for use with the Mooney Geoquery dataset. All of these are available for use by other researchers.
format Thesis
author Agaie, Aliyu Isah
spellingShingle Agaie, Aliyu Isah
Query disambiguation approach using triple-filter
author_facet Agaie, Aliyu Isah
author_sort Agaie, Aliyu Isah
title Query disambiguation approach using triple-filter
title_short Query disambiguation approach using triple-filter
title_full Query disambiguation approach using triple-filter
title_fullStr Query disambiguation approach using triple-filter
title_full_unstemmed Query disambiguation approach using triple-filter
title_sort query disambiguation approach using triple-filter
publishDate 2017
url http://psasir.upm.edu.my/id/eprint/68753/1/FSKTM%202018%2011%20-%20IR.pdf
http://psasir.upm.edu.my/id/eprint/68753/
_version_ 1643839296040337408
spelling my.upm.eprints.687532019-05-29T01:46:25Z http://psasir.upm.edu.my/id/eprint/68753/ Query disambiguation approach using triple-filter Agaie, Aliyu Isah In order to effectively deal with structured information, some technical skills are needed. However, most users do not possess these skills. Natural Language Interfaces (NLIs) are therefore built to provide everyday users who lack the needed technical knowledge, with some means of gaining access to the information stored in knowledge bases. They are designed to deal with the natural language articulation of what the user wants, and then transform it into a computer language that specifies how to accomplish it. Each NLI system also has a scope and limitation which everyday users are unaware of. It is therefore understandable that the users may not see errors in their queries or even know how to write appropriate queries (according to the system’s limitation and scope) in order to retrieve the correct information. The effective retrieval of any piece of information depends wholly on the correct mapping of queries made in natural language to machine understandable form. However, most of the existing NLIs are lacking in terms of being able to provide support for users to formulate their queries. Once queries are wrongly formulated, there is tendency for the system to retrieve wrong answers. As a result, a user may have to reformulate his query severally before the required answer is retrieved (if at all). In this thesis therefore, it is proposed that the best way to formulate appropriate queries is by guiding the user through the query writing process and helping the user to resolve ambiguities by providing suggestions that are easy to understand. The proposed approach, referred to as triple-filter query disambiguation approach, has been implemented into a prototype as a proof of concept. The prototype, referred to as QuFA (Query Formulation Assistant), is intended to serve as an upper layer for NLIs. It is equipped with an authoring service that guides the user to write his query, a disambiguation module that resolves ambiguities in order to ascertain the user’s intention and finally, a query rewriting module that transforms the user’s input query into an intermediate query that will suit the underlying search system’s perspective of the user’s question. Extensive experimental evaluations were conducted in order to validate the proposed approach, using the developed prototype. The proposed triple-filter query disambiguation approach was directly compared with the approach in FREyA (Feedback, Refinement and Extended vocabulary Aggregation) that also provides support to users when formulating queries. The evaluation was based on the usability and performance of the approaches. In terms of usability, the results show that the proposed approach has the potential of being more acceptable in the field; and in terms of effectiveness, it also shows a high performance based on precision and recall. The proposed approach helps users to conceive and articulate more effective queries, and facilitates information search activities. The main contributions of this research work include the introduction of an approach that enables users without knowledge of formal computer languages to formulate useful queries while effectively expressing themselves using natural language. The approach utilizes the effectiveness of human-computer dialogue to effectively retrieve desired information from ontologies. The proposed triple-filter disambiguation approach inculcates a learning mechanism that continues to automatically learn from user queries and continuously improves its performance capability. The triple-filter disambiguation algorithm was also developed, along with two documents (terms equivalence catalogue (TEC) and the enhanced concepts store (ECS)) that represent the thesaurus and the lexicon for use with the Mooney Geoquery dataset. All of these are available for use by other researchers. 2017-12 Thesis NonPeerReviewed text en http://psasir.upm.edu.my/id/eprint/68753/1/FSKTM%202018%2011%20-%20IR.pdf Agaie, Aliyu Isah (2017) Query disambiguation approach using triple-filter. PhD thesis, Universiti Putra Malaysia.
score 13.211869