Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval

This research proposed a Cross Language Information Retrieval (CLIR)method based on specific domain/ontology using specific concepts for disambiguating translation of the query. This research experiment the use of specific domain/ontology: Quran, written in English and Malay languages as a bilingual...

Full description

Saved in:
Bibliographic Details
Main Author: Yahya, Zulaini
Format: Thesis
Language:English
Published: 2012
Online Access:http://psasir.upm.edu.my/id/eprint/31652/1/FSKTM%202012%2027R.pdf
http://psasir.upm.edu.my/id/eprint/31652/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.upm.eprints.31652
record_format eprints
spelling my.upm.eprints.316522015-02-04T07:31:06Z http://psasir.upm.edu.my/id/eprint/31652/ Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval Yahya, Zulaini This research proposed a Cross Language Information Retrieval (CLIR)method based on specific domain/ontology using specific concepts for disambiguating translation of the query. This research experiment the use of specific domain/ontology: Quran, written in English and Malay languages as a bilingual parallel-corpora and specific concepts: Quran, as a resource for cross-language query translation along with dictionary-based translation. This study evaluates the effectiveness of query translation using dictionary based and ontology for CLIR system. For translation, we use two basic approaches as benchmark: 1) first translation listed in the dictionary; and 2)all translation candidates listed in the dictionary. For the proposed CLIR method, we use three approaches: 1) based on verse list; 2) based on concepts similarity; and 3) based on concepts expansion. For concepts matching before and after query translation, we used two approaches: 1)query concepts; and 2) translation concepts. The experimental result shows that retrieval performance using dictionary based is lower than monolingual either in English or Malay document collections. Direct translation involved in returning many possibility results which can affect the decreasing in document retrieval performance either in English or Malay document collections. For the proposed CLIR method, performance of CLIR query translation based on verse list approach, concepts similarity approach and concepts expansion approach, obtained a better result either using query concepts or translation concepts matching compared to dictionary-based for English document collections but not in Malay document collections. In Malay document collections the retrieval performance only improved in concepts expansion approach. English language has a better structure compared to Malay language which affects the retrieval performance. A single Malay word may have a variety of meaning, not only by the word itself but also depends on the meaning of the verse or chapter. This is one of the reasons why retrieval performance decreasing in Malay document collections. 2012-11 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/31652/1/FSKTM%202012%2027R.pdf Yahya, Zulaini (2012) Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval. Masters thesis, Universiti Putra Malaysia.
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description This research proposed a Cross Language Information Retrieval (CLIR)method based on specific domain/ontology using specific concepts for disambiguating translation of the query. This research experiment the use of specific domain/ontology: Quran, written in English and Malay languages as a bilingual parallel-corpora and specific concepts: Quran, as a resource for cross-language query translation along with dictionary-based translation. This study evaluates the effectiveness of query translation using dictionary based and ontology for CLIR system. For translation, we use two basic approaches as benchmark: 1) first translation listed in the dictionary; and 2)all translation candidates listed in the dictionary. For the proposed CLIR method, we use three approaches: 1) based on verse list; 2) based on concepts similarity; and 3) based on concepts expansion. For concepts matching before and after query translation, we used two approaches: 1)query concepts; and 2) translation concepts. The experimental result shows that retrieval performance using dictionary based is lower than monolingual either in English or Malay document collections. Direct translation involved in returning many possibility results which can affect the decreasing in document retrieval performance either in English or Malay document collections. For the proposed CLIR method, performance of CLIR query translation based on verse list approach, concepts similarity approach and concepts expansion approach, obtained a better result either using query concepts or translation concepts matching compared to dictionary-based for English document collections but not in Malay document collections. In Malay document collections the retrieval performance only improved in concepts expansion approach. English language has a better structure compared to Malay language which affects the retrieval performance. A single Malay word may have a variety of meaning, not only by the word itself but also depends on the meaning of the verse or chapter. This is one of the reasons why retrieval performance decreasing in Malay document collections.
format Thesis
author Yahya, Zulaini
spellingShingle Yahya, Zulaini
Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
author_facet Yahya, Zulaini
author_sort Yahya, Zulaini
title Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
title_short Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
title_full Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
title_fullStr Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
title_full_unstemmed Quranic ontology for resolving query translation disambiguation in English-Malay cross-language information retrieval
title_sort quranic ontology for resolving query translation disambiguation in english-malay cross-language information retrieval
publishDate 2012
url http://psasir.upm.edu.my/id/eprint/31652/1/FSKTM%202012%2027R.pdf
http://psasir.upm.edu.my/id/eprint/31652/
_version_ 1643830384447717376
score 13.211869