Query translation for multilingual content with semantic technique

Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is v...

Full description

Saved in:
Bibliographic Details
Main Authors: Norita Md Norwawi,, Sundresan a/l Perumal,, Emran Huda,, Waka Jeng,
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2020
Online Access:http://journalarticle.ukm.my/15907/1/9.pdf
http://journalarticle.ukm.my/15907/
http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-ukm.journal.15907
record_format eprints
spelling my-ukm.journal.159072020-12-06T09:12:51Z http://journalarticle.ukm.my/15907/ Query translation for multilingual content with semantic technique Norita Md Norwawi, Sundresan a/l Perumal, Emran Huda, Waka Jeng, Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is very challenging due to the polysemy problem. Different linguistic nature of the languages will lead to ambiguity of meaning subsequently user’s true intention could be misinterpreted. This paper presents a semantic technique on query translation for a multilingual knowledge repository to improve the query processing. Offline translated documents or parallel corpora in English, Arabic, and Malay language including Jawi text was used as the data. Set of keywords were constructed preidentified by expert related to prophetic food. These keywords were annotated with the relevant Quranic verses, Hadith texts, Manuscript text images and scientific article determined by expert. The synonym and context-based translation was annotated together with the specific keyword. A query will do a three-way pattern match based on the keyword indexing list that link to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept using sources are from al-Quran, Hadith, classical manuscript, and scientific articles verified by experts to ensure the content authenticity and integrity. Penerbit Universiti Kebangsaan Malaysia 2020-09 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/15907/1/9.pdf Norita Md Norwawi, and Sundresan a/l Perumal, and Emran Huda, and Waka Jeng, (2020) Query translation for multilingual content with semantic technique. Sains Malaysiana, 49 (9). pp. 2113-2118. ISSN 0126-6039 http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
institution Universiti Kebangsaan Malaysia
building Tun Sri Lanang Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Kebangsaan Malaysia
content_source UKM Journal Article Repository
url_provider http://journalarticle.ukm.my/
language English
description Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is very challenging due to the polysemy problem. Different linguistic nature of the languages will lead to ambiguity of meaning subsequently user’s true intention could be misinterpreted. This paper presents a semantic technique on query translation for a multilingual knowledge repository to improve the query processing. Offline translated documents or parallel corpora in English, Arabic, and Malay language including Jawi text was used as the data. Set of keywords were constructed preidentified by expert related to prophetic food. These keywords were annotated with the relevant Quranic verses, Hadith texts, Manuscript text images and scientific article determined by expert. The synonym and context-based translation was annotated together with the specific keyword. A query will do a three-way pattern match based on the keyword indexing list that link to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept using sources are from al-Quran, Hadith, classical manuscript, and scientific articles verified by experts to ensure the content authenticity and integrity.
format Article
author Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
spellingShingle Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
Query translation for multilingual content with semantic technique
author_facet Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
author_sort Norita Md Norwawi,
title Query translation for multilingual content with semantic technique
title_short Query translation for multilingual content with semantic technique
title_full Query translation for multilingual content with semantic technique
title_fullStr Query translation for multilingual content with semantic technique
title_full_unstemmed Query translation for multilingual content with semantic technique
title_sort query translation for multilingual content with semantic technique
publisher Penerbit Universiti Kebangsaan Malaysia
publishDate 2020
url http://journalarticle.ukm.my/15907/1/9.pdf
http://journalarticle.ukm.my/15907/
http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
_version_ 1685579813739298816
score 13.160551