Query translation architecture for Malay-English cross-language information retrieval system


Bibliographic Details
Main Authors: Rais, Nurjannaton Hidayah, Abdullah, Muhamad Taufik, Abdul Kadir, Rabiah
Format: Conference or Workshop Item
Language: English
Published: IEEE 2010
Online Access: http://psasir.upm.edu.my/id/eprint/48228/1/Query%20translation%20architecture%20for%20Malay-English%20cross-language%20information%20retrieval%20system.pdf
http://psasir.upm.edu.my/id/eprint/48228/
Description
Summary: This paper discusses research on query translation events in a Malay-English Cross-Language Information Retrieval (CLIR) system. We assume that improving query translation accuracy improves information retrieval performance. Dictionary-based CLIR systems face three main problems: translation ambiguity, compound and phrase handling, and proper name translation. Natural language processing (NLP) techniques such as stemming and Part-of-Speech (POS) tagging are useful in the query translation process. In addition, n-gram matching has been successfully applied in information retrieval (IR) systems for phrase and proper name translation. The proposed query translation architecture, which consists of stemming, POS tagging, and n-gram matching, is useful in CLIR systems as well as in search engine applications.
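
The summary does not say how the n-gram matching step is implemented. As an illustration only, the following minimal Python sketch shows one common form of the idea: a query term that is missing from the dictionary (for example a proper name) is compared with candidate target-language terms by the overlap of their character bigrams, scored with the Dice coefficient. The function names, the bigram size, the 0.5 threshold, and the candidate list are all assumptions made for this sketch and are not taken from the paper.

```python
def char_ngrams(term, n=2):
    """Return the set of character n-grams of a lowercased term."""
    t = term.lower()
    if len(t) < n:
        return {t} if t else set()
    return {t[i:i + n] for i in range(len(t) - n + 1)}


def dice(a, b, n=2):
    """Dice coefficient between the character n-gram sets of two terms."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


def best_match(term, candidates, n=2, threshold=0.5):
    """Return the candidate most similar to `term`, or None if no
    candidate reaches the (hypothetical) similarity threshold."""
    scored = [(dice(term, c, n), c) for c in candidates]
    if not scored:
        return None
    score, candidate = max(scored)
    return candidate if score >= threshold else None


if __name__ == "__main__":
    # Hypothetical target-language index terms, for illustration only.
    index_terms = ["Malaysian", "Indonesia", "Singapore"]
    print(best_match("Malaysia", index_terms))
```

In a fuller pipeline, a step like this would presumably run after stemming and POS tagging, and only for terms the bilingual dictionary cannot translate. That ordering is likewise an assumption rather than something stated in the record.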