Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration

The data stored in the data warehouse are mostly coming from different sources. It may be developed using different model or structure for the schema. In order to improve the usability of these data, the process of combining or integrating is needed so that it can provide users with a unified view...

Full description

Saved in:
Bibliographic Details
Main Authors: Kamsuriah, Ahmad, Hea, Khim Chiew, Reduan, Samad
Format: Conference or Workshop Item
Language:English
Published: 2011
Subjects:
Online Access:http://ur.aeu.edu.my/483/1/Intelligent%20Schema%20Integrator%20%28ISI%29%3B%20A%20Tool%20to%20Solve%20the%20Problem%20of%20Naming%20Conflict%20for%20Schema%20Integration.pdf
http://ur.aeu.edu.my/483/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-aeu-eprints.483
record_format eprints
spelling my-aeu-eprints.4832019-06-19T08:18:58Z http://ur.aeu.edu.my/483/ Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration Kamsuriah, Ahmad Hea, Khim Chiew Reduan, Samad TA Engineering (General). Civil engineering (General) The data stored in the data warehouse are mostly coming from different sources. It may be developed using different model or structure for the schema. In order to improve the usability of these data, the process of combining or integrating is needed so that it can provide users with a unified view or a global view of these data. The most important issue in data integration is the schema integration: that is to solve the problem of “how can equivalent real-world entities from multiple data sources be matched up?” This is referred to as entity identification process. Terms may be given a different interpretation at different sources by different people. For example, how can data analyst be sure that customer id in one database and cust number in another refer to the same entity? In this paper, a tool which is called an Intelligent Schema Integrator (ISI) is built to increase the uses of data from the data warehouse and to make the process more simple, systematic and impressive. ISI is an intelligent tool which can be used to integrate two different schemas from different sources into a unified schema (global schema). ISI is developed to solve the problems of naming conflict which are homonym conflict and synonym conflict. Homonym conflict means the same element name is used to represent different concept. Synonym conflict means different element name is used to represent the same concept. Thesaurus is used to get the meaning of each element concept and compares it with the other concept. An interface is built to allow the user to choose which elements are going to be renamed or removed, if there are occurrences of homonym and synonym conflicts in the schemas. These are the intelligence features built for ISI. The methodology used in this study consists of 4 phases: Design the Input and Output, Extraction, Comparison, and Integration. The development of this tool is an important direction for more efficient and effective implementation of data integration in data warehousing. Keywords: schema integration, homonym conflict, synonym conflict, naming conflict 2011-07-17 Conference or Workshop Item NonPeerReviewed text en http://ur.aeu.edu.my/483/1/Intelligent%20Schema%20Integrator%20%28ISI%29%3B%20A%20Tool%20to%20Solve%20the%20Problem%20of%20Naming%20Conflict%20for%20Schema%20Integration.pdf Kamsuriah, Ahmad and Hea, Khim Chiew and Reduan, Samad (2011) Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration. In: 2011 International Conference on Electrical Engineering and Informatics, Bandung, Indonesia.
institution Asia e University
building AEU Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Asia e University
content_source AEU University Repository
url_provider http://ur.aeu.edu.my/
language English
topic TA Engineering (General). Civil engineering (General)
spellingShingle TA Engineering (General). Civil engineering (General)
Kamsuriah, Ahmad
Hea, Khim Chiew
Reduan, Samad
Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
description The data stored in the data warehouse are mostly coming from different sources. It may be developed using different model or structure for the schema. In order to improve the usability of these data, the process of combining or integrating is needed so that it can provide users with a unified view or a global view of these data. The most important issue in data integration is the schema integration: that is to solve the problem of “how can equivalent real-world entities from multiple data sources be matched up?” This is referred to as entity identification process. Terms may be given a different interpretation at different sources by different people. For example, how can data analyst be sure that customer id in one database and cust number in another refer to the same entity? In this paper, a tool which is called an Intelligent Schema Integrator (ISI) is built to increase the uses of data from the data warehouse and to make the process more simple, systematic and impressive. ISI is an intelligent tool which can be used to integrate two different schemas from different sources into a unified schema (global schema). ISI is developed to solve the problems of naming conflict which are homonym conflict and synonym conflict. Homonym conflict means the same element name is used to represent different concept. Synonym conflict means different element name is used to represent the same concept. Thesaurus is used to get the meaning of each element concept and compares it with the other concept. An interface is built to allow the user to choose which elements are going to be renamed or removed, if there are occurrences of homonym and synonym conflicts in the schemas. These are the intelligence features built for ISI. The methodology used in this study consists of 4 phases: Design the Input and Output, Extraction, Comparison, and Integration. The development of this tool is an important direction for more efficient and effective implementation of data integration in data warehousing. Keywords: schema integration, homonym conflict, synonym conflict, naming conflict
format Conference or Workshop Item
author Kamsuriah, Ahmad
Hea, Khim Chiew
Reduan, Samad
author_facet Kamsuriah, Ahmad
Hea, Khim Chiew
Reduan, Samad
author_sort Kamsuriah, Ahmad
title Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
title_short Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
title_full Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
title_fullStr Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
title_full_unstemmed Intelligent Schema Integrator (ISI): A Tool to Solve the Problem of Naming Conflict for Schema Integration
title_sort intelligent schema integrator (isi): a tool to solve the problem of naming conflict for schema integration
publishDate 2011
url http://ur.aeu.edu.my/483/1/Intelligent%20Schema%20Integrator%20%28ISI%29%3B%20A%20Tool%20to%20Solve%20the%20Problem%20of%20Naming%20Conflict%20for%20Schema%20Integration.pdf
http://ur.aeu.edu.my/483/
_version_ 1644539717171019776
score 13.160551