A Threshold-Based Combination of String and Semantic Similarity Measures for Record Linkage
Because integrated data carry richer information, integrating different data sources is a key step in most data warehousing and mining projects. One of the principal challenges in integrating databases is duplication: in different databases, one entity may be available in differ...
Main Author: | |
---|---|
Format: | Thesis |
Published: | 2011 |
Online Access: | http://psasir.upm.edu.my/id/eprint/19638/ |