Staff View: Bootsrapping instance-based ontology matching via unsupervised generation of training samples

Bootsrapping instance-based ontology matching via unsupervised generation of training samples

Training set is the key role player that can improve the performance of any classification task. Different techniques and methods are being applied to generate training set depending on its area of application. Researchers in data science and semantic web community use different kind of training set...

Full description

Saved in:

Bibliographic Details
Main Authors:	Abubakar, Mansir, Hamdan, Hazlina, Mustapha, Norwati, Mohd Aris, Teh Noranis
Format:	Article
Language:	English
Published:	Little Lion Scientific R&D 2019
Online Access:	http://psasir.upm.edu.my/id/eprint/80836/1/BOOTS.pdf http://psasir.upm.edu.my/id/eprint/80836/ http://www.dit.unitn.it/~pavel/OM/articles/14Vol97No6.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.upm.eprints.80836
record_format	eprints
spelling	my.upm.eprints.808362020-10-15T21:39:15Z http://psasir.upm.edu.my/id/eprint/80836/ Bootsrapping instance-based ontology matching via unsupervised generation of training samples Abubakar, Mansir Hamdan, Hazlina Mustapha, Norwati Mohd Aris, Teh Noranis Training set is the key role player that can improve the performance of any classification task. Different techniques and methods are being applied to generate training set depending on its area of application. Researchers in data science and semantic web community use different kind of training sets generated to improve the performances of classifications and information retrieval capability. Operational Training Set Generator (TSG) should always solve a minimum of two issues; (1) it must address the computational cost in producing a reasonable outcome, thereby reducing the computational cost in the whole system. The runtime of TSG is near linear as in blocking approach and (2) it must produce the qualitative training sets. We use LogTfIdf as the cosine similarity function of two given vectors to produce Bag of Words (BoW); the tokenizer is developed to specially take care of delimiters that often come across URIs and other RDF essentials. We evaluated our UTSG on nine cross-domain benchmark ontologies publically available in OAEI website. The results obtained shows that our UTSG outperforms the two baseline TSGs previously developed to address similar problem. Little Lion Scientific R&D 2019 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/80836/1/BOOTS.pdf Abubakar, Mansir and Hamdan, Hazlina and Mustapha, Norwati and Mohd Aris, Teh Noranis (2019) Bootsrapping instance-based ontology matching via unsupervised generation of training samples. Journal of Theoretical and Applied Information Technology, 97 (6). pp. 1832-1844. ISSN 1992-8645; ESSN: 1817-3195 http://www.dit.unitn.it/~pavel/OM/articles/14Vol97No6.pdf
institution	Universiti Putra Malaysia
building	UPM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Putra Malaysia
content_source	UPM Institutional Repository
url_provider	http://psasir.upm.edu.my/
language	English
description	Training set is the key role player that can improve the performance of any classification task. Different techniques and methods are being applied to generate training set depending on its area of application. Researchers in data science and semantic web community use different kind of training sets generated to improve the performances of classifications and information retrieval capability. Operational Training Set Generator (TSG) should always solve a minimum of two issues; (1) it must address the computational cost in producing a reasonable outcome, thereby reducing the computational cost in the whole system. The runtime of TSG is near linear as in blocking approach and (2) it must produce the qualitative training sets. We use LogTfIdf as the cosine similarity function of two given vectors to produce Bag of Words (BoW); the tokenizer is developed to specially take care of delimiters that often come across URIs and other RDF essentials. We evaluated our UTSG on nine cross-domain benchmark ontologies publically available in OAEI website. The results obtained shows that our UTSG outperforms the two baseline TSGs previously developed to address similar problem.
format	Article
author	Abubakar, Mansir Hamdan, Hazlina Mustapha, Norwati Mohd Aris, Teh Noranis
spellingShingle	Abubakar, Mansir Hamdan, Hazlina Mustapha, Norwati Mohd Aris, Teh Noranis Bootsrapping instance-based ontology matching via unsupervised generation of training samples
author_facet	Abubakar, Mansir Hamdan, Hazlina Mustapha, Norwati Mohd Aris, Teh Noranis
author_sort	Abubakar, Mansir
title	Bootsrapping instance-based ontology matching via unsupervised generation of training samples
title_short	Bootsrapping instance-based ontology matching via unsupervised generation of training samples
title_full	Bootsrapping instance-based ontology matching via unsupervised generation of training samples
title_fullStr	Bootsrapping instance-based ontology matching via unsupervised generation of training samples
title_full_unstemmed	Bootsrapping instance-based ontology matching via unsupervised generation of training samples
title_sort	bootsrapping instance-based ontology matching via unsupervised generation of training samples
publisher	Little Lion Scientific R&D
publishDate	2019
url	http://psasir.upm.edu.my/id/eprint/80836/1/BOOTS.pdf http://psasir.upm.edu.my/id/eprint/80836/ http://www.dit.unitn.it/~pavel/OM/articles/14Vol97No6.pdf
_version_	1681490823870414848
score	13.18916

Bootsrapping instance-based ontology matching via unsupervised generation of training samples

Similar Items