A hybrid approach for learning concept hierarchy from Malay text using artificial immune network

A concept hierarchy is an integral part of an ontology but it is expensive and time consuming to build. Motivated by this, many unsupervised learning methods have been proposed to (semi-) automatically develop a concept hierarchy. A significant work is the Guided Agglomerative Hierarchical Clusterin...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad Nazri, Mohd. Zakree, Shamsudin, Siti Mariyam, Abu Bakar, Azuraliza, Abdullah, Salwani
Format: Article
Published: Springer 2010
Subjects:
Online Access:http://eprints.utm.my/id/eprint/22797/
http://link.springer.com/article/10.1007%2Fs11047-010-9228-7
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.22797
record_format eprints
spelling my.utm.227972018-03-15T01:38:41Z http://eprints.utm.my/id/eprint/22797/ A hybrid approach for learning concept hierarchy from Malay text using artificial immune network Ahmad Nazri, Mohd. Zakree Shamsudin, Siti Mariyam Abu Bakar, Azuraliza Abdullah, Salwani QA75 Electronic computers. Computer science A concept hierarchy is an integral part of an ontology but it is expensive and time consuming to build. Motivated by this, many unsupervised learning methods have been proposed to (semi-) automatically develop a concept hierarchy. A significant work is the Guided Agglomerative Hierarchical Clustering (GAHC) which relies on linguistic patterns (i.e., hypernyms) to guide the clustering process. However, GAHC still relies on contextual features to build the concept hierarchy, thus data sparsity still remains an issue in GAHC. Artificial Immune Systems are known for robustness, noise tolerance and adaptability. Thus, an extension to the GAHC is proposed by hybridizing it with Artificial Immune Network (aiNet) which we call Guided Clustering and aiNet for Learning Concept Hierarchy (GCAINY). In this paper, we have tested GCAINY using two parameter settings. The first parameter setting is obtained from the literature as a baseline parameter setting and second is by automatic parameter tuning using Particle Swarm Optimization (PSO). The effectiveness of the GCAINY is evaluated on three data sets. For further validations, a comparison between GCAINY and GAHC has been conducted and with statistical tests showing that GCAINY increases the quality of the induced concept hierarchy. The results reveal that the parameters value found by using PSO significantly produce better concept hierarchy than the vanilla parameter. Thus it can be concluded that the proposed approach has greater ability to be used in the field of ontology learning. Springer 2010 Article PeerReviewed Ahmad Nazri, Mohd. Zakree and Shamsudin, Siti Mariyam and Abu Bakar, Azuraliza and Abdullah, Salwani (2010) A hybrid approach for learning concept hierarchy from Malay text using artificial immune network. Natural Computing, 10 (1). 275- 304. ISSN 1567-7818 http://link.springer.com/article/10.1007%2Fs11047-010-9228-7 DOI:10.1007/s11047-010-9228-7
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Ahmad Nazri, Mohd. Zakree
Shamsudin, Siti Mariyam
Abu Bakar, Azuraliza
Abdullah, Salwani
A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
description A concept hierarchy is an integral part of an ontology but it is expensive and time consuming to build. Motivated by this, many unsupervised learning methods have been proposed to (semi-) automatically develop a concept hierarchy. A significant work is the Guided Agglomerative Hierarchical Clustering (GAHC) which relies on linguistic patterns (i.e., hypernyms) to guide the clustering process. However, GAHC still relies on contextual features to build the concept hierarchy, thus data sparsity still remains an issue in GAHC. Artificial Immune Systems are known for robustness, noise tolerance and adaptability. Thus, an extension to the GAHC is proposed by hybridizing it with Artificial Immune Network (aiNet) which we call Guided Clustering and aiNet for Learning Concept Hierarchy (GCAINY). In this paper, we have tested GCAINY using two parameter settings. The first parameter setting is obtained from the literature as a baseline parameter setting and second is by automatic parameter tuning using Particle Swarm Optimization (PSO). The effectiveness of the GCAINY is evaluated on three data sets. For further validations, a comparison between GCAINY and GAHC has been conducted and with statistical tests showing that GCAINY increases the quality of the induced concept hierarchy. The results reveal that the parameters value found by using PSO significantly produce better concept hierarchy than the vanilla parameter. Thus it can be concluded that the proposed approach has greater ability to be used in the field of ontology learning.
format Article
author Ahmad Nazri, Mohd. Zakree
Shamsudin, Siti Mariyam
Abu Bakar, Azuraliza
Abdullah, Salwani
author_facet Ahmad Nazri, Mohd. Zakree
Shamsudin, Siti Mariyam
Abu Bakar, Azuraliza
Abdullah, Salwani
author_sort Ahmad Nazri, Mohd. Zakree
title A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
title_short A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
title_full A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
title_fullStr A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
title_full_unstemmed A hybrid approach for learning concept hierarchy from Malay text using artificial immune network
title_sort hybrid approach for learning concept hierarchy from malay text using artificial immune network
publisher Springer
publishDate 2010
url http://eprints.utm.my/id/eprint/22797/
http://link.springer.com/article/10.1007%2Fs11047-010-9228-7
_version_ 1643647400380727296
score 13.214268