Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification

Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as le...

Full description

Saved in:
Bibliographic Details
Main Author: Ahmad, N.
Format: Article
Language:English
Published: 2010
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf
http://eprints.utem.edu.my/id/eprint/87/
http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utem.eprints.87
record_format eprints
spelling my.utem.eprints.872021-09-19T18:04:59Z http://eprints.utem.edu.my/id/eprint/87/ Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification Ahmad, N. Q Science (General) Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited. 2010 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf Ahmad, N. (2010) Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification. Neural Computing and Applications, 19 (4). pp. 531-542. ISSN 0941-0643 http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407
institution Universiti Teknikal Malaysia Melaka
building UTEM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknikal Malaysia Melaka
content_source UTEM Institutional Repository
url_provider http://eprints.utem.edu.my/
language English
topic Q Science (General)
spellingShingle Q Science (General)
Ahmad, N.
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
description Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited.
format Article
author Ahmad, N.
author_facet Ahmad, N.
author_sort Ahmad, N.
title Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_short Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_full Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_fullStr Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_full_unstemmed Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
title_sort cluster identification and separation in the growing self-organizing map: application in protein sequence classification
publishDate 2010
url http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf
http://eprints.utem.edu.my/id/eprint/87/
http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407
_version_ 1712288900295163904
score 13.160551