Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification
Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as le...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf http://eprints.utem.edu.my/id/eprint/87/ http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utem.eprints.87 |
---|---|
record_format |
eprints |
spelling |
my.utem.eprints.872021-09-19T18:04:59Z http://eprints.utem.edu.my/id/eprint/87/ Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification Ahmad, N. Q Science (General) Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited. 2010 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf Ahmad, N. (2010) Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification. Neural Computing and Applications, 19 (4). pp. 531-542. ISSN 0941-0643 http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407 |
institution |
Universiti Teknikal Malaysia Melaka |
building |
UTEM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknikal Malaysia Melaka |
content_source |
UTEM Institutional Repository |
url_provider |
http://eprints.utem.edu.my/ |
language |
English |
topic |
Q Science (General) |
spellingShingle |
Q Science (General) Ahmad, N. Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
description |
Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited. |
format |
Article |
author |
Ahmad, N. |
author_facet |
Ahmad, N. |
author_sort |
Ahmad, N. |
title |
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
title_short |
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
title_full |
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
title_fullStr |
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
title_full_unstemmed |
Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification |
title_sort |
cluster identification and separation in the growing self-organizing map: application in protein sequence classification |
publishDate |
2010 |
url |
http://eprints.utem.edu.my/id/eprint/87/1/Norashikin_JournalOfNeuralComputing.pdf http://eprints.utem.edu.my/id/eprint/87/ http://www.scopus.com/inward/record.url?eid=2-s2.0-77952876529&partnerID=40&md5=eb7886ca427a6158351632248739a407 |
_version_ |
1712288900295163904 |
score |
13.160551 |