Hybrid of hierarchical and partitional clustering algorithm for gene expression data

Microarray analysis able to monitor thousands of gene expression data, however, to elucidate the hidden patterns in the data is a complex process. These gene expression data show its imprecision, noise and vagueness due to its high dimensional properties. There are a handful of clustering algorithms...

Full description

Saved in:
Bibliographic Details
Main Authors: Raja Kumaran, Shamini, Othman, Mohd. Shahizan, Mi Yusuf, Lizawati
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:http://eprints.utm.my/id/eprint/91810/1/ShaminiRajaKumaran2020_HybridofHierarchicalandPartitionalClustering.pdf
http://eprints.utm.my/id/eprint/91810/
http://dx.doi.org/10.1088/1757-899X/864/1/012071
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.91810
record_format eprints
spelling my.utm.918102021-07-28T08:47:44Z http://eprints.utm.my/id/eprint/91810/ Hybrid of hierarchical and partitional clustering algorithm for gene expression data Raja Kumaran, Shamini Othman, Mohd. Shahizan Mi Yusuf, Lizawati QA75 Electronic computers. Computer science Microarray analysis able to monitor thousands of gene expression data, however, to elucidate the hidden patterns in the data is a complex process. These gene expression data show its imprecision, noise and vagueness due to its high dimensional properties. There are a handful of clustering algorithms have been proposed to extract the important information from the gene expression data. However, identifying the underlying biological knowledge of the data is still hard. To acknowledge these issues, clustering algorithms are used to reduce the data complexity. In this article, hybrid of agglomerative hierarchical clustering and modified k-medoids (partitional clustering) are proposed. Application of the proposed of clustering algorithms to group the genes that have similar functionality which might assist pre-processing procedures. In order to emphasize the quality of the clustering results, cluster quality index (CQI) is determined. Lung and ovary data sets used and the method retrieved a fair clustering with CQI, 0.37 and 0.48 respectively. This research contributes by avoiding biasness toward genes and provide true sense of clustering output using the advantage of hierarchical and partitional clustering methods. 2020-07-09 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/91810/1/ShaminiRajaKumaran2020_HybridofHierarchicalandPartitionalClustering.pdf Raja Kumaran, Shamini and Othman, Mohd. Shahizan and Mi Yusuf, Lizawati (2020) Hybrid of hierarchical and partitional clustering algorithm for gene expression data. In: 2nd Joint Conference on Green Engineering Technology and Applied Computing 2020, IConGETech 2020 and International Conference on Applied Computing 2020, ICAC 2020, 4 February 2020 - 5 February 2020, Langkawi, Kedah, Malaysia. http://dx.doi.org/10.1088/1757-899X/864/1/012071
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Raja Kumaran, Shamini
Othman, Mohd. Shahizan
Mi Yusuf, Lizawati
Hybrid of hierarchical and partitional clustering algorithm for gene expression data
description Microarray analysis able to monitor thousands of gene expression data, however, to elucidate the hidden patterns in the data is a complex process. These gene expression data show its imprecision, noise and vagueness due to its high dimensional properties. There are a handful of clustering algorithms have been proposed to extract the important information from the gene expression data. However, identifying the underlying biological knowledge of the data is still hard. To acknowledge these issues, clustering algorithms are used to reduce the data complexity. In this article, hybrid of agglomerative hierarchical clustering and modified k-medoids (partitional clustering) are proposed. Application of the proposed of clustering algorithms to group the genes that have similar functionality which might assist pre-processing procedures. In order to emphasize the quality of the clustering results, cluster quality index (CQI) is determined. Lung and ovary data sets used and the method retrieved a fair clustering with CQI, 0.37 and 0.48 respectively. This research contributes by avoiding biasness toward genes and provide true sense of clustering output using the advantage of hierarchical and partitional clustering methods.
format Conference or Workshop Item
author Raja Kumaran, Shamini
Othman, Mohd. Shahizan
Mi Yusuf, Lizawati
author_facet Raja Kumaran, Shamini
Othman, Mohd. Shahizan
Mi Yusuf, Lizawati
author_sort Raja Kumaran, Shamini
title Hybrid of hierarchical and partitional clustering algorithm for gene expression data
title_short Hybrid of hierarchical and partitional clustering algorithm for gene expression data
title_full Hybrid of hierarchical and partitional clustering algorithm for gene expression data
title_fullStr Hybrid of hierarchical and partitional clustering algorithm for gene expression data
title_full_unstemmed Hybrid of hierarchical and partitional clustering algorithm for gene expression data
title_sort hybrid of hierarchical and partitional clustering algorithm for gene expression data
publishDate 2020
url http://eprints.utm.my/id/eprint/91810/1/ShaminiRajaKumaran2020_HybridofHierarchicalandPartitionalClustering.pdf
http://eprints.utm.my/id/eprint/91810/
http://dx.doi.org/10.1088/1757-899X/864/1/012071
_version_ 1706956997260214272
score 13.250246