Modelling kernel methods for unsupervised learning of micro array data
Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more...
Saved in:
Main Author: | |
---|---|
Format: | Monograph |
Language: | English |
Published: |
Faculty of Computer Science and Information System
2008
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/5818/1/78096.pdf http://eprints.utm.my/id/eprint/5818/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.5818 |
---|---|
record_format |
eprints |
spelling |
my.utm.58182017-08-10T01:37:17Z http://eprints.utm.my/id/eprint/5818/ Modelling kernel methods for unsupervised learning of micro array data Md. Sap, Mohd. Noor QA75 Electronic computers. Computer science Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more challenging because of its peculiar characteristics such as non-linear separability, outliers, high dimensionality, and diverse structures. Therefore, this study aims at combining kernel methods, capable of both handling the high dimensionality and discovering nonlinear relationships in the data, with the approximate reasoning offered by fuzzy approach. To this end, a robust Weighted Kernel Fuzzy C-Means incorporating local approximation (WKFCM) is presented. In WKFCM, fuzzy membership of each object is approximated from the memberships of its neighbouring objects. It brings in the synergy of partitioning and density based clustering approaches and provides a substantial improvement in the analysis of the data using unsupervised learning. Comparative analysis with K-means, hierarchical, fuzzy C-means and fuzzy self organizing maps showed that, although different types of datasets are better partitioned by different algorithms, WKFCM displays the best overall performance, and has the ability to capture nonlinear relationships and non-globular clusters, and identify cluster outliers. Faculty of Computer Science and Information System 2008-01-31 Monograph NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/5818/1/78096.pdf Md. Sap, Mohd. Noor (2008) Modelling kernel methods for unsupervised learning of micro array data. Project Report. Faculty of Computer Science and Information System, Skudai, Johor. (Unpublished) |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Md. Sap, Mohd. Noor Modelling kernel methods for unsupervised learning of micro array data |
description |
Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more challenging because of its peculiar characteristics such as non-linear separability, outliers, high dimensionality, and diverse structures. Therefore, this study aims at combining kernel methods, capable of both handling the high dimensionality and discovering nonlinear relationships in the data, with the approximate reasoning offered by fuzzy approach. To this end, a robust Weighted Kernel Fuzzy C-Means incorporating local approximation (WKFCM) is presented. In WKFCM, fuzzy membership of each object is approximated from the memberships of its neighbouring objects. It brings in the synergy of partitioning and density based clustering approaches and provides a substantial improvement in the analysis of the data using unsupervised learning. Comparative analysis with K-means, hierarchical, fuzzy C-means and fuzzy self organizing maps showed that, although different types of datasets are better partitioned by different algorithms, WKFCM displays the best overall performance, and has the ability to capture nonlinear relationships and non-globular clusters, and identify cluster outliers. |
format |
Monograph |
author |
Md. Sap, Mohd. Noor |
author_facet |
Md. Sap, Mohd. Noor |
author_sort |
Md. Sap, Mohd. Noor |
title |
Modelling kernel methods for unsupervised learning of micro array data |
title_short |
Modelling kernel methods for unsupervised learning of micro array data |
title_full |
Modelling kernel methods for unsupervised learning of micro array data |
title_fullStr |
Modelling kernel methods for unsupervised learning of micro array data |
title_full_unstemmed |
Modelling kernel methods for unsupervised learning of micro array data |
title_sort |
modelling kernel methods for unsupervised learning of micro array data |
publisher |
Faculty of Computer Science and Information System |
publishDate |
2008 |
url |
http://eprints.utm.my/id/eprint/5818/1/78096.pdf http://eprints.utm.my/id/eprint/5818/ |
_version_ |
1643644410175422464 |
score |
13.209306 |