Modelling kernel methods for unsupervised learning of micro array data

Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more...

Full description

Saved in:
Bibliographic Details
Main Author: Md. Sap, Mohd. Noor
Format: Monograph
Language:English
Published: Faculty of Computer Science and Information System 2008
Subjects:
Online Access:http://eprints.utm.my/id/eprint/5818/1/78096.pdf
http://eprints.utm.my/id/eprint/5818/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.5818
record_format eprints
spelling my.utm.58182017-08-10T01:37:17Z http://eprints.utm.my/id/eprint/5818/ Modelling kernel methods for unsupervised learning of micro array data Md. Sap, Mohd. Noor QA75 Electronic computers. Computer science Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more challenging because of its peculiar characteristics such as non-linear separability, outliers, high dimensionality, and diverse structures. Therefore, this study aims at combining kernel methods, capable of both handling the high dimensionality and discovering nonlinear relationships in the data, with the approximate reasoning offered by fuzzy approach. To this end, a robust Weighted Kernel Fuzzy C-Means incorporating local approximation (WKFCM) is presented. In WKFCM, fuzzy membership of each object is approximated from the memberships of its neighbouring objects. It brings in the synergy of partitioning and density based clustering approaches and provides a substantial improvement in the analysis of the data using unsupervised learning. Comparative analysis with K-means, hierarchical, fuzzy C-means and fuzzy self organizing maps showed that, although different types of datasets are better partitioned by different algorithms, WKFCM displays the best overall performance, and has the ability to capture nonlinear relationships and non-globular clusters, and identify cluster outliers. Faculty of Computer Science and Information System 2008-01-31 Monograph NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/5818/1/78096.pdf Md. Sap, Mohd. Noor (2008) Modelling kernel methods for unsupervised learning of micro array data. Project Report. Faculty of Computer Science and Information System, Skudai, Johor. (Unpublished)
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Md. Sap, Mohd. Noor
Modelling kernel methods for unsupervised learning of micro array data
description Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Data clustering analysis has been extensively applied to extract information from microarray gene expression data. However, finding good quality clusters in gene expression data is more challenging because of its peculiar characteristics such as non-linear separability, outliers, high dimensionality, and diverse structures. Therefore, this study aims at combining kernel methods, capable of both handling the high dimensionality and discovering nonlinear relationships in the data, with the approximate reasoning offered by fuzzy approach. To this end, a robust Weighted Kernel Fuzzy C-Means incorporating local approximation (WKFCM) is presented. In WKFCM, fuzzy membership of each object is approximated from the memberships of its neighbouring objects. It brings in the synergy of partitioning and density based clustering approaches and provides a substantial improvement in the analysis of the data using unsupervised learning. Comparative analysis with K-means, hierarchical, fuzzy C-means and fuzzy self organizing maps showed that, although different types of datasets are better partitioned by different algorithms, WKFCM displays the best overall performance, and has the ability to capture nonlinear relationships and non-globular clusters, and identify cluster outliers.
format Monograph
author Md. Sap, Mohd. Noor
author_facet Md. Sap, Mohd. Noor
author_sort Md. Sap, Mohd. Noor
title Modelling kernel methods for unsupervised learning of micro array data
title_short Modelling kernel methods for unsupervised learning of micro array data
title_full Modelling kernel methods for unsupervised learning of micro array data
title_fullStr Modelling kernel methods for unsupervised learning of micro array data
title_full_unstemmed Modelling kernel methods for unsupervised learning of micro array data
title_sort modelling kernel methods for unsupervised learning of micro array data
publisher Faculty of Computer Science and Information System
publishDate 2008
url http://eprints.utm.my/id/eprint/5818/1/78096.pdf
http://eprints.utm.my/id/eprint/5818/
_version_ 1643644410175422464
score 13.209306