An integrative gene selection with association analysis for microarray data classification

The rising interest in integrative approaches has shifted gene selection from a purely data-centric task to one that incorporates additional biological knowledge. Integrative gene selection is viewed as a promising approach in microarray data classification that takes into consideration the complex relationships amon...

Bibliographic Details
Main Authors: Ong, Huey Fang, Mustapha, Norwati, Sulaiman, Md. Nasir
Format: Article
Language: English
Published: IOS Press 2014
Online Access: http://psasir.upm.edu.my/id/eprint/34225/1/An%20integrative%20gene%20selection%20with%20association%20analysis%20for%20microarray%20data%20classification.pdf
http://psasir.upm.edu.my/id/eprint/34225/
http://content.iospress.com/articles/intelligent-data-analysis/ida00666
id my.upm.eprints.34225
record_format eprints
spelling my.upm.eprints.34225 2016-09-14T04:23:58Z PeerReviewed application/pdf en Ong, Huey Fang and Mustapha, Norwati and Sulaiman, Md. Nasir (2014) An integrative gene selection with association analysis for microarray data classification. Intelligent Data Analysis, 18 (4). pp. 739-758. ISSN 1088-467X; ESSN: 1571-4128 http://content.iospress.com/articles/intelligent-data-analysis/ida00666 DOI: 10.3233/IDA-140666
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description The rising interest in integrative approaches has shifted gene selection from a purely data-centric task to one that incorporates additional biological knowledge. Integrative gene selection is viewed as a promising approach in microarray data classification because it takes into consideration the complex relationships among genes. However, in most existing methods, genes are still selected on expression values alone, and biological knowledge is integrated only at the end of the analysis to verify experimental results or to gain biological insights. This paper therefore proposes an integrative gene selection method, based on a filter method and association analysis, for selecting genes that are not only differentially expressed but also informative for classification. Association analysis is employed to integrate microarray data with multiple types of biological knowledge simultaneously, and to identify groups of genes that frequently co-occur in target samples. The method has been tested on four cancer-related datasets with two types of biological knowledge incorporated, namely Gene Ontology (GO) and KEGG Pathways (KEGG). The experimental results show that the recommended GO-based, KEGG-based, and GO-KEGG-based models outperformed the expression-only models, attaining better classification accuracies with fewer genes. The performance of the integrative models verified the efficiency and scalability of association analysis in mining microarray data.
format Article
author Ong, Huey Fang
Mustapha, Norwati
Sulaiman, Md. Nasir
title An integrative gene selection with association analysis for microarray data classification
publisher IOS Press
publishDate 2014
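
The abstract in this record describes association analysis as a way to combine expression data with GO and KEGG annotations and to find groups of genes that frequently co-occur in target samples. The record does not give the authors' algorithm, so the following is only a minimal sketch of the general idea, assuming an Apriori-style frequent itemset search. The gene names, annotation labels, sample contents, the `min_support` value, and the helper names `frequent_itemsets` and `to_transaction` are all hypothetical and chosen for illustration; the paper's filter step and actual parameters are not reproduced here.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Level-wise (Apriori-style) search.

    Returns {frozenset(items): support} for every itemset whose support
    (fraction of transactions containing it) is at least min_support.
    """
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)
    # Start with all single items as candidates.
    current = [frozenset([i]) for i in sorted({i for t in transactions for i in t})]
    result = {}
    k = 1
    while current:
        # Count how many transactions contain each candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in current}
        frequent = {c: cnt / n for c, cnt in counts.items() if cnt / n >= min_support}
        result.update(frequent)
        # Join frequent k-itemsets into (k+1)-item candidates.
        k += 1
        candidates = {a | b for a, b in combinations(list(frequent), 2) if len(a | b) == k}
        # Prune candidates that have an infrequent (k-1)-item subset.
        current = [
            c for c in candidates
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        ]
    return result


# Toy, made-up inputs (hypothetical names, not data from the paper).
# annotations: gene -> annotation terms standing in for GO / KEGG labels.
annotations = {
    "geneA": {"GO:x"},
    "geneB": {"GO:x", "KEGG:y"},
    "geneC": {"KEGG:y"},
}
# Each set lists the genes already flagged as differentially expressed
# in one target-class sample (the filter step is assumed to have run).
samples = [
    {"geneA", "geneB"},
    {"geneA", "geneB", "geneC"},
    {"geneB", "geneC"},
    {"geneA", "geneB"},
]


def to_transaction(genes):
    """Merge a sample's genes with their annotation terms into one item set."""
    items = set(genes)
    for g in genes:
        items |= annotations.get(g, set())
    return items


transactions = [to_transaction(s) for s in samples]
itemsets = frequent_itemsets(transactions, min_support=0.75)

# Keep only multi-gene groups, dropping itemsets that contain annotation terms.
gene_groups = {
    s: sup for s, sup in itemsets.items()
    if len(s) > 1 and all(i.startswith("gene") for i in s)
}
for group, support in gene_groups.items():
    print(sorted(group), support)
```

Treating annotation terms as ordinary items alongside genes is one simple way to mine expression data and several knowledge sources in a single pass; whether the paper encodes them this way is an assumption.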