Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance
Traditionally, two protein sequences are classified into the same class if their feature patterns have high homology. These feature patterns were originally extracted by sequence alignment algorithms, which measure similarity between an unseen protein sequence and identified protein sequences. Neur...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
IEEE
2003
|
Subjects: | |
Online Access: | http://ir.unimas.my/id/eprint/11927/1/Data%20Mining_abstract.pdf http://ir.unimas.my/id/eprint/11927/ http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1223671 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.unimas.ir.11927 |
---|---|
record_format |
eprints |
spelling |
my.unimas.ir.119272016-05-12T04:32:07Z http://ir.unimas.my/id/eprint/11927/ Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance Wang, Dianhui Lee, Nung Kion Dillon, Tharam S. QA75 Electronic computers. Computer science T Technology (General) Traditionally, two protein sequences are classified into the same class if their feature patterns have high homology. These feature patterns were originally extracted by sequence alignment algorithms, which measure similarity between an unseen protein sequence and identified protein sequences. Neural network approaches, while reasonably accurate at classification, give no information ahout the relationship between the unseen case and the classified items that is useful to biologist. In contrast, in this paper we use a generalized radial basis function (GRBF) neural network architecture 'that generates fuzzy classification rules that could he used for further knowledge discovery. Our proposed techniques were evaluated using protein sequences with ten classes of super-families downloaded from a public domain database, and the results compared favorably with other standard machine learning techniques. IEEE 2003 Conference or Workshop Item NonPeerReviewed text en http://ir.unimas.my/id/eprint/11927/1/Data%20Mining_abstract.pdf Wang, Dianhui and Lee, Nung Kion and Dillon, Tharam S. (2003) Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance. In: Neural Networks, 2003. Proceedings of the International Joint Conference on, 20-24 July 2003. http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1223671 10.1109/IJCNN.2003.1223671 |
institution |
Universiti Malaysia Sarawak |
building |
Centre for Academic Information Services (CAIS) |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sarawak |
content_source |
UNIMAS Institutional Repository |
url_provider |
http://ir.unimas.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science T Technology (General) |
spellingShingle |
QA75 Electronic computers. Computer science T Technology (General) Wang, Dianhui Lee, Nung Kion Dillon, Tharam S. Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
description |
Traditionally, two protein sequences are classified
into the same class if their feature patterns have high homology. These feature patterns were originally extracted by sequence alignment algorithms, which measure similarity between an unseen protein sequence and identified protein sequences. Neural network approaches, while reasonably accurate at classification, give no information ahout the relationship between the unseen case and the classified items that is useful to biologist. In contrast, in this paper we use a generalized radial basis function (GRBF)
neural network architecture 'that generates fuzzy classification rules that could he used for further knowledge discovery. Our proposed techniques were evaluated using protein sequences with ten classes of super-families downloaded from a public domain database, and the results compared favorably with other standard machine learning techniques. |
format |
Conference or Workshop Item |
author |
Wang, Dianhui Lee, Nung Kion Dillon, Tharam S. |
author_facet |
Wang, Dianhui Lee, Nung Kion Dillon, Tharam S. |
author_sort |
Wang, Dianhui |
title |
Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
title_short |
Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
title_full |
Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
title_fullStr |
Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
title_full_unstemmed |
Data Mining for Building Neural Protein Sequence Classification Systems with Improved Performance |
title_sort |
data mining for building neural protein sequence classification systems with improved performance |
publisher |
IEEE |
publishDate |
2003 |
url |
http://ir.unimas.my/id/eprint/11927/1/Data%20Mining_abstract.pdf http://ir.unimas.my/id/eprint/11927/ http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1223671 |
_version_ |
1644511304775368704 |
score |
13.209306 |