CMD: a database to store the bonding states of cysteine motifs with secondary structures

Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and appl...

Full description

Saved in:
Bibliographic Details
Main Authors: Bostan, Hamed, Salim, Naomie, Hussein, Zeti Azura, Klappa, Peter, Sahmsir, Mohd. Shahir
Format: Article
Language:English
Published: 2012
Subjects:
Online Access:http://eprints.utm.my/id/eprint/46694/1/HamedBostan_2012_CMD%20A%20Database%20to%20Store%20the%20Bonding%20States%20of%20Cysteine%20Motifs.pdf
http://eprints.utm.my/id/eprint/46694/
http://dx.doi.org/10.1155/2012/849830
http://dx.doi.org/10.1155/2012/849830
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.46694
record_format eprints
spelling my.utm.466942017-09-18T04:04:18Z http://eprints.utm.my/id/eprint/46694/ CMD: a database to store the bonding states of cysteine motifs with secondary structures Bostan, Hamed Salim, Naomie Hussein, Zeti Azura Klappa, Peter Sahmsir, Mohd. Shahir QH Natural history Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD) is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition. 2012 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/46694/1/HamedBostan_2012_CMD%20A%20Database%20to%20Store%20the%20Bonding%20States%20of%20Cysteine%20Motifs.pdf Bostan, Hamed and Salim, Naomie and Hussein, Zeti Azura and Klappa, Peter and Sahmsir, Mohd. Shahir (2012) CMD: a database to store the bonding states of cysteine motifs with secondary structures. Advances in Bioinformatics . pp. 1-5. ISSN 1687-8027 http://dx.doi.org/10.1155/2012/849830 http://dx.doi.org/10.1155/2012/849830
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QH Natural history
spellingShingle QH Natural history
Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Sahmsir, Mohd. Shahir
CMD: a database to store the bonding states of cysteine motifs with secondary structures
description Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD) is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.
format Article
author Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Sahmsir, Mohd. Shahir
author_facet Bostan, Hamed
Salim, Naomie
Hussein, Zeti Azura
Klappa, Peter
Sahmsir, Mohd. Shahir
author_sort Bostan, Hamed
title CMD: a database to store the bonding states of cysteine motifs with secondary structures
title_short CMD: a database to store the bonding states of cysteine motifs with secondary structures
title_full CMD: a database to store the bonding states of cysteine motifs with secondary structures
title_fullStr CMD: a database to store the bonding states of cysteine motifs with secondary structures
title_full_unstemmed CMD: a database to store the bonding states of cysteine motifs with secondary structures
title_sort cmd: a database to store the bonding states of cysteine motifs with secondary structures
publishDate 2012
url http://eprints.utm.my/id/eprint/46694/1/HamedBostan_2012_CMD%20A%20Database%20to%20Store%20the%20Bonding%20States%20of%20Cysteine%20Motifs.pdf
http://eprints.utm.my/id/eprint/46694/
http://dx.doi.org/10.1155/2012/849830
http://dx.doi.org/10.1155/2012/849830
_version_ 1643652111630598144
score 13.214268