DTLM-DBP: deep transfer learning models for DNA binding proteins identification

The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, dru...

Full description

Saved in:
Bibliographic Details
Main Authors: Saber, S., Khairuddin, U., Yusof, R., Madani, A.
Format: Article
Language:English
Published: Tech Science Press 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf
http://eprints.utm.my/id/eprint/94887/
http://dx.doi.org/10.32604/cmc.2021.017769
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.94887
record_format eprints
spelling my.utm.948872022-04-29T21:54:56Z http://eprints.utm.my/id/eprint/94887/ DTLM-DBP: deep transfer learning models for DNA binding proteins identification Saber, S. Khairuddin, U. Yusof, R. Madani, A. T58.5-58.64 Information technology The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, drugs, and steroids on DNA. This paper presents an efficient approach for DNABPs identification based on deep transfer learning, named "DTLM-DBP." Two transfer learning methods are used in the identification process. The first is based on the pre-trained deep learning model as a feature's extractor and classifier. Two different pre-trained Convolutional Neural Networks (CNN), AlexNet 8 and VGG 16, are tested and compared. The second method uses the deep learning model as a feature's extractor only and two different classifiers for the identification process. Two classifiers, Support Vector Machine (SVM) and Random Forest (RF), are tested and compared. The proposed approach is tested using different DNA proteins datasets. The performance of the identification process is evaluated in terms of identification accuracy, sensitivity, specificity andMCC, with four available DNAproteins datasets:PDB1075,PDB186,PDNA-543, and PDNA-316. The results show that the RF classifier, with VGG-Net pre-trained deep transfer learning features, gives the highest performance. DTLM-DBP was compared with other published methods and it provides a considerable improvement in the performance of DNABPs identification. Tech Science Press 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf Saber, S. and Khairuddin, U. and Yusof, R. and Madani, A. (2021) DTLM-DBP: deep transfer learning models for DNA binding proteins identification. Computers, Materials and Continua, 68 (3). ISSN 1546-2218 http://dx.doi.org/10.32604/cmc.2021.017769 DOI: 10.32604/cmc.2021.017769
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic T58.5-58.64 Information technology
spellingShingle T58.5-58.64 Information technology
Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
DTLM-DBP: deep transfer learning models for DNA binding proteins identification
description The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, drugs, and steroids on DNA. This paper presents an efficient approach for DNABPs identification based on deep transfer learning, named "DTLM-DBP." Two transfer learning methods are used in the identification process. The first is based on the pre-trained deep learning model as a feature's extractor and classifier. Two different pre-trained Convolutional Neural Networks (CNN), AlexNet 8 and VGG 16, are tested and compared. The second method uses the deep learning model as a feature's extractor only and two different classifiers for the identification process. Two classifiers, Support Vector Machine (SVM) and Random Forest (RF), are tested and compared. The proposed approach is tested using different DNA proteins datasets. The performance of the identification process is evaluated in terms of identification accuracy, sensitivity, specificity andMCC, with four available DNAproteins datasets:PDB1075,PDB186,PDNA-543, and PDNA-316. The results show that the RF classifier, with VGG-Net pre-trained deep transfer learning features, gives the highest performance. DTLM-DBP was compared with other published methods and it provides a considerable improvement in the performance of DNABPs identification.
format Article
author Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
author_facet Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
author_sort Saber, S.
title DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_short DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_full DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_fullStr DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_full_unstemmed DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_sort dtlm-dbp: deep transfer learning models for dna binding proteins identification
publisher Tech Science Press
publishDate 2021
url http://eprints.utm.my/id/eprint/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf
http://eprints.utm.my/id/eprint/94887/
http://dx.doi.org/10.32604/cmc.2021.017769
_version_ 1732945406265917440
score 13.154949