Similarity-based virtual screen using enhanced siamese deep learning methods
Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologicall...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
American Chemical Society
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf http://eprints.utm.my/id/eprint/100560/ http://dx.doi.org/10.1021/acsomega.1c04587 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.100560 |
---|---|
record_format |
eprints |
spelling |
my.utm.1005602023-04-17T07:01:21Z http://eprints.utm.my/id/eprint/100560/ Similarity-based virtual screen using enhanced siamese deep learning methods Altalib, Mohammed Khaldoon Salim, Naomie QA75 Electronic computers. Computer science Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologically active substances. Simple statistical tools are difficult to use because of the enormous amount of information and complex data samples of molecules that are structurally heterogeneous recorded in these databases. Many techniques for capturing the biological similarity between a test compound and a known target ligand in LBVS have been established. However, despite the good performances of the above methods compared to their prior, especially when dealing with molecules that have homogeneous active structural elements, they are not satisfied when dealing with molecules that are structurally heterogeneous. Deep learning models have recently achieved considerable success in a variety of disciplines due to their powerful generalization and feature extraction capabilities. Also, the Siamese network has been used in similarity models for more complicated data samples, especially with heterogeneous data samples. The main aim of this study is to enhance the performance of similarity searching, especially with molecules that are structurally heterogeneous. The Siamese architecture will be enhanced using two similarity distance layers with one fusion layer to further improve the similarity measurements between molecules and then adding many layers after the fusion layer for some models to improve the retrieval recall. In this architecture, several methods of deep learning have been used, which are long short-term memory (LSTM), gated recurrent unit (GRU), convolutional neural network-one dimension (CNN1D), and convolutional neural network-two dimensions (CNN2D). A series of experiments have been carried out on real-world data sets, and the results have shown that the proposed methods outperformed the existing methods. American Chemical Society 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf Altalib, Mohammed Khaldoon and Salim, Naomie (2022) Similarity-based virtual screen using enhanced siamese deep learning methods. ACS Omega, 7 (6). pp. 4769-4786. ISSN 2470-1343 http://dx.doi.org/10.1021/acsomega.1c04587 DOI : 10.1021/acsomega.1c04587 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Altalib, Mohammed Khaldoon Salim, Naomie Similarity-based virtual screen using enhanced siamese deep learning methods |
description |
Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologically active substances. Simple statistical tools are difficult to use because of the enormous amount of information and complex data samples of molecules that are structurally heterogeneous recorded in these databases. Many techniques for capturing the biological similarity between a test compound and a known target ligand in LBVS have been established. However, despite the good performances of the above methods compared to their prior, especially when dealing with molecules that have homogeneous active structural elements, they are not satisfied when dealing with molecules that are structurally heterogeneous. Deep learning models have recently achieved considerable success in a variety of disciplines due to their powerful generalization and feature extraction capabilities. Also, the Siamese network has been used in similarity models for more complicated data samples, especially with heterogeneous data samples. The main aim of this study is to enhance the performance of similarity searching, especially with molecules that are structurally heterogeneous. The Siamese architecture will be enhanced using two similarity distance layers with one fusion layer to further improve the similarity measurements between molecules and then adding many layers after the fusion layer for some models to improve the retrieval recall. In this architecture, several methods of deep learning have been used, which are long short-term memory (LSTM), gated recurrent unit (GRU), convolutional neural network-one dimension (CNN1D), and convolutional neural network-two dimensions (CNN2D). A series of experiments have been carried out on real-world data sets, and the results have shown that the proposed methods outperformed the existing methods. |
format |
Article |
author |
Altalib, Mohammed Khaldoon Salim, Naomie |
author_facet |
Altalib, Mohammed Khaldoon Salim, Naomie |
author_sort |
Altalib, Mohammed Khaldoon |
title |
Similarity-based virtual screen using enhanced siamese deep learning methods |
title_short |
Similarity-based virtual screen using enhanced siamese deep learning methods |
title_full |
Similarity-based virtual screen using enhanced siamese deep learning methods |
title_fullStr |
Similarity-based virtual screen using enhanced siamese deep learning methods |
title_full_unstemmed |
Similarity-based virtual screen using enhanced siamese deep learning methods |
title_sort |
similarity-based virtual screen using enhanced siamese deep learning methods |
publisher |
American Chemical Society |
publishDate |
2022 |
url |
http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf http://eprints.utm.my/id/eprint/100560/ http://dx.doi.org/10.1021/acsomega.1c04587 |
_version_ |
1765296671375228928 |
score |
13.211853 |