Similarity-based virtual screen using enhanced siamese deep learning methods

Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologicall...

Full description

Saved in:
Bibliographic Details
Main Authors: Altalib, Mohammed Khaldoon, Salim, Naomie
Format: Article
Language:English
Published: American Chemical Society 2022
Subjects:
Online Access:http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf
http://eprints.utm.my/id/eprint/100560/
http://dx.doi.org/10.1021/acsomega.1c04587
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.100560
record_format eprints
spelling my.utm.1005602023-04-17T07:01:21Z http://eprints.utm.my/id/eprint/100560/ Similarity-based virtual screen using enhanced siamese deep learning methods Altalib, Mohammed Khaldoon Salim, Naomie QA75 Electronic computers. Computer science Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologically active substances. Simple statistical tools are difficult to use because of the enormous amount of information and complex data samples of molecules that are structurally heterogeneous recorded in these databases. Many techniques for capturing the biological similarity between a test compound and a known target ligand in LBVS have been established. However, despite the good performances of the above methods compared to their prior, especially when dealing with molecules that have homogeneous active structural elements, they are not satisfied when dealing with molecules that are structurally heterogeneous. Deep learning models have recently achieved considerable success in a variety of disciplines due to their powerful generalization and feature extraction capabilities. Also, the Siamese network has been used in similarity models for more complicated data samples, especially with heterogeneous data samples. The main aim of this study is to enhance the performance of similarity searching, especially with molecules that are structurally heterogeneous. The Siamese architecture will be enhanced using two similarity distance layers with one fusion layer to further improve the similarity measurements between molecules and then adding many layers after the fusion layer for some models to improve the retrieval recall. In this architecture, several methods of deep learning have been used, which are long short-term memory (LSTM), gated recurrent unit (GRU), convolutional neural network-one dimension (CNN1D), and convolutional neural network-two dimensions (CNN2D). A series of experiments have been carried out on real-world data sets, and the results have shown that the proposed methods outperformed the existing methods. American Chemical Society 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf Altalib, Mohammed Khaldoon and Salim, Naomie (2022) Similarity-based virtual screen using enhanced siamese deep learning methods. ACS Omega, 7 (6). pp. 4769-4786. ISSN 2470-1343 http://dx.doi.org/10.1021/acsomega.1c04587 DOI : 10.1021/acsomega.1c04587
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Altalib, Mohammed Khaldoon
Salim, Naomie
Similarity-based virtual screen using enhanced siamese deep learning methods
description Traditional drug production is a long and complex process that leads to new drug production. The virtual screening technique is a computational method that allows chemical compounds to be screened at an acceptable time and cost. Several databases contain information on various aspects of biologically active substances. Simple statistical tools are difficult to use because of the enormous amount of information and complex data samples of molecules that are structurally heterogeneous recorded in these databases. Many techniques for capturing the biological similarity between a test compound and a known target ligand in LBVS have been established. However, despite the good performances of the above methods compared to their prior, especially when dealing with molecules that have homogeneous active structural elements, they are not satisfied when dealing with molecules that are structurally heterogeneous. Deep learning models have recently achieved considerable success in a variety of disciplines due to their powerful generalization and feature extraction capabilities. Also, the Siamese network has been used in similarity models for more complicated data samples, especially with heterogeneous data samples. The main aim of this study is to enhance the performance of similarity searching, especially with molecules that are structurally heterogeneous. The Siamese architecture will be enhanced using two similarity distance layers with one fusion layer to further improve the similarity measurements between molecules and then adding many layers after the fusion layer for some models to improve the retrieval recall. In this architecture, several methods of deep learning have been used, which are long short-term memory (LSTM), gated recurrent unit (GRU), convolutional neural network-one dimension (CNN1D), and convolutional neural network-two dimensions (CNN2D). A series of experiments have been carried out on real-world data sets, and the results have shown that the proposed methods outperformed the existing methods.
format Article
author Altalib, Mohammed Khaldoon
Salim, Naomie
author_facet Altalib, Mohammed Khaldoon
Salim, Naomie
author_sort Altalib, Mohammed Khaldoon
title Similarity-based virtual screen using enhanced siamese deep learning methods
title_short Similarity-based virtual screen using enhanced siamese deep learning methods
title_full Similarity-based virtual screen using enhanced siamese deep learning methods
title_fullStr Similarity-based virtual screen using enhanced siamese deep learning methods
title_full_unstemmed Similarity-based virtual screen using enhanced siamese deep learning methods
title_sort similarity-based virtual screen using enhanced siamese deep learning methods
publisher American Chemical Society
publishDate 2022
url http://eprints.utm.my/id/eprint/100560/1/ChuaLeeSuan2022_SimilarityBasedVirtualScreenUsingEnhanced.pdf
http://eprints.utm.my/id/eprint/100560/
http://dx.doi.org/10.1021/acsomega.1c04587
_version_ 1765296671375228928
score 13.211853