Deep learning based methods for molecular similarity searching: a systematic review
In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target qu...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2023
|
Subjects: | |
Online Access: | http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf http://eprints.utm.my/106538/ http://dx.doi.org/10.3390/pr11051340 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.106538 |
---|---|
record_format |
eprints |
spelling |
my.utm.1065382024-07-09T06:48:06Z http://eprints.utm.my/106538/ Deep learning based methods for molecular similarity searching: a systematic review Nasser, Maged Yusof, Umi Kalsom Salim, Naomie Q Science (General) In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target query. Although the approaches perform effectively, particularly when dealing with molecules with homogenous active structures, they fall short when dealing with compounds that have heterogeneous structural compounds. In recent times, deep learning methods have been exploited for improving the performance of molecule searching due to their feature extraction power and generalization capabilities. However, despite numerous research studies on deep-learning-based molecular similarity searches, relatively few secondary research was carried out in the area. This research aims to provide a systematic literature review (SLR) on deep-learning-based molecular similarity searches to enable researchers and practitioners to better understand the current trends and issues in the field. The study accesses 875 distinctive papers from the selected journals and conferences, which were published over the last thirteen years (2010–2023). After the full-text eligibility analysis and careful screening of the abstract, 65 studies were selected for our SLR. The review’s findings showed that the multilayer perceptrons (MLPs) and autoencoders (AEs) are the most frequently used deep learning models for molecular similarity searching; next are the models based on convolutional neural networks (CNNs) techniques. The ChEMBL dataset and DrugBank standard dataset are the two datasets that are most frequently used for the evaluation of deep learning methods for molecular similarity searching based on the results. In addition, the results show that the most popular methods for optimizing the performance of molecular similarity searching are new representation approaches and reweighing features techniques, and, for evaluating the efficiency of deep-learning-based molecular similarity searching, the most widely used metrics are the area under the curve (AUC) and precision measures. MDPI 2023-05 Article PeerReviewed application/pdf en http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf Nasser, Maged and Yusof, Umi Kalsom and Salim, Naomie (2023) Deep learning based methods for molecular similarity searching: a systematic review. Processes, 11 (5). pp. 1-27. ISSN 2227-9717 http://dx.doi.org/10.3390/pr11051340 DOI:10.3390/pr11051340 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
Q Science (General) |
spellingShingle |
Q Science (General) Nasser, Maged Yusof, Umi Kalsom Salim, Naomie Deep learning based methods for molecular similarity searching: a systematic review |
description |
In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target query. Although the approaches perform effectively, particularly when dealing with molecules with homogenous active structures, they fall short when dealing with compounds that have heterogeneous structural compounds. In recent times, deep learning methods have been exploited for improving the performance of molecule searching due to their feature extraction power and generalization capabilities. However, despite numerous research studies on deep-learning-based molecular similarity searches, relatively few secondary research was carried out in the area. This research aims to provide a systematic literature review (SLR) on deep-learning-based molecular similarity searches to enable researchers and practitioners to better understand the current trends and issues in the field. The study accesses 875 distinctive papers from the selected journals and conferences, which were published over the last thirteen years (2010–2023). After the full-text eligibility analysis and careful screening of the abstract, 65 studies were selected for our SLR. The review’s findings showed that the multilayer perceptrons (MLPs) and autoencoders (AEs) are the most frequently used deep learning models for molecular similarity searching; next are the models based on convolutional neural networks (CNNs) techniques. The ChEMBL dataset and DrugBank standard dataset are the two datasets that are most frequently used for the evaluation of deep learning methods for molecular similarity searching based on the results. In addition, the results show that the most popular methods for optimizing the performance of molecular similarity searching are new representation approaches and reweighing features techniques, and, for evaluating the efficiency of deep-learning-based molecular similarity searching, the most widely used metrics are the area under the curve (AUC) and precision measures. |
format |
Article |
author |
Nasser, Maged Yusof, Umi Kalsom Salim, Naomie |
author_facet |
Nasser, Maged Yusof, Umi Kalsom Salim, Naomie |
author_sort |
Nasser, Maged |
title |
Deep learning based methods for molecular similarity searching: a systematic review |
title_short |
Deep learning based methods for molecular similarity searching: a systematic review |
title_full |
Deep learning based methods for molecular similarity searching: a systematic review |
title_fullStr |
Deep learning based methods for molecular similarity searching: a systematic review |
title_full_unstemmed |
Deep learning based methods for molecular similarity searching: a systematic review |
title_sort |
deep learning based methods for molecular similarity searching: a systematic review |
publisher |
MDPI |
publishDate |
2023 |
url |
http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf http://eprints.utm.my/106538/ http://dx.doi.org/10.3390/pr11051340 |
_version_ |
1805880827757723648 |
score |
13.211869 |