Deep learning based methods for molecular similarity searching: a systematic review

In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target qu...

Full description

Saved in:
Bibliographic Details
Main Authors: Nasser, Maged, Yusof, Umi Kalsom, Salim, Naomie
Format: Article
Language:English
Published: MDPI 2023
Subjects:
Online Access:http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf
http://eprints.utm.my/106538/
http://dx.doi.org/10.3390/pr11051340
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.106538
record_format eprints
spelling my.utm.1065382024-07-09T06:48:06Z http://eprints.utm.my/106538/ Deep learning based methods for molecular similarity searching: a systematic review Nasser, Maged Yusof, Umi Kalsom Salim, Naomie Q Science (General) In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target query. Although the approaches perform effectively, particularly when dealing with molecules with homogenous active structures, they fall short when dealing with compounds that have heterogeneous structural compounds. In recent times, deep learning methods have been exploited for improving the performance of molecule searching due to their feature extraction power and generalization capabilities. However, despite numerous research studies on deep-learning-based molecular similarity searches, relatively few secondary research was carried out in the area. This research aims to provide a systematic literature review (SLR) on deep-learning-based molecular similarity searches to enable researchers and practitioners to better understand the current trends and issues in the field. The study accesses 875 distinctive papers from the selected journals and conferences, which were published over the last thirteen years (2010–2023). After the full-text eligibility analysis and careful screening of the abstract, 65 studies were selected for our SLR. The review’s findings showed that the multilayer perceptrons (MLPs) and autoencoders (AEs) are the most frequently used deep learning models for molecular similarity searching; next are the models based on convolutional neural networks (CNNs) techniques. The ChEMBL dataset and DrugBank standard dataset are the two datasets that are most frequently used for the evaluation of deep learning methods for molecular similarity searching based on the results. In addition, the results show that the most popular methods for optimizing the performance of molecular similarity searching are new representation approaches and reweighing features techniques, and, for evaluating the efficiency of deep-learning-based molecular similarity searching, the most widely used metrics are the area under the curve (AUC) and precision measures. MDPI 2023-05 Article PeerReviewed application/pdf en http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf Nasser, Maged and Yusof, Umi Kalsom and Salim, Naomie (2023) Deep learning based methods for molecular similarity searching: a systematic review. Processes, 11 (5). pp. 1-27. ISSN 2227-9717 http://dx.doi.org/10.3390/pr11051340 DOI:10.3390/pr11051340
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic Q Science (General)
spellingShingle Q Science (General)
Nasser, Maged
Yusof, Umi Kalsom
Salim, Naomie
Deep learning based methods for molecular similarity searching: a systematic review
description In rational drug design, the concept of molecular similarity searching is frequently used to identify molecules with similar functionalities by looking up structurally related molecules in chemical databases. Different methods have been developed to measure the similarity of molecules to a target query. Although the approaches perform effectively, particularly when dealing with molecules with homogenous active structures, they fall short when dealing with compounds that have heterogeneous structural compounds. In recent times, deep learning methods have been exploited for improving the performance of molecule searching due to their feature extraction power and generalization capabilities. However, despite numerous research studies on deep-learning-based molecular similarity searches, relatively few secondary research was carried out in the area. This research aims to provide a systematic literature review (SLR) on deep-learning-based molecular similarity searches to enable researchers and practitioners to better understand the current trends and issues in the field. The study accesses 875 distinctive papers from the selected journals and conferences, which were published over the last thirteen years (2010–2023). After the full-text eligibility analysis and careful screening of the abstract, 65 studies were selected for our SLR. The review’s findings showed that the multilayer perceptrons (MLPs) and autoencoders (AEs) are the most frequently used deep learning models for molecular similarity searching; next are the models based on convolutional neural networks (CNNs) techniques. The ChEMBL dataset and DrugBank standard dataset are the two datasets that are most frequently used for the evaluation of deep learning methods for molecular similarity searching based on the results. In addition, the results show that the most popular methods for optimizing the performance of molecular similarity searching are new representation approaches and reweighing features techniques, and, for evaluating the efficiency of deep-learning-based molecular similarity searching, the most widely used metrics are the area under the curve (AUC) and precision measures.
format Article
author Nasser, Maged
Yusof, Umi Kalsom
Salim, Naomie
author_facet Nasser, Maged
Yusof, Umi Kalsom
Salim, Naomie
author_sort Nasser, Maged
title Deep learning based methods for molecular similarity searching: a systematic review
title_short Deep learning based methods for molecular similarity searching: a systematic review
title_full Deep learning based methods for molecular similarity searching: a systematic review
title_fullStr Deep learning based methods for molecular similarity searching: a systematic review
title_full_unstemmed Deep learning based methods for molecular similarity searching: a systematic review
title_sort deep learning based methods for molecular similarity searching: a systematic review
publisher MDPI
publishDate 2023
url http://eprints.utm.my/106538/1/NaomieSalim2023_DeepLearningBasedMethodsforMolecular.pdf
http://eprints.utm.my/106538/
http://dx.doi.org/10.3390/pr11051340
_version_ 1805880827757723648
score 13.211869