Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method

As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descriptors to...

Full description

Saved in:
Bibliographic Details
Main Authors: Charoenkwan, Phasit, Chiangjong, Wararat, Lee, Vannajan Sanghiran, Nantasenamat, Chanin, Hasan, Md Mehedi, Shoombuatong, Watshara
Format: Article
Published: Nature Research 2021
Subjects:
Online Access:http://eprints.um.edu.my/28336/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.28336
record_format eprints
spelling my.um.eprints.283362022-07-31T07:27:43Z http://eprints.um.edu.my/28336/ Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method Charoenkwan, Phasit Chiangjong, Wararat Lee, Vannajan Sanghiran Nantasenamat, Chanin Hasan, Md Mehedi Shoombuatong, Watshara Q Science (General) QD Chemistry As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descriptors together with complex ensemble approaches that consequently leads to low interpretability and thus poses a challenge for biologists and biochemists. Therefore, it is desirable to develop a simple, interpretable and efficient predictor for accurate ACP identification as well as providing the means for the rational design of new anticancer peptides with promising potential for clinical application. Herein, we propose a novel flexible scoring card method (FSCM) making use of propensity scores of local and global sequential information for the development of a sequence-based ACP predictor (named iACP-FSCM) for improving the prediction accuracy and model interpretability. To the best of our knowledge, iACP-FSCM represents the first sequence-based ACP predictor for rationalizing an in-depth understanding into the molecular basis for the enhancement of anticancer activities of peptides via the use of FSCM-derived propensity scores. The independent testing results showed that the iACP-FSCM provided accuracies of 0.825 and 0.910 as evaluated on the main and alternative datasets, respectively. Results from comparative benchmarking demonstrated that iACP-FSCM could outperform seven other existing ACP predictors with marked improvements of 7% and 17% for accuracy and MCC, respectively, on the main dataset. Furthermore, the iACP-FSCM (0.910) achieved very comparable results to that of the state-of-the-art ensemble model AntiCP2.0 (0.920) as evaluated on the alternative dataset. Comparative results demonstrated that iACP-FSCM was the most suitable choice for ACP identification and characterization considering its simplicity, interpretability and generalizability. It is highly anticipated that the iACP-FSCM may be a robust tool for the rapid screening and identification of promising ACPs for clinical use. Nature Research 2021-02-04 Article PeerReviewed Charoenkwan, Phasit and Chiangjong, Wararat and Lee, Vannajan Sanghiran and Nantasenamat, Chanin and Hasan, Md Mehedi and Shoombuatong, Watshara (2021) Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method. Scientific Reports, 11 (1). ISSN 2045-2322, DOI https://doi.org/10.1038/s41598-021-82513-9 <https://doi.org/10.1038/s41598-021-82513-9>. 10.1038/s41598-021-82513-9
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic Q Science (General)
QD Chemistry
spellingShingle Q Science (General)
QD Chemistry
Charoenkwan, Phasit
Chiangjong, Wararat
Lee, Vannajan Sanghiran
Nantasenamat, Chanin
Hasan, Md Mehedi
Shoombuatong, Watshara
Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
description As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descriptors together with complex ensemble approaches that consequently leads to low interpretability and thus poses a challenge for biologists and biochemists. Therefore, it is desirable to develop a simple, interpretable and efficient predictor for accurate ACP identification as well as providing the means for the rational design of new anticancer peptides with promising potential for clinical application. Herein, we propose a novel flexible scoring card method (FSCM) making use of propensity scores of local and global sequential information for the development of a sequence-based ACP predictor (named iACP-FSCM) for improving the prediction accuracy and model interpretability. To the best of our knowledge, iACP-FSCM represents the first sequence-based ACP predictor for rationalizing an in-depth understanding into the molecular basis for the enhancement of anticancer activities of peptides via the use of FSCM-derived propensity scores. The independent testing results showed that the iACP-FSCM provided accuracies of 0.825 and 0.910 as evaluated on the main and alternative datasets, respectively. Results from comparative benchmarking demonstrated that iACP-FSCM could outperform seven other existing ACP predictors with marked improvements of 7% and 17% for accuracy and MCC, respectively, on the main dataset. Furthermore, the iACP-FSCM (0.910) achieved very comparable results to that of the state-of-the-art ensemble model AntiCP2.0 (0.920) as evaluated on the alternative dataset. Comparative results demonstrated that iACP-FSCM was the most suitable choice for ACP identification and characterization considering its simplicity, interpretability and generalizability. It is highly anticipated that the iACP-FSCM may be a robust tool for the rapid screening and identification of promising ACPs for clinical use.
format Article
author Charoenkwan, Phasit
Chiangjong, Wararat
Lee, Vannajan Sanghiran
Nantasenamat, Chanin
Hasan, Md Mehedi
Shoombuatong, Watshara
author_facet Charoenkwan, Phasit
Chiangjong, Wararat
Lee, Vannajan Sanghiran
Nantasenamat, Chanin
Hasan, Md Mehedi
Shoombuatong, Watshara
author_sort Charoenkwan, Phasit
title Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_short Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_full Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_fullStr Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_full_unstemmed Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_sort improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
publisher Nature Research
publishDate 2021
url http://eprints.um.edu.my/28336/
_version_ 1740826004319895552
score 13.160551