Comparison of partial least squares and random forests for evaluating relationship between phenolics and bioactivities of Neptunia oleracea
Background: Neptunia oleracea is a plant consumed as a vegetable and which has been used as a folk remedy for several diseases. Herein, two regression models (partial least squares, PLS; and random forest, RF) in a metabolomics approach were compared and applied to the evaluation of the relationship...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
John Wiley & Sons
2018
|
Online Access: | http://psasir.upm.edu.my/id/eprint/72072/1/Comparison%20of%20partial%20least%20squares%20and%20random%20forests%20for%20evaluating%20relationship%20between%20phenolics%20and%20bioactivities%20of%20Neptunia%20oleracea.pdf http://psasir.upm.edu.my/id/eprint/72072/ https://onlinelibrary.wiley.com/doi/full/10.1002/jsfa.8462 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Background: Neptunia oleracea is a plant consumed as a vegetable and which has been used as a folk remedy for several diseases. Herein, two regression models (partial least squares, PLS; and random forest, RF) in a metabolomics approach were compared and applied to the evaluation of the relationship between phenolics and bioactivities of N. oleracea. In addition, the effects of different extraction conditions on the phenolic constituents were assessed by pattern recognition analysis. Results: Comparison of the PLS and RF showed that RF exhibited poorer generalization and hence poorer predictive performance. Both the regression coefficient of PLS and the variable importance of RF revealed that quercetin and kaempferol derivatives, caffeic acid and vitexin-2-O-rhamnoside were significant towards the tested bioactivities. Furthermore, principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) results showed that sonication and absolute ethanol are the preferable extraction method and ethanol ratio, respectively, to produce N. oleracea extracts with high phenolic levels and therefore high DPPH scavenging and α-glucosidase inhibitory activities. Conclusion: Both PLS and RF are useful regression models in metabolomics studies. This work provides insight into the performance of different multivariate data analysis tools and the effects of different extraction conditions on the extraction of desired phenolics from plants. |
---|