Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation

Hospital readmission shortly after discharge is threatening to plague the quality of inpatient care. Readmission is a severe episode that leads to increased medical care costs. Federal regulations and early readmission penalties have created an incentive for healthcare facilities to reduce their rea...

Full description

Saved in:
Bibliographic Details
Main Authors: Teo, Kareen, Yong, Ching Wai, Chuah, Joon Huang, Murphy, Belinda Pingguan, Lai, Khin Wee
Format: Article
Published: American Scientific Publishers 2020
Subjects:
Online Access:http://eprints.um.edu.my/36246/
https://www.webofscience.com/wos/woscc/full-record/WOS:000565930500013
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.36246
record_format eprints
spelling my.um.eprints.362462023-11-29T06:38:35Z http://eprints.um.edu.my/36246/ Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation Teo, Kareen Yong, Ching Wai Chuah, Joon Huang Murphy, Belinda Pingguan Lai, Khin Wee T Technology (General) TK Electrical engineering. Electronics Nuclear engineering Hospital readmission shortly after discharge is threatening to plague the quality of inpatient care. Readmission is a severe episode that leads to increased medical care costs. Federal regulations and early readmission penalties have created an incentive for healthcare facilities to reduce their readmission rates by predicting patients at a high risk of readmission. Scientists have developed prediction models by using rule-based assessment scores and traditional statistical methods, and most have focused on structured patient records. Recently, a few researchers utilized unstructured clinical notes. However, they achieved moderate prediction accuracy by making predictions of a single diagnosis subpopulation via extensive feature engineering. This study proposes the use of machine learning to learn deep representation of patient notes for the identification of high-risk readmission in a hospital-wide population. We describe and train several predictive models (standard machine learning and neural network), to which several setups have not been applied. Results show that complex deep learning models significantly outperform (P<0.001) conventionally applied simple models in terms of discrimination ability. We also demonstrate a simple feature evaluation using a standard model, which allows the determination of potential clinical conditions/procedures for targeting. Unlike modeling using structured patient information with considerable variability in structure when different templates or databases are adopted, this study shows that the machine learning approach can be applied to prognosticate readmission with clinical free text in various healthcare settings. Using minimum feature engineering, the trained models perform comparably well or better than other predictive models established in previous literature. American Scientific Publishers 2020-12 Article PeerReviewed Teo, Kareen and Yong, Ching Wai and Chuah, Joon Huang and Murphy, Belinda Pingguan and Lai, Khin Wee (2020) Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation. Journal of Medical Imaging and Health Informatics, 10 (12). pp. 2869-2875. ISSN 2156-7018, DOI https://doi.org/10.1166/jmihi.2020.3291 <https://doi.org/10.1166/jmihi.2020.3291>. https://www.webofscience.com/wos/woscc/full-record/WOS:000565930500013 10.1166/jmihi.2020.3291
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic T Technology (General)
TK Electrical engineering. Electronics Nuclear engineering
spellingShingle T Technology (General)
TK Electrical engineering. Electronics Nuclear engineering
Teo, Kareen
Yong, Ching Wai
Chuah, Joon Huang
Murphy, Belinda Pingguan
Lai, Khin Wee
Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
description Hospital readmission shortly after discharge is threatening to plague the quality of inpatient care. Readmission is a severe episode that leads to increased medical care costs. Federal regulations and early readmission penalties have created an incentive for healthcare facilities to reduce their readmission rates by predicting patients at a high risk of readmission. Scientists have developed prediction models by using rule-based assessment scores and traditional statistical methods, and most have focused on structured patient records. Recently, a few researchers utilized unstructured clinical notes. However, they achieved moderate prediction accuracy by making predictions of a single diagnosis subpopulation via extensive feature engineering. This study proposes the use of machine learning to learn deep representation of patient notes for the identification of high-risk readmission in a hospital-wide population. We describe and train several predictive models (standard machine learning and neural network), to which several setups have not been applied. Results show that complex deep learning models significantly outperform (P<0.001) conventionally applied simple models in terms of discrimination ability. We also demonstrate a simple feature evaluation using a standard model, which allows the determination of potential clinical conditions/procedures for targeting. Unlike modeling using structured patient information with considerable variability in structure when different templates or databases are adopted, this study shows that the machine learning approach can be applied to prognosticate readmission with clinical free text in various healthcare settings. Using minimum feature engineering, the trained models perform comparably well or better than other predictive models established in previous literature.
format Article
author Teo, Kareen
Yong, Ching Wai
Chuah, Joon Huang
Murphy, Belinda Pingguan
Lai, Khin Wee
author_facet Teo, Kareen
Yong, Ching Wai
Chuah, Joon Huang
Murphy, Belinda Pingguan
Lai, Khin Wee
author_sort Teo, Kareen
title Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
title_short Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
title_full Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
title_fullStr Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
title_full_unstemmed Discovering the Predictive Value of Clinical Notes: Machine Learning Analysis with Text Representation
title_sort discovering the predictive value of clinical notes: machine learning analysis with text representation
publisher American Scientific Publishers
publishDate 2020
url http://eprints.um.edu.my/36246/
https://www.webofscience.com/wos/woscc/full-record/WOS:000565930500013
_version_ 1783945849550667776
score 13.211869