Predicting high-cost care in a mental health setting

Craig Colling; Mizanur Khondoker; Rashmi Patel; Marcella Fok; Robert Harland; Matthew Broadbent; Paul McCrone; Robert Stewart

doi:10.1192/bjo.2019.96

Predicting high-cost care in a mental health setting

BJPsych Open ◽

10.1192/bjo.2019.96 ◽

2020 ◽

Vol 6 (1) ◽

Author(s):

Craig Colling ◽

Mizanur Khondoker ◽

Rashmi Patel ◽

Marcella Fok ◽

Robert Harland ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Receiver Operating Characteristic ◽

Language Processing ◽

Service Use ◽

Operating Characteristic ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

High Service ◽

Derived Data

Background The density of information in digital health records offers new potential opportunities for automated prediction of cost-relevant outcomes. Aims We investigated the extent to which routinely recorded data held in the electronic health record (EHR) predict priority service outcomes and whether natural language processing tools enhance the predictions. We evaluated three high priority outcomes: in-patient duration, readmission following in-patient care and high service cost after first presentation. Method We used data obtained from a clinical database derived from the EHR of a large mental healthcare provider within the UK. We combined structured data with text-derived data relating to diagnosis statements, medication and psychiatric symptomatology. Predictors of the three different clinical outcomes were modelled using logistic regression with performance evaluated against a validation set to derive areas under receiver operating characteristic curves. Results In validation samples, the full models (using all available data) achieved areas under receiver operating characteristic curves between 0.59 and 0.85 (in-patient duration 0.63, readmission 0.59, high service use 0.85). Adding natural language processing-derived data to the models increased the variance explained across all clinical scenarios (observed increase in r2 = 12–46%). Conclusions EHR data offer the potential to improve routine clinical predictions by utilising previously inaccessible data. Of our scenarios, prediction of high service use after initial presentation achieved the highest performance.

Download Full-text

Natural Language Processing Enhances Prediction of Functional Outcome After Acute Ischemic Stroke

Journal of the American Heart Association ◽

10.1161/jaha.121.023486 ◽

2021 ◽

Author(s):

Sheng‐Feng Sung ◽

Chih‐Hao Chen ◽

Ru‐Chiou Pan ◽

Ya‐Han Hu ◽

Jiann‐Shing Jeng

Keyword(s):

Ischemic Stroke ◽

Natural Language Processing ◽

Natural Language ◽

Acute Ischemic Stroke ◽

Receiver Operating Characteristic ◽

Functional Outcome ◽

Language Processing ◽

Operating Characteristic ◽

Receiver Operating Characteristic Curves ◽

Receiver Operating

Background Conventional prognostic scores usually require predefined clinical variables to predict outcome. The advancement of natural language processing has made it feasible to derive meaning from unstructured data. We aimed to test whether using unstructured text in electronic health records can improve the prediction of functional outcome after acute ischemic stroke. Methods and Results Patients hospitalized for acute ischemic stroke were identified from 2 hospital stroke registries (3847 and 2668 patients, respectively). Prediction models developed using the first cohort were externally validated using the second cohort, and vice versa. Free text in the history of present illness and computed tomography reports was used to build machine learning models using natural language processing to predict poor functional outcome at 90 days poststroke. Four conventional prognostic models were used as baseline models. The area under the receiver operating characteristic curves of the model using history of present illness in the internal and external validation sets were 0.820 and 0.792, respectively, which were comparable to the National Institutes of Health Stroke Scale score (0.811 and 0.807). The model using computed tomography reports achieved area under the receiver operating characteristic curves of 0.758 and 0.658. Adding information from clinical text significantly improved the predictive performance of each baseline model in terms of area under the receiver operating characteristic curves, net reclassification improvement, and integrated discrimination improvement indices (all P <0.001). Swapping the study cohorts led to similar results. Conclusions By using natural language processing, unstructured text in electronic health records can provide an alternative tool for stroke prognostication, and even enhance the performance of existing prognostic scores.

Download Full-text

Coarse-Grained Sentiment Analysis Berbasis Natural Language Processing – Ulasan Hotel

Jurnal Nasional Teknik Elektro dan Teknologi Informasi (JNTETI) ◽

10.22146/jnteti.v10i1.548 ◽

2021 ◽

Vol 10 (1) ◽

pp. 41-48

Author(s):

Warnia Nengsih ◽

M. Mahrus Zein ◽

Nazifa Hayati

Keyword(s):

Natural Language Processing ◽

Random Forest ◽

Natural Language ◽

Receiver Operating Characteristic ◽

Sentiment Analysis ◽

Language Processing ◽

Operating Characteristic ◽

Coarse Grained ◽

Receiver Operating

Sentiment analysis adalah metode untuk memperoleh data dari berbagai platform yang tersedia di internet. Kemajuan teknologi memungkinkan mesin untuk mengenali suatu istilah yang dianggap sebagai opini positif maupun sebaliknya. Data-data dan opini tersebut berperan penting sebagai umpan balik produk, layanan, dan topik lainnya. Tanpa perlu memperoleh opini secara langsung dari masyarakat, pihak penyedia telah mendapatkan evaluasi yang penting guna mengembangkan diri. Bisnis perhotelan merupakan bidang yang terkait dengan jasa memberikan layanan pada pelanggan. Indikator keberlangsungan bisnis ini juga bergantung pada umpan balik pelanggannya dan dijadikan sebagai acuan untuk pengambilan kebijakan strategis. Teknik sentiment analysis berbasis Natural Language Processing dapat mengatasi permasalahan tersebut. Pada makalah ini prediksi dilakukan menggunakan classifier Random Forest (RF), sementara untuk merangkum kualitas classifier, digunakan kurva Receiver Operating Characteristic (ROC). Kurva ROC berupa grafik yang baik untuk merangkum kualitas classifier. Semakin tinggi kurva berada di atas garis diagonal, semakin baik prediksinya, dengan nilai kurva ROC yang diperoleh sebesar 0,90. Terlihat hasil ulasan terhadap opini pelanggan terhadap jasa dan pelayanan yang diberikan oleh hotel untuk kategori positif lebih banyak daripada kategori negatif. Polaritas dari ulasan diperoleh 68% ulasan pelanggan berada pada area positif dan 32% berada pada area negatif.

Download Full-text

A conditional approach for the receiver operating characteristic curve construction to evaluate diagnostic test performance in a family-matched case–control design

Statistical Methods in Medical Research ◽

10.1177/0962280221995956 ◽

2021 ◽

pp. 096228022199595

Author(s):

Yalda Zarnegarnia ◽

Shari Messinger

Keyword(s):

Receiver Operating Characteristic Curve ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Characteristic Curve ◽

Case Control ◽

Paired Data ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Operating Characteristic Curve ◽

Receiver Operating

Receiver operating characteristic curves are widely used in medical research to illustrate biomarker performance in binary classification, particularly with respect to disease or health status. Study designs that include related subjects, such as siblings, usually have common environmental or genetic factors giving rise to correlated biomarker data. The design could be used to improve detection of biomarkers informative of increased risk, allowing initiation of treatment to stop or slow disease progression. Available methods for receiver operating characteristic construction do not take advantage of correlation inherent in this design to improve biomarker performance. This paper will briefly review some developed methods for receiver operating characteristic curve estimation in settings with correlated data from case–control designs and will discuss the limitations of current methods for analyzing correlated familial paired data. An alternative approach using conditional receiver operating characteristic curves will be demonstrated. The proposed approach will use information about correlation among biomarker values, producing conditional receiver operating characteristic curves that evaluate the ability of a biomarker to discriminate between affected and unaffected subjects in a familial paired design.

Download Full-text

The length of the receiver operating characteristic curve and the two cutoff Youden index within a robust framework for discovery, evaluation, and cutoff estimation in biomarker studies involving improper receiver operating characteristic curves

Statistics in Medicine ◽

10.1002/sim.8869 ◽

2021 ◽

Author(s):

Leonidas E. Bantis ◽

John V. Tsimikas ◽

Gregory R. Chambers ◽

Michela Capello ◽

Samir Hanash ◽

...

Keyword(s):

Receiver Operating Characteristic Curve ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Characteristic Curve ◽

Youden Index ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Operating Characteristic Curve ◽

Robust Framework ◽

Receiver Operating

Download Full-text

Three Approaches to Regression Analysis of Receiver Operating Characteristic Curves for Continuous Test Results

Biometrics ◽

10.2307/2534001 ◽

1998 ◽

Vol 54 (1) ◽

pp. 124 ◽

Cited By ~ 101

Author(s):

Margaret Sullivan Pepe

Keyword(s):

Regression Analysis ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Test Results ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Continuous Test ◽

Receiver Operating

Download Full-text

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves

Journal of Data Science ◽

10.6339/jds.2008.06(1).368 ◽

2021 ◽

Vol 6 (1) ◽

pp. 1-13

Author(s):

Feng Gao

Keyword(s):

Receiver Operating Characteristic ◽

Linear Combination ◽

Diagnostic Tests ◽

Operating Characteristic ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Receiver Operating

Download Full-text

Receiver-operating characteristic curves and likelihood ratios: improvements over traditional methods for the evaluation and application of veterinary clinical pathology tests

Veterinary Clinical Pathology ◽

10.1111/j.1939-165x.2006.tb00082.x ◽

2006 ◽

Vol 35 (1) ◽

pp. 8-17 ◽

Cited By ~ 137

Author(s):

Ian A. Gardner ◽

Matthias Greiner

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Clinical Pathology ◽

Likelihood Ratios ◽

Traditional Methods ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Receiver Operating

Download Full-text

Construction of Receiver Operating Characteristic Curves when Disease Verification Is Subject to Selection Bias

Medical Decision Making ◽

10.1177/0272989x8400400204 ◽

1984 ◽

Vol 4 (2) ◽

pp. 151-164 ◽

Cited By ~ 79

Author(s):

Robert Gray ◽

Colin B. Begg ◽

Robert A. Greenes

Keyword(s):

Receiver Operating Characteristic ◽

Selection Bias ◽

Operating Characteristic ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Receiver Operating

Download Full-text

A Unified Approach to Nonparametric Comparison of Receiver Operating Characteristic Curves for Longitudinal and Clustered Data

Journal of the American Statistical Association ◽

10.1198/016214508000000364 ◽

2008 ◽

Vol 103 (482) ◽

pp. 705-713 ◽

Cited By ~ 18

Author(s):

Gang Li ◽

Kefei Zhou

Keyword(s):

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Clustered Data ◽

Unified Approach ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Nonparametric Comparison ◽

Receiver Operating

Download Full-text

Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks

Statistics in Medicine ◽

10.1002/sim.5958 ◽

2013 ◽

Vol 32 (30) ◽

pp. 5381-5397 ◽

Cited By ~ 255

Author(s):

Paul Blanche ◽

Jean-François Dartigues ◽

Hélène Jacqmin-Gadda

Keyword(s):

Receiver Operating Characteristic ◽

Competing Risks ◽

Operating Characteristic ◽

Time Dependent ◽

Characteristic Curves ◽

Receiver Operating Characteristic Curves ◽

Event Times ◽

Receiver Operating

Download Full-text