DeepSleep: Fast and Accurate Delineation of Sleep Arousals at Millisecond Resolution by Deep Learning

Mapping Intimacies ◽

10.1101/859256 ◽

2019 ◽

Cited By ~ 1

Author(s):

Hongyang Li ◽

Yuanfang Guan

Keyword(s):

Deep Learning ◽

Operating Characteristic ◽

Characteristic Curve ◽

Learning Approach ◽

Considerable Time ◽

Negative Effects ◽

High Quality ◽

Data Points ◽

Sleep Arousal ◽

Operating Characteristic Curve

AbstractSleep arousals are transient periods of wakefulness punctuated into sleep. Excessive sleep arousals are associated with many negative effects including daytime sleepiness and sleep disorders. High-quality annotation of polysomnographic recordings is crucial for the diagnosis of sleep arousal disorders. Currently, sleep arousals are mainly annotated by human experts through looking at millions of data points manually, which requires considerable time and effort. Here we present a deep learning approach, DeepSleep, which ranked first in the 2018 PhysioNet Challenge for automatically segmenting sleep arousal regions based on polysomnographic recordings. DeepSleep features accurate (area under receiver operating characteristic curve of 0.93), high-resolution (5-millisecond resolution), and fast (10 seconds per sleep record) delineation of sleep arousals.

Download Full-text

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction

Journal of Computer-Aided Molecular Design ◽

10.1007/s10822-019-00274-0 ◽

2020 ◽

Vol 34 (7) ◽

pp. 717-730 ◽

Cited By ~ 9

Author(s):

Matthew C. Robinson ◽

Robert C. Glen ◽

Alpha A. Lee

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Numerical Experiments ◽

Large Scale ◽

Operating Characteristic ◽

Characteristic Curve ◽

Learning Models ◽

Bioactivity Prediction ◽

Operating Characteristic Curve ◽

Machine Learning Models

Abstract Machine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity prediction and arrive at a somewhat different conclusion. We show that the performance of support vector machines is competitive with that of deep learning methods. Additionally, using a series of numerical experiments, we question the relevance of area under the receiver operating characteristic curve as a metric in virtual screening. We further suggest that area under the precision–recall curve should be used in conjunction with the receiver operating characteristic curve. Our numerical experiments also highlight challenges in estimating the uncertainty in model performance via scaffold-split nested cross validation.

Download Full-text

Postload Glycated Albumin as an Alternate Measure for Diabetes Screening in a Chinese Population

Journal of Diabetes Research ◽

10.1155/2018/7932528 ◽

2018 ◽

Vol 2018 ◽

pp. 1-7 ◽

Cited By ~ 1

Author(s):

Hang Su ◽

Junling Tang ◽

Xiaojing Ma ◽

Xingxing He ◽

Lingwen Ying ◽

...

Keyword(s):

Operating Characteristic ◽

Characteristic Curve ◽

Glycated Albumin ◽

Fasting State ◽

Diabetes Screening ◽

Limits Of Agreement ◽

Data Points ◽

Operating Characteristic Curve ◽

Good Agreement ◽

Time Point

In previous epidemiological screening in China, glycated albumin (GA) was mostly detected during the fasting state. This strict restriction causes some problems with diabetes screening. It is unclear if GA could help improve the efficiency of screening for diabetes for subjects who are not in the fasting state. The present study analyzed the differences between fasting and postload (30, 60, 120, and 180 min) GA levels. A total of 691 participants were enrolled in the present study. The Bland-Altman difference plots revealed that 95.4, 94.8, 93.6, and 93.9% of data points were within the limits of agreement for each time point. The receiver operating characteristic curve showed that the areas under the curve (AUC) for baseline GA and postload GA for every time point were 0.822 (95% CI 0.791–0.849), 0.821 (95% CI 0.790–0.848), 0.833 (95% CI 0.803–0.860), 0.840 (95% CI 0.811–0.867), and 0.840 (95% CI 0.810–0.867), with sensitivities of 67.5, 68.1, 69.3, 71.6, and 69.3%, respectively. There was no difference between the baseline and postload GA levels in either AUC or sensitivity (all p>0.05). In conclusion, postload serum GA levels were in good agreement with those at baseline, and thus, it may be reasonable to employ nonfasting measurements of GA levels for diabetes screening.

Download Full-text

Clinical application of artificial intelligence-assisted diagnosis using anteroposterior pelvic radiographs in children with developmental dysplasia of the hip

The Bone & Joint Journal ◽

10.1302/0301-620x.102b11.bjj-2020-0712.r2 ◽

2020 ◽

Vol 102-B (11) ◽

pp. 1574-1581

Author(s):

Si-Cheng Zhang ◽

Jun Sun ◽

Chuan-Bin Liu ◽

Ji-Hong Fang ◽

Hong-Tao Xie ◽

...

Keyword(s):

Artificial Intelligence ◽

Deep Learning ◽

Receiver Operating Characteristic Curve ◽

Operating Characteristic ◽

Characteristic Curve ◽

Developmental Dysplasia ◽

Learning System ◽

Pelvic Radiographs ◽

Operating Characteristic Curve ◽

Dysplasia Of The Hip

Aims The diagnosis of developmental dysplasia of the hip (DDH) is challenging owing to extensive variation in paediatric pelvic anatomy. Artificial intelligence (AI) may represent an effective diagnostic tool for DDH. Here, we aimed to develop an anteroposterior pelvic radiograph deep learning system for diagnosing DDH in children and analyze the feasibility of its application. Methods In total, 10,219 anteroposterior pelvic radiographs were retrospectively collected from April 2014 to December 2018. Clinicians labelled each radiograph using a uniform standard method. Radiographs were grouped according to age and into ‘dislocation’ (dislocation and subluxation) and ‘non-dislocation’ (normal cases and those with dysplasia of the acetabulum) groups based on clinical diagnosis. The deep learning system was trained and optimized using 9,081 radiographs; 1,138 test radiographs were then used to compare the diagnoses made by deep learning system and clinicians. The accuracy of the deep learning system was determined using a receiver operating characteristic curve, and the consistency of acetabular index measurements was evaluated using Bland-Altman plots. Results In all, 1,138 patients (242 males; 896 females; mean age 1.5 years (SD 1.79; 0 to 10) were included in this study. The area under the receiver operating characteristic curve, sensitivity, and specificity of the deep learning system for diagnosing hip dislocation were 0.975, 276/289 (95.5%), and 1,978/1,987 (99.5%), respectively. Compared with clinical diagnoses, the Bland-Altman 95% limits of agreement for acetabular index, as determined by the deep learning system from the radiographs of non-dislocated and dislocated hips, were -3.27° - 2.94° and -7.36° - 5.36°, respectively (p < 0.001). Conclusion The deep learning system was highly consistent, more convenient, and more effective for diagnosing DDH compared with clinician-led diagnoses. Deep learning systems should be considered for analysis of anteroposterior pelvic radiographs when diagnosing DDH. The deep learning system will improve the current artificially complicated screening referral process. Cite this article: Bone Joint J 2020;102-B(11):1574–1581.

Download Full-text

Recalibration of deep learning models for abnormality detection in smartphone-captured chest radiograph

npj Digital Medicine ◽

10.1038/s41746-021-00393-9 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Po-Chih Kuo ◽

Cheng Che Tsai ◽

Diego M. López ◽

Alexandros Karargyris ◽

Tom J. Pollard ◽

...

Keyword(s):

Deep Learning ◽

Operating Characteristic ◽

Characteristic Curve ◽

Model Performance ◽

X Rays ◽

Abnormality Detection ◽

Learning Models ◽

Radiological Findings ◽

Operating Characteristic Curve ◽

Uncalibrated Model

AbstractImage-based teleconsultation using smartphones has become increasingly popular. In parallel, deep learning algorithms have been developed to detect radiological findings in chest X-rays (CXRs). However, the feasibility of using smartphones to automate this process has yet to be evaluated. This study developed a recalibration method to build deep learning models to detect radiological findings on CXR photographs. Two publicly available databases (MIMIC-CXR and CheXpert) were used to build the models, and four derivative datasets containing 6453 CXR photographs were collected to evaluate model performance. After recalibration, the model achieved areas under the receiver operating characteristic curve of 0.80 (95% confidence interval: 0.78–0.82), 0.88 (0.86–0.90), 0.81 (0.79–0.84), 0.79 (0.77–0.81), 0.84 (0.80–0.88), and 0.90 (0.88–0.92), respectively, for detecting cardiomegaly, edema, consolidation, atelectasis, pneumothorax, and pleural effusion. The recalibration strategy, respectively, recovered 84.9%, 83.5%, 53.2%, 57.8%, 69.9%, and 83.0% of performance losses of the uncalibrated model. We conclude that the recalibration method can transfer models from digital CXRs to CXR photographs, which is expected to help physicians’ clinical works.

Download Full-text

Analysis of shoulder MR imaging using Receiver Operating Characteristic curve

Journal of the Korean Radiological Society ◽

10.3348/jkrs.1998.38.4.723 ◽

1998 ◽

Vol 38 (4) ◽

pp. 723

Author(s):

Yoon Joon Hwang ◽

Jin Suck Suh ◽

Jae Hyun Cho

Keyword(s):

Mr Imaging ◽

Receiver Operating Characteristic Curve ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Characteristic Curve ◽

Operating Characteristic Curve ◽

Receiver Operating

Download Full-text

Serum miR-21 and miR-26a Levels Negatively Correlate with Severity of Cirrhosis in Patients with Chronic Hepatitis B

MicroRNA ◽

10.2174/2211536607666180821162850 ◽

2018 ◽

Vol 8 (1) ◽

pp. 86-92 ◽

Cited By ~ 2

Author(s):

Shili Jiang ◽

Wei Jiang ◽

Ying Xu ◽

Xiaoning Wang ◽

Yongping Mu ◽

...

Keyword(s):

Chronic Hepatitis ◽

Hepatitis B ◽

Chronic Hepatitis B ◽

Operating Characteristic ◽

Characteristic Curve ◽

Curve Analysis ◽

Pugh Class ◽

Operating Characteristic Curve ◽

Class C ◽

Circulating Levels

Background and Objective: Accurately evaluating the severity of liver cirrhosis is essential for clinical decision making and disease management. This study aimed to evaluate the value of circulating levels of microRNA (miR)-26a and miR-21 as novel noninvasive biomarkers in detecting severity of cirrhosis in patients with chronic hepatitis B. </P><P> Methods: Thirty patients with clinically diagnosed chronic hepatitis B-related cirrhosis and 30 healthy individuals were selected. The serum levels of miR-26a and miR-21 were quantified by qRT-PCR. Receiver operating characteristic curve analysis was performed to evaluate the sensitivity and specificity of the miRNAs for detecting the severity of cirrhosis. Results: Serum miR-26a and miR-21 levels were found to be significantly downregulated in patients with severe cirrhosis scored at Child-Pugh class C in comparison to healthy controls (miR-26a p<0.01, and miR-21 p<0.001, respectively). The circulating miR-26a and miR-21 levels in patients were positively correlated with serum albumin concentration but negatively correlated with serum total bilirubin concentration and prothrombin time. Receiver operating characteristic curve analysis revealed that both serum miR-26a and miR-21 levels were associated with a high diagnostic accuracy for patients with cirrhosis scored at Child-Pugh class C (miR-26a Cut-off fold change at ≤0.4, Sensitivity: 84.62%, Specificity: 89.36%, P<0.0001; miR-21 Cut-off fold change at ≤0.6, Sensitivity: 84.62%, Specificity: 78.72%, P<0.0001). Our results indicate that the circulating levels of miR-26a and miR-21 are closely related to the extent of liver decompensation, and the decreased levels are capable of discriminating patients with cirrhosis at Child-Pugh class C from the whole cirrhosis cases.

Download Full-text

A validated novel preoperative index to predict the extent of intraperitoneal contamination in patients with acute abdominal pathology: A cohort study

Journal of Perioperative Practice ◽

10.1177/1750458919875592 ◽

2019 ◽

Vol 30 (7-8) ◽

pp. 221-228

Author(s):

Shahab Hajibandeh ◽

Shahin Hajibandeh ◽

Nicholas Hobbs ◽

Jigar Shah ◽

Matthew Harris ◽

...

Keyword(s):

Receiver Operating Characteristic Curve ◽

Receiver Operating Characteristic ◽

Operating Characteristic ◽

Validation Cohort ◽

Characteristic Curve ◽

Curve Analysis ◽

Emergency Laparotomy ◽

Contamination Index ◽

Operating Characteristic Curve ◽

Receiver Operating

Aims To investigate whether an intraperitoneal contamination index (ICI) derived from combined preoperative levels of C-reactive protein, lactate, neutrophils, lymphocytes and albumin could predict the extent of intraperitoneal contamination in patients with acute abdominal pathology. Methods Patients aged over 18 who underwent emergency laparotomy for acute abdominal pathology between January 2014 and October 2018 were randomly divided into primary and validation cohorts. The proposed intraperitoneal contamination index was calculated for each patient in each cohort. Receiver operating characteristic curve analysis was performed to determine discrimination of the index and cut-off values of preoperative intraperitoneal contamination index that could predict the extent of intraperitoneal contamination. Results Overall, 468 patients were included in this study; 234 in the primary cohort and 234 in the validation cohort. The analyses identified intraperitoneal contamination index of 24.77 and 24.32 as cut-off values for purulent contamination in the primary cohort (area under the curve (AUC): 0.73, P < 0.0001; sensitivity: 84%, specificity: 60%) and validation cohort (AUC: 0.83, P < 0.0001; sensitivity: 91%, specificity: 69%), respectively. Receiver operating characteristic curve analysis also identified intraperitoneal contamination index of 33.70 and 33.41 as cut-off values for feculent contamination in the primary cohort (AUC: 0.78, P < 0.0001; sensitivity: 87%, specificity: 64%) and validation cohort (AUC: 0.79, P < 0.0001; sensitivity: 86%, specificity: 73%), respectively. Conclusions As a predictive measure which is derived purely from biomarkers, intraperitoneal contamination index may be accurate enough to predict the extent of intraperitoneal contamination in patients with acute abdominal pathology and to facilitate decision-making together with clinical and radiological findings.

Download Full-text

Development and Verification of a Deep Learning Algorithm to Evaluate Small-Bowel Preparation Quality

Diagnostics ◽

10.3390/diagnostics11061127 ◽

2021 ◽

Vol 11 (6) ◽

pp. 1127

Author(s):

Ji Hyung Nam ◽

Dong Jun Oh ◽

Sumin Lee ◽

Hyun Joo Song ◽

Yun Jeong Lim

Keyword(s):

Deep Learning ◽

Small Bowel ◽

Scoring System ◽

Operating Characteristic ◽

Clinical Evidence ◽

Learning Algorithm ◽

Characteristic Curve ◽

External Validation ◽

Test Results ◽

Deep Learning Algorithm

Capsule endoscopy (CE) quality control requires an objective scoring system to evaluate the preparation of the small bowel (SB). We propose a deep learning algorithm to calculate SB cleansing scores and verify the algorithm’s performance. A 5-point scoring system based on clarity of mucosal visualization was used to develop the deep learning algorithm (400,000 frames; 280,000 for training and 120,000 for testing). External validation was performed using additional CE cases (n = 50), and average cleansing scores (1.0 to 5.0) calculated using the algorithm were compared to clinical grades (A to C) assigned by clinicians. Test results obtained using 120,000 frames exhibited 93% accuracy. The separate CE case exhibited substantial agreement between the deep learning algorithm scores and clinicians’ assessments (Cohen’s kappa: 0.672). In the external validation, the cleansing score decreased with worsening clinical grade (scores of 3.9, 3.2, and 2.5 for grades A, B, and C, respectively, p < 0.001). Receiver operating characteristic curve analysis revealed that a cleansing score cut-off of 2.95 indicated clinically adequate preparation. This algorithm provides an objective and automated cleansing score for evaluating SB preparation for CE. The results of this study will serve as clinical evidence supporting the practical use of deep learning algorithms for evaluating SB preparation quality.

Download Full-text

Evaluation of factors that predict the success rate of trial of labor after the cesarean section

BMC Pregnancy and Childbirth ◽

10.1186/s12884-021-04004-z ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Yang Mi ◽

Pengfei Qu ◽

Na Guo ◽

Ruimiao Bai ◽

Jiayi Gao ◽

...

Keyword(s):

Logistic Regression ◽

Cesarean Section ◽

Receiver Operating Characteristic Curve ◽

Success Rate ◽

Operating Characteristic ◽

Characteristic Curve ◽

Predictive Ability ◽

Training Set ◽

History Of ◽

Operating Characteristic Curve

Abstract Background For most women who have had a previous cesarean section, vaginal birth after cesarean section (VBAC) is a reasonable and safe choice, but which will increase the risk of adverse outcomes such as uterine rupture. In order to reduce the risk, we evaluated the factors that may affect VBAC and and established a model for predicting the success rate of trial of the labor after cesarean section (TOLAC). Methods All patients who gave birth at Northwest Women’s and Children’s Hospital from January 2016 to December 2018, had a history of cesarean section and voluntarily chose the TOLAC were recruited. Among them, 80% of the population was randomly assigned to the training set, while the remaining 20% were assigned to the external validation set. In the training set, univariate and multivariate logistic regression models were used to identify indicators related to successful TOLAC. A nomogram was constructed based on the results of multiple logistic regression analysis, and the selected variables included in the nomogram were used to predict the probability of successfully obtaining TOLAC. The area under the receiver operating characteristic curve was used to judge the predictive ability of the model. Results A total of 778 pregnant women were included in this study. Among them, 595 (76.48%) successfully underwent TOLAC, whereas 183 (23.52%) failed and switched to cesarean section. In multi-factor logistic regression, parity = 1, pre-pregnancy BMI < 24 kg/m2, cervical score ≥ 5, a history of previous vaginal delivery and neonatal birthweight < 3300 g were associated with the success of TOLAC. The area under the receiver operating characteristic curve in the prediction and validation models was 0.815 (95% CI: 0.762–0.854) and 0.730 (95% CI: 0.652–0.808), respectively, indicating that the nomogram prediction model had medium discriminative power. Conclusion The TOLAC was useful to reducing the cesarean section rate. Being primiparous, not overweight or obese, having a cervical score ≥ 5, a history of previous vaginal delivery or neonatal birthweight < 3300 g were protective indicators. In this study, the validated model had an approving predictive ability.

Download Full-text

The gut hormone GLP-2 predicts cardiovascular risk in patients with acute myocardial infarction

European Heart Journal ◽

10.1093/ehjci/ehaa946.1592 ◽

2020 ◽

Vol 41 (Supplement_2) ◽

Author(s):

F Kahles ◽

R.W Mertens ◽

M.V Rueckbeil ◽

M.C Arrivas ◽

J Moellmann ◽

...

Keyword(s):

Myocardial Infarction ◽

Cardiovascular Disease ◽

Acute Myocardial Infarction ◽

Receiver Operating Characteristic Curve ◽

Operating Characteristic ◽

Characteristic Curve ◽

Funding Source ◽

Cardiovascular Prognosis ◽

Operating Characteristic Curve ◽

Hs Crp

Abstract Background GLP-1 and GLP-2 (glucagon-like peptide-1/2) are gut derived hormones that are co-secreted from intestinal L-cells in response to food intake. While GLP-1 is known to induce postprandial insulin secretion, GLP-2 enhances intestinal nutrient absorption and is clinically used for the treatment of patients with short bowel syndrome. The relevance of the GLP-2 system for cardiovascular disease is unknown. Purpose The aim of this study was to assess the predictive capacity of GLP-2 for cardiovascular prognosis in patients with myocardial infarction. Methods Total GLP-2 levels, NT-proBNP concentrations and the Global Registry of Acute Coronary Events (GRACE) score were assessed at time of admission in 918 patients with myocardial infarction, among them 597 patients with NSTEMI and 321 with STEMI. The primary composite outcome of the study was the first occurrence of cardiovascular death, nonfatal myocardial infarction, or nonfatal stroke (3-P-MACE) with a median follow-up of 311 days. Results Kaplan-Meier survival plots (separated by the median of GLP-2 with a cut-off value of 4.4 ng/mL) and univariable cox regression analyses found GLP-2 values to be associated with adverse outcome (logarithmized GLP-2 values HR: 2.87; 95% CI: 1.75–4.68; p<0.0001). Further adjustment for age, sex, smoking, hypertension, hypercholesterolemia, diabetes mellitus, family history of cardiovascular disease, hs-Troponin T, NT-proBNP and hs-CRP levels did not affect the association of GLP-2 with poor prognosis (logarithmized GLP-2 values HR: 2.96; 95% CI: 1.38–6.34; p=0.0053). Receiver operating characteristic curve (ROC) analyses illustrated that GLP-2 is a strong indicator for cardiovascular events and proved to be comparable to other established risk markers (area under the curve of the combined endpoint at 6 months; GLP-2: 0.72; hs-Troponin: 0.56; NT-proBNP: 0.70; hs-CRP: 0.62). Adjustment of the GRACE risk estimate by GLP-2 increased the area under the receiver-operating characteristic curve for the combined triple endpoint after 6 months from 0.70 (GRACE) to 0.75 (GRACE + GLP-2) in NSTEMI patients. Addition of GLP-2 to a model containing GRACE and NT-proBNP led to a further improvement in model performance (increase in AUC from 0.72 for GRACE + NT-proBNP to 0.77 for GRACE + NT-proBNP + GLP-2). Conclusions In patients admitted with acute myocardial infarction, GLP-2 levels are associated with adverse cardiovascular prognosis. This demonstrates a strong yet not appreciated crosstalk between the heart and the gut with relevance for cardiovascular outcome. Future studies are needed to further explore this crosstalk with the possibility of new treatment avenues for cardiovascular disease. Funding Acknowledgement Type of funding source: Public grant(s) – National budget only. Main funding source(s): German Society of Cardiology (DGK), German Research Foundation (DFG)

Download Full-text