Predicting Mechanical Ventilation and Mortality in COVID-19 Using Radiomics and Deep Learning on Chest Radiographs: A Multi-Institutional Study

Joseph Bae; Saarthak Kapse; Gagandeep Singh; Rishabh Gattu; Syed Ali; Neal Shah; Colin Marshall; Jonathan Pierce; Tej Phatak; Amit Gupta; Jeremy Green; Nikhil Madan; Prateek Prasanna

doi:10.3390/diagnostics11101812

Predicting Mechanical Ventilation and Mortality in COVID-19 Using Radiomics and Deep Learning on Chest Radiographs: A Multi-Institutional Study

Diagnostics ◽

10.3390/diagnostics11101812 ◽

2021 ◽

Vol 11 (10) ◽

pp. 1812

Author(s):

Joseph Bae ◽

Saarthak Kapse ◽

Gagandeep Singh ◽

Rishabh Gattu ◽

Syed Ali ◽

...

Keyword(s):

Mechanical Ventilation ◽

Deep Learning ◽

Discriminant Analysis ◽

Medical Center ◽

Characteristic Curve ◽

Chest Radiographs ◽

University Hospital ◽

Prognostic Information ◽

Linear Discriminant ◽

Severity Scores

In this study, we aimed to predict mechanical ventilation requirement and mortality using computational modeling of chest radiographs (CXRs) for coronavirus disease 2019 (COVID-19) patients. This two-center, retrospective study analyzed 530 deidentified CXRs from 515 COVID-19 patients treated at Stony Brook University Hospital and Newark Beth Israel Medical Center between March and August 2020. Linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and random forest (RF) machine learning classifiers to predict mechanical ventilation requirement and mortality were trained and evaluated using radiomic features extracted from patients’ CXRs. Deep learning (DL) approaches were also explored for the clinical outcome prediction task and a novel radiomic embedding framework was introduced. All results are compared against radiologist grading of CXRs (zone-wise expert severity scores). Radiomic classification models had mean area under the receiver operating characteristic curve (mAUCs) of 0.78 ± 0.05 (sensitivity = 0.72 ± 0.07, specificity = 0.72 ± 0.06) and 0.78 ± 0.06 (sensitivity = 0.70 ± 0.09, specificity = 0.73 ± 0.09), compared with expert scores mAUCs of 0.75 ± 0.02 (sensitivity = 0.67 ± 0.08, specificity = 0.69 ± 0.07) and 0.79 ± 0.05 (sensitivity = 0.69 ± 0.08, specificity = 0.76 ± 0.08) for mechanical ventilation requirement and mortality prediction, respectively. Classifiers using both expert severity scores and radiomic features for mechanical ventilation (mAUC = 0.79 ± 0.04, sensitivity = 0.71 ± 0.06, specificity = 0.71 ± 0.08) and mortality (mAUC = 0.83 ± 0.04, sensitivity = 0.79 ± 0.07, specificity = 0.74 ± 0.09) demonstrated improvement over either artificial intelligence or radiologist interpretation alone. Our results also suggest instances in which the inclusion of radiomic features in DL improves model predictions over DL alone. The models proposed in this study and the prognostic information they provide might aid physician decision making and efficient resource allocation during the COVID-19 pandemic.

Download Full-text

Automated Radiology Alert System for Pneumothorax Detection on Chest Radiographs Improves Efficiency and Diagnostic Performance

Diagnostics ◽

10.3390/diagnostics11071182 ◽

2021 ◽

Vol 11 (7) ◽

pp. 1182

Author(s):

Cheng-Yi Kao ◽

Chiao-Yun Lin ◽

Cheng-Chen Chao ◽

Han-Sheng Huang ◽

Hsing-Yu Lee ◽

...

Keyword(s):

Deep Learning ◽

Diagnostic Performance ◽

Medical Center ◽

Characteristic Curve ◽

Learning Model ◽

Chest Radiographs ◽

Alert System ◽

Before And After ◽

Set Up ◽

Deep Learning Model

We aimed to set up an Automated Radiology Alert System (ARAS) for the detection of pneumothorax in chest radiographs by a deep learning model, and to compare its efficiency and diagnostic performance with the existing Manual Radiology Alert System (MRAS) at the tertiary medical center. This study retrospectively collected 1235 chest radiographs with pneumothorax labeling from 2013 to 2019, and 337 chest radiographs with negative findings in 2019 were separated into training and validation datasets for the deep learning model of ARAS. The efficiency before and after using the model was compared in terms of alert time and report time. During parallel running of the two systems from September to October 2020, chest radiographs prospectively acquired in the emergency department with age more than 6 years served as the testing dataset for comparison of diagnostic performance. The efficiency was improved after using the model, with mean alert time improving from 8.45 min to 0.69 min and the mean report time from 2.81 days to 1.59 days. The comparison of the diagnostic performance of both systems using 3739 chest radiographs acquired during parallel running showed that the ARAS was better than the MRAS as assessed in terms of sensitivity (recall), area under receiver operating characteristic curve, and F1 score (0.837 vs. 0.256, 0.914 vs. 0.628, and 0.754 vs. 0.407, respectively), but worse in terms of positive predictive value (PPV) (precision) (0.686 vs. 1.000). This study had successfully designed a deep learning model for pneumothorax detection on chest radiographs and set up an ARAS with improved efficiency and overall diagnostic performance.

Download Full-text

Challenge for Diagnostic Assessment of Deep Learning Algorithm for Metastases Classification in Sentinel Lymph Nodes on Frozen Tissue Section Digital Slides in Women with Breast Cancer

10.21203/rs.2.23087/v1 ◽

2020 ◽

Author(s):

Young-Gon Kim ◽

In Hye Song ◽

Hyunna Lee ◽

Dong Hyun Yang ◽

Namkug Kim ◽

...

Keyword(s):

Breast Cancer ◽

Deep Learning ◽

Lymph Nodes ◽

Learning Algorithm ◽

Medical Center ◽

Characteristic Curve ◽

Lobular Carcinoma ◽

Sentinel Lymph Nodes ◽

Neoadjuvant Systemic Therapy ◽

Frozen Tissue

Abstract Assessing the status of metastasis in sentinel lymph nodes (SLNs) by pathologists is an essential task for the accurate staging of breast cancer. However, histopathological evaluation of sentinel lymph nodes by a pathologist is not easy and is a tedious and time-consuming task. The purpose of this study is to review a challenge competition (HeLP 2018) to develop automated solutions for the classification of metastases in hematoxylin and eosin–stained frozen tissue sections of SLNs in breast cancer patients. A total of 297 digital slides were obtained from frozen SLN sections, which include post–neoadjuvant cases (n = 144, 48.5%) in Asan Medical Center, South Korea. The slides were divided into training, development, and validation sets. All of the imaging datasets have been manually segmented by expert pathologists. A total of 10 participants were allowed to use the Kakao challenge platform for six weeks with two P40 GPUs. The algorithms were assessed in terms of the AUC (area under receiver operating characteristic curve). The top three teams showed 0.986, 0.985, and 0.945 AUCs for the development set and 0.805, 0.776, and 0.765 AUCs for the validation set. Micrometastatic tumors, neoadjuvant systemic therapy, invasive lobular carcinoma, and histologic grade 3 were associated with lower diagnostic accuracy. In a challenge competition, accurate deep learning algorithms have been developed, which can be helpful in making frozen diagnosis of intraoperative sentinel lymph node biopsy. Whether this approach has clinical utility will require evaluation in a clinical setting.

Download Full-text

Using Linear Discriminant Analysis to Classify Snowfall Situations into Avalanching and Non-Avalanching Ones

Journal of Glaciology ◽

10.1017/s0022143000029683 ◽

1977 ◽

Vol 19 (81) ◽

pp. 679-680

Author(s):

N.F. Drozdovskaya

Keyword(s):

Discriminant Analysis ◽

Statistical Methods ◽

Training Sample ◽

Dispersion Analysis ◽

Snow Accumulation ◽

Discriminant Functions ◽

Prognostic Information ◽

Individual Parameter ◽

Linear Discriminant ◽

Avalanche Forecasting

Abstract The existing methods of predicting avalanche danger often do not meet users’ demands because of the empiric character of the insufficient volume of information used. In such forecasts the contribution of each individual parameter into the prognostic information is unknown, and this is very important when studying such an event as avalanche formation, which is conditioned by a complex interaction of numerous factors, including snow accumulation, the state of snow thickness, and the conditions of its development. It is obvious that such problems can be successfully solved by statistical methods, and that explains the growing interest in numerical methods of avalanche forecasting. Problems of multi-dimensional observations arises in many scientific fields. The method suited for this problem is discriminant analysis, the purpose of which is to divide a multi-dimensional observation vector into predetermined classes. This study considers the prognostic (diagnostic) problems of fresh-snow avalanches released during snowfall or in the two days after it has ceased. The theoretical basis is a complex of statistical methods: correlation and dispersion analysis, “sifting" for the choice of predictors’ informative groups, construction of linear parametric discriminant functions, predictions based on training sample, and verification of discriminant functions based on independent material. The archive used in the study consisted of 500 avalanching cases and 1 300 non-avalanching ones. All situations were grouped according to geomorphological characteristics. Each situation is described by eight meteorological characteristics. The results of classification of snowfall situations into avalanching and non-avalanching ones are as follows: reliability of ρ is from 75% to 91%, H from 0.15 to 0.51; based on independent material the reliability of ρ is from 63% to 85%, H from 0.10 to 0.56. This paper has been accepted in revised form for publication in a later issue of the Journal of Glaciology.

Download Full-text

Modified Bidirectional Encoder Representations From Transformers Extractive Summarization Model for Hospital Information Systems Based on Character-Level Tokens (AlphaBERT): Development and Performance Evaluation

JMIR Medical Informatics ◽

10.2196/17787 ◽

2020 ◽

Vol 8 (4) ◽

pp. e17787 ◽

Cited By ~ 1

Author(s):

Yen-Pin Chen ◽

Yi-Ying Chen ◽

Jr-Jiun Lin ◽

Chien-Hua Huang ◽

Feipei Lai

Keyword(s):

Deep Learning ◽

Information Systems ◽

Characteristic Curve ◽

Hospital Information Systems ◽

University Hospital ◽

Free Text ◽

Sufficient Information ◽

Extractive Summarization ◽

Significant Difference ◽

Model Size

Background Doctors must care for many patients simultaneously, and it is time-consuming to find and examine all patients’ medical histories. Discharge diagnoses provide hospital staff with sufficient information to enable handling multiple patients; however, the excessive amount of words in the diagnostic sentences poses problems. Deep learning may be an effective solution to overcome this problem, but the use of such a heavy model may also add another obstacle to systems with limited computing resources. Objective We aimed to build a diagnoses-extractive summarization model for hospital information systems and provide a service that can be operated even with limited computing resources. Methods We used a Bidirectional Encoder Representations from Transformers (BERT)-based structure with a two-stage training method based on 258,050 discharge diagnoses obtained from the National Taiwan University Hospital Integrated Medical Database, and the highlighted extractive summaries written by experienced doctors were labeled. The model size was reduced using a character-level token, the number of parameters was decreased from 108,523,714 to 963,496, and the model was pretrained using random mask characters in the discharge diagnoses and International Statistical Classification of Diseases and Related Health Problems sets. We then fine-tuned the model using summary labels and cleaned up the prediction results by averaging all probabilities for entire words to prevent character level–induced fragment words. Model performance was evaluated against existing models BERT, BioBERT, and Long Short-Term Memory (LSTM) using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) L score, and a questionnaire website was built to collect feedback from more doctors for each summary proposal. Results The area under the receiver operating characteristic curve values of the summary proposals were 0.928, 0.941, 0.899, and 0.947 for BERT, BioBERT, LSTM, and the proposed model (AlphaBERT), respectively. The ROUGE-L scores were 0.697, 0.711, 0.648, and 0.693 for BERT, BioBERT, LSTM, and AlphaBERT, respectively. The mean (SD) critique scores from doctors were 2.232 (0.832), 2.134 (0.877), 2.207 (0.844), 1.927 (0.910), and 2.126 (0.874) for reference-by-doctor labels, BERT, BioBERT, LSTM, and AlphaBERT, respectively. Based on the paired t test, there was a statistically significant difference in LSTM compared to the reference (P<.001), BERT (P=.001), BioBERT (P<.001), and AlphaBERT (P=.002), but not in the other models. Conclusions Use of character-level tokens in a BERT model can greatly decrease the model size without significantly reducing performance for diagnoses summarization. A well-developed deep-learning model will enhance doctors’ abilities to manage patients and promote medical studies by providing the capability to use extensive unstructured free-text notes.

Download Full-text

Deep Learning Based Analysis of Breast Cancer Using Advanced Ensemble Classifier and Linear Discriminant Analysis

IEEE Access ◽

10.1109/access.2020.3005228 ◽

2020 ◽

Vol 8 ◽

pp. 120208-120217 ◽

Cited By ~ 1

Author(s):

Xinfeng Zhang ◽

Dianning He ◽

Yue Zheng ◽

Huaibi Huo ◽

Simiao Li ◽

...

Keyword(s):

Breast Cancer ◽

Deep Learning ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Ensemble Classifier ◽

Linear Discriminant

Download Full-text

Deep Learning-Based Method to Differentiate Neuromyelitis Optica Spectrum Disorder From Multiple Sclerosis

Frontiers in Neurology ◽

10.3389/fneur.2020.599042 ◽

2020 ◽

Vol 11 ◽

Author(s):

Hyunjin Kim ◽

Youngin Lee ◽

Yong-Hwan Kim ◽

Young-Min Lim ◽

Ji Sung Lee ◽

...

Keyword(s):

Multiple Sclerosis ◽

Deep Learning ◽

Clinical Practice ◽

Neuromyelitis Optica ◽

Inflammatory Diseases ◽

Medical Center ◽

Characteristic Curve ◽

Brain Magnetic Resonance Imaging ◽

Spectrum Disorder ◽

Neuromyelitis Optica Spectrum Disorder

Background: Differentiating neuromyelitis optica spectrum disorder (NMOSD) from multiple sclerosis (MS) is crucial in the field of diagnostics because, despite their similarities, the treatments for these two diseases are substantially different, and disease-modifying treatments for MS can worsen NMOSD. As brain magnetic resonance imaging (MRI) is an important tool to distinguish the two diseases, extensive research has been conducted to identify the defining characteristics of MRI images corresponding to these two diseases. However, the application of such research in clinical practice is still limited. In this study, we investigate the applicability of a deep learning-based algorithm for differentiating NMOSD from MS.Methods: In this study, we included 338 participants (213 patients with MS, 125 patients with NMOSD) who visited the Asan medical center between February 2009 and February 2020. A 3D convolutional neural network, which is a deep learning-based algorithm, was trained using fluid-attenuated inversion recovery images and clinical information of the participants. The performance of the final model in differentiating NMOSD from MS was evaluated and compared with that of two neurologists.Results: The deep learning-based model exhibited an area under the receiver operating characteristic curve of 0.82 (95% CI, 0.75–0.89). It differentiated NMOSD from MS with an accuracy of 71.1% (sensitivity = 87.8%, specificity = 61.6%), which is comparable to that exhibited by the neurologists. The intra-rater reliability of the two neurologists was moderate (κ = 0.47, 0.50), which was in contrast with the consistent classification of the deep learning-based model.Conclusion: The proposed model was verified to be capable of differentiating NMOSD from MS with accuracy comparable to that of neurologists, exhibiting the advantage of consistent classification. As a result, it can aid differential diagnosis between two important central nervous system inflammatory diseases in clinical practice.

Download Full-text

CheXLocNet: Automatic localization of pneumothorax in chest radiographs using deep convolutional neural networks

PLoS ONE ◽

10.1371/journal.pone.0242013 ◽

2020 ◽

Vol 15 (11) ◽

pp. e0242013

Author(s):

Hongyu Wang ◽

Hong Gu ◽

Pan Qin ◽

Jia Wang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Characteristic Curve ◽

Healthcare Delivery ◽

Chest Radiographs ◽

Training Dataset ◽

Deep Convolutional Neural Networks ◽

Support Tool

Background Pneumothorax can lead to a life-threatening emergency. The experienced radiologists can offer precise diagnosis according to the chest radiographs. The localization of the pneumothorax lesions will help to quickly diagnose, which will be benefit for the patients in the underdevelopment areas lack of the experienced radiologists. In recent years, with the development of large neural network architectures and medical imaging datasets, deep learning methods have become a methodology of choice for analyzing medical images. The objective of this study was to the construct convolutional neural networks to localize the pneumothorax lesions in chest radiographs. Methods and findings We developed a convolutional neural network, called CheXLocNet, for the segmentation of pneumothorax lesions. The SIIM-ACR Pneumothorax Segmentation dataset was used to train and validate CheXLocNets. The training dataset contained 2079 radiographs with the annotated lesion areas. We trained six CheXLocNets with various hyperparameters. Another 300 annotated radiographs were used to select parameters of these CheXLocNets as the validation set. We determined the optimal parameters by the AP50 (average precision at the intersection over union (IoU) equal to 0.50), a segmentation evaluation metric used by several well-known competitions. Then CheXLocNets were evaluated by a test set (1082 normal radiographs and 290 disease radiographs), based on the classification metrics: area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and positive predictive value (PPV); segmentation metrics: IoU and Dice score. For the classification, CheXLocNet with best sensitivity produced an AUC of 0.87, sensitivity of 0.78 (95% CI 0.73-0.83), and specificity of 0.78 (95% CI 0.76-0.81). CheXLocNet with best specificity produced an AUC of 0.79, sensitivity of 0.46 (95% CI 0.40-0.52), and specificity of 0.92 (95% CI 0.90-0.94). For the segmentation, CheXLocNet with best sensitivity produced an IoU of 0.69 and Dice score of 0.72. CheXLocNet with best specificity produced an IoU of 0.77 and Dice score of 0.79. We combined them to form an ensemble CheXLocNet. The ensemble CheXLocNet produced an IoU of 0.81 and Dice score of 0.82. Our CheXLocNet succeeded in automatically detecting pneumothorax lesions, without any human guidance. Conclusions In this study, we proposed a deep learning network, called, CheXLocNet, for the automatic segmentation of chest radiographs to detect pneumothorax. Our CheXLocNets generated accurate classification results and high-quality segmentation masks for the pneumothorax at the same time. This technology has the potential to improve healthcare delivery and increase access to chest radiograph expertise for the detection of diseases. Furthermore, the segmentation results can offer comprehensive geometric information of lesions, which can benefit monitoring the sequential development of lesions with high accuracy. Thus, CheXLocNets can be further extended to be a reliable clinical decision support tool. Although we used transfer learning in training CheXLocNet, the parameters of CheXLocNet was still large for the radiograph dataset. Further work is necessary to prune CheXLocNet suitable for the radiograph dataset.

Download Full-text

Development of cardiotoxicity model using ligand-centric and receptor-centric descriptors

Toxicology Research and Application ◽

10.1177/2397847320971259 ◽

2020 ◽

Vol 4 ◽

pp. 239784732097125

Author(s):

Chirag N Patel ◽

Sivakumar Prasanth Kumar ◽

Rakesh M Rawal ◽

Manishkumar B Thaker ◽

Himanshu A Pandya

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Statistical Significance ◽

Characteristic Curve ◽

Hybrid Approach ◽

Classification Model ◽

Test Set ◽

Linear Discriminant ◽

Specificity And Sensitivity ◽

Physico Chemical

Background: Bioinformatics and statistical analysis have been employed to develop a classification model to distinguish toxic and non-toxic molecules. Aims: The primary objective of this study is to enumerate the cut-off values of various physico-chemical (ligand-centric) and target interaction (receptor-centric) descriptors which forms the basis for classifying cardiotoxic and non-toxic molecules. We also sought correlation of molecular docking, absorption, distribution, metabolism, excretion, and toxicology (ADMET) parameters, Lipinski rules, physico-chemical parameters, etc. of human cardiotoxicity drugs. Methods: A training and test set of 91 compounds were applied to linear discriminant analysis (LDA) using 2D and 3D descriptors as discriminating variables representing various molecular modeling parameters to identify which function of descriptor type is responsible for cardiotoxicity. Internal validation was performed using the leave-one-out cross-validation methodology ensuing in good results, assuring the stability of the discriminant function (DF). Results: The values of the statistical parameters Fisher Discriminant Analysis (FDA) and Wilk’s λ for the DF showed reliable statistical significance, as long as the success rate in the prediction for both the training and the test set attained more than 93% accuracy, 87.50% sensitivity and 94.74% specificity. Conclusion: The predictive model was built using a hybrid approach using organ-specific targets for docking and ADMET properties for the FDA (Food and Drug Administration) approved and withdrawn drugs. Classifiers were developed by linear discriminant analysis and the cut-off was enumerated by receiver operating characteristic curve (ROC) analysis to achieve reliable specificity and sensitivity.

Download Full-text

Classification of Watermelon Seeds Using Morphological Patterns of X-ray Imaging: A Comparison of Conventional Machine Learning and Deep Learning

Sensors ◽

10.3390/s20236753 ◽

2020 ◽

Vol 20 (23) ◽

pp. 6753

Author(s):

Mohammed Raju Ahmed ◽

Jannat Yasmin ◽

Eunsung Park ◽

Geonwoo Kim ◽

Moon S. Kim ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Discriminant Analysis ◽

Transfer Learning ◽

Image Features ◽

Linear Discriminant ◽

X Ray ◽

X Ray Imaging ◽

Conventional Machine ◽

Viable Seeds

In this study, conventional machine learning and deep leaning approaches were evaluated using X-ray imaging techniques for investigating the internal parameters (endosperm and air space) of three cultivars of watermelon seed. In the conventional machine learning, six types of image features were extracted after applying different types of image preprocessing, such as image intensity and contrast enhancement, and noise reduction. The sequential forward selection (SFS) method and Fisher objective function were used as the search strategy and feature optimization. Three classifiers were tested (linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and k-nearest neighbors algorithm (KNN)) to find the best performer. On the other hand, in the transfer learning (deep learning) approaches, simple ConvNet, AlexNet, VGG-19, ResNet-50, and ResNet-101 were used to train the dataset and class prediction of the seed. For the supervised model development (both conventional machine learning and deep learning), the germination test results of the samples were used where the seeds were divided into two classes: (1) normal viable seeds and (2) nonviable and abnormal viable seeds. In the conventional classification, 83.6% accuracy was obtained by LDA using 48 features. ResNet-50 performed better than other transfer learning architectures, with an 87.3% accuracy which was the highest accuracy in all classification models. The findings of this study manifested that transfer learning is a constructive strategy for classifying seeds by analyzing their morphology, where X-ray imaging can be adopted as a potential imaging technique.

Download Full-text

Edge Accelerator for Lifelong Deep Learning using Streaming Linear Discriminant Analysis

2021 IEEE 29th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) ◽

10.1109/fccm51124.2021.00046 ◽

2021 ◽

Author(s):

Duvindu Piyasena ◽

Siew-Kei Lam ◽

Meiqing Wu

Keyword(s):

Deep Learning ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Linear Discriminant

Download Full-text