Quantitative Assessment of the Effects of Compression on Deep Learning in Digital Pathology Image Analysis

JCO Clinical Cancer Informatics ◽

10.1200/cci.19.00068 ◽

2020 ◽

pp. 221-233

Author(s):

Yijiang Chen ◽

Andrew Janowczyk ◽

Anant Madabhushi

Keyword(s):

Deep Learning ◽

Image Compression ◽

Operating Characteristic ◽

Performance Metrics ◽

Signal To Noise Ratio ◽

Characteristic Curve ◽

Digital Pathology ◽

Jpeg2000 Compression ◽

Classifier Performance ◽

Pathology Image

PURPOSE Deep learning (DL), a class of approaches involving self-learned discriminative features, is increasingly being applied to digital pathology (DP) images for tasks such as disease identification and segmentation of tissue primitives (eg, nuclei, glands, lymphocytes). One application of DP is in telepathology, which involves digitally transmitting DP slides over the Internet for secondary diagnosis by an expert at a remote location. Unfortunately, the places benefiting most from telepathology often have poor Internet quality, resulting in prohibitive transmission times of DP images. Image compression may help, but the degree to which image compression affects performance of DL algorithms has been largely unexplored. METHODS We investigated the effects of image compression on the performance of DL strategies in the context of 3 representative use cases involving segmentation of nuclei (n = 137), segmentation of lymph node metastasis (n = 380), and lymphocyte detection (n = 100). For each use case, test images at various levels of compression (JPEG compression quality score ranging from 1-100 and JPEG2000 compression peak signal-to-noise ratio ranging from 18-100 dB) were evaluated by a DL classifier. Performance metrics including F1 score and area under the receiver operating characteristic curve were computed at the various compression levels. RESULTS Our results suggest that DP images can be compressed by 85% while still maintaining the performance of the DL algorithms at 95% of what is achievable without any compression. Interestingly, the maximum compression level sustainable by DL algorithms is similar to where pathologists also reported difficulties in providing accurate interpretations. CONCLUSION Our findings seem to suggest that in low-resource settings, DP images can be significantly compressed before transmission for DL-based telepathology applications.

Download Full-text

Development and Verification of a Deep Learning Algorithm to Evaluate Small-Bowel Preparation Quality

Diagnostics ◽

10.3390/diagnostics11061127 ◽

2021 ◽

Vol 11 (6) ◽

pp. 1127

Author(s):

Ji Hyung Nam ◽

Dong Jun Oh ◽

Sumin Lee ◽

Hyun Joo Song ◽

Yun Jeong Lim

Keyword(s):

Deep Learning ◽

Small Bowel ◽

Scoring System ◽

Operating Characteristic ◽

Clinical Evidence ◽

Learning Algorithm ◽

Characteristic Curve ◽

External Validation ◽

Test Results ◽

Deep Learning Algorithm

Capsule endoscopy (CE) quality control requires an objective scoring system to evaluate the preparation of the small bowel (SB). We propose a deep learning algorithm to calculate SB cleansing scores and verify the algorithm’s performance. A 5-point scoring system based on clarity of mucosal visualization was used to develop the deep learning algorithm (400,000 frames; 280,000 for training and 120,000 for testing). External validation was performed using additional CE cases (n = 50), and average cleansing scores (1.0 to 5.0) calculated using the algorithm were compared to clinical grades (A to C) assigned by clinicians. Test results obtained using 120,000 frames exhibited 93% accuracy. The separate CE case exhibited substantial agreement between the deep learning algorithm scores and clinicians’ assessments (Cohen’s kappa: 0.672). In the external validation, the cleansing score decreased with worsening clinical grade (scores of 3.9, 3.2, and 2.5 for grades A, B, and C, respectively, p < 0.001). Receiver operating characteristic curve analysis revealed that a cleansing score cut-off of 2.95 indicated clinically adequate preparation. This algorithm provides an objective and automated cleansing score for evaluating SB preparation for CE. The results of this study will serve as clinical evidence supporting the practical use of deep learning algorithms for evaluating SB preparation quality.

Download Full-text

Severely imbalanced Big Data challenges: investigating data sampling approaches

Journal Of Big Data ◽

10.1186/s40537-019-0274-4 ◽

2019 ◽

Vol 6 (1) ◽

Cited By ~ 3

Author(s):

Tawfiq Hasanin ◽

Taghi M. Khoshgoftaar ◽

Joffrey L. Leevy ◽

Richard A. Bauder

Keyword(s):

Big Data ◽

Operating Characteristic ◽

Performance Metrics ◽

Characteristic Curve ◽

Geometric Mean ◽

Class Imbalance ◽

Random Undersampling ◽

First Case ◽

Negative Class

AbstractSevere class imbalance between majority and minority classes in Big Data can bias the predictive performance of Machine Learning algorithms toward the majority (negative) class. Where the minority (positive) class holds greater value than the majority (negative) class and the occurrence of false negatives incurs a greater penalty than false positives, the bias may lead to adverse consequences. Our paper incorporates two case studies, each utilizing three learners, six sampling approaches, two performance metrics, and five sampled distribution ratios, to uniquely investigate the effect of severe class imbalance on Big Data analytics. The learners (Gradient-Boosted Trees, Logistic Regression, Random Forest) were implemented within the Apache Spark framework. The first case study is based on a Medicare fraud detection dataset. The second case study, unlike the first, includes training data from one source (SlowlorisBig Dataset) and test data from a separate source (POST dataset). Results from the Medicare case study are not conclusive regarding the best sampling approach using Area Under the Receiver Operating Characteristic Curve and Geometric Mean performance metrics. However, it should be noted that the Random Undersampling approach performs adequately in the first case study. For the SlowlorisBig case study, Random Undersampling convincingly outperforms the other five sampling approaches (Random Oversampling, Synthetic Minority Over-sampling TEchnique, SMOTE-borderline1 , SMOTE-borderline2 , ADAptive SYNthetic) when measuring performance with Area Under the Receiver Operating Characteristic Curve and Geometric Mean metrics. Based on its classification performance in both case studies, Random Undersampling is the best choice as it results in models with a significantly smaller number of samples, thus reducing computational burden and training time.

Download Full-text

Automated Breast Cancer Detection in Digital Mammograms of Various Densities via Deep Learning

Journal of Personalized Medicine ◽

10.3390/jpm10040211 ◽

2020 ◽

Vol 10 (4) ◽

pp. 211 ◽

Cited By ~ 1

Author(s):

Yong Joon Suh ◽

Jaewon Jung ◽

Bum-Joo Cho

Keyword(s):

Breast Cancer ◽

Deep Learning ◽

Operating Characteristic ◽

Meta Analysis ◽

Characteristic Curve ◽

Malignant Lesion ◽

Model Performance ◽

Mean Values ◽

The Mean ◽

Deep Learning Model

Mammography plays an important role in screening breast cancer among females, and artificial intelligence has enabled the automated detection of diseases on medical images. This study aimed to develop a deep learning model detecting breast cancer in digital mammograms of various densities and to evaluate the model performance compared to previous studies. From 1501 subjects who underwent digital mammography between February 2007 and May 2015, craniocaudal and mediolateral view mammograms were included and concatenated for each breast, ultimately producing 3002 merged images. Two convolutional neural networks were trained to detect any malignant lesion on the merged images. The performances were tested using 301 merged images from 284 subjects and compared to a meta-analysis including 12 previous deep learning studies. The mean area under the receiver-operating characteristic curve (AUC) for detecting breast cancer in each merged mammogram was 0.952 ± 0.005 by DenseNet-169 and 0.954 ± 0.020 by EfficientNet-B5, respectively. The performance for malignancy detection decreased as breast density increased (density A, mean AUC = 0.984 vs. density D, mean AUC = 0.902 by DenseNet-169). When patients’ age was used as a covariate for malignancy detection, the performance showed little change (mean AUC, 0.953 ± 0.005). The mean sensitivity and specificity of the DenseNet-169 (87 and 88%, respectively) surpassed the mean values (81 and 82%, respectively) obtained in a meta-analysis. Deep learning would work efficiently in screening breast cancer in digital mammograms of various densities, which could be maximized in breasts with lower parenchyma density.

Download Full-text

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction

Journal of Computer-Aided Molecular Design ◽

10.1007/s10822-019-00274-0 ◽

2020 ◽

Vol 34 (7) ◽

pp. 717-730 ◽

Cited By ~ 9

Author(s):

Matthew C. Robinson ◽

Robert C. Glen ◽

Alpha A. Lee

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Numerical Experiments ◽

Large Scale ◽

Operating Characteristic ◽

Characteristic Curve ◽

Learning Models ◽

Bioactivity Prediction ◽

Operating Characteristic Curve ◽

Machine Learning Models

Abstract Machine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity prediction and arrive at a somewhat different conclusion. We show that the performance of support vector machines is competitive with that of deep learning methods. Additionally, using a series of numerical experiments, we question the relevance of area under the receiver operating characteristic curve as a metric in virtual screening. We further suggest that area under the precision–recall curve should be used in conjunction with the receiver operating characteristic curve. Our numerical experiments also highlight challenges in estimating the uncertainty in model performance via scaffold-split nested cross validation.

Download Full-text

DeepSleep: Fast and Accurate Delineation of Sleep Arousals at Millisecond Resolution by Deep Learning

10.1101/859256 ◽

2019 ◽

Cited By ~ 1

Author(s):

Hongyang Li ◽

Yuanfang Guan

Keyword(s):

Deep Learning ◽

Operating Characteristic ◽

Characteristic Curve ◽

Learning Approach ◽

Considerable Time ◽

Negative Effects ◽

High Quality ◽

Data Points ◽

Sleep Arousal ◽

Operating Characteristic Curve

AbstractSleep arousals are transient periods of wakefulness punctuated into sleep. Excessive sleep arousals are associated with many negative effects including daytime sleepiness and sleep disorders. High-quality annotation of polysomnographic recordings is crucial for the diagnosis of sleep arousal disorders. Currently, sleep arousals are mainly annotated by human experts through looking at millions of data points manually, which requires considerable time and effort. Here we present a deep learning approach, DeepSleep, which ranked first in the 2018 PhysioNet Challenge for automatically segmenting sleep arousal regions based on polysomnographic recordings. DeepSleep features accurate (area under receiver operating characteristic curve of 0.93), high-resolution (5-millisecond resolution), and fast (10 seconds per sleep record) delineation of sleep arousals.

Download Full-text

Same same but different: a web-based deep learning application for the histopathologic distinction of cortical malformations

10.1101/804682 ◽

2019 ◽

Cited By ~ 1

Author(s):

J. Kubach ◽

A. Muehlebner-Farngruber ◽

F. Soylemezoglu ◽

H. Miyata ◽

P. Niehusmann ◽

...

Keyword(s):

Deep Learning ◽

Web Application ◽

Operating Characteristic ◽

Focal Cortical Dysplasia ◽

Characteristic Curve ◽

Digital Processing ◽

Web Based ◽

Histopathologic Diagnosis ◽

Dysmorphic Neurons ◽

Whole Slide Images

AbstractWe trained a convolutional neural network (CNN) to classify H.E. stained microscopic images of focal cortical dysplasia type IIb (FCD IIb) and cortical tuber of tuberous sclerosis complex (TSC). Both entities are distinct subtypes of human malformations of cortical development that share histopathological features consisting of neuronal dyslamination with dysmorphic neurons and balloon cells. The microscopic review of routine stainings of such surgical specimens remains challenging. A digital processing pipeline was developed for a series of 56 FCD IIb and TSC cases to obtain 4000 regions of interest and 200.000 sub-samples with different zoom and rotation angles to train a CNN. Our best performing network achieved 91% accuracy and 0.88 AUCROC (area under the receiver operating characteristic curve) on a hold-out test-set. Guided gradient-weighted class activation maps visualized morphological features used by the CNN to distinguish both entities. We then developed a web application, which combined the visualization of whole slide images (WSI) with the possibility for classification between FCD IIb and TSC on demand by our pretrained and build-in CNN classifier. This approach might help to introduce deep learning applications for the histopathologic diagnosis of rare and difficult-to-classify brain lesions.

Download Full-text

The impact of pre- and post-image processing techniques on deep learning frameworks: A comprehensive review for digital pathology image analysis

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2020.104129 ◽

2021 ◽

Vol 128 ◽

pp. 104129

Author(s):

Massimo Salvi ◽

U. Rajendra Acharya ◽

Filippo Molinari ◽

Kristen M. Meiburger

Keyword(s):

Image Processing ◽

Image Analysis ◽

Deep Learning ◽

Digital Pathology ◽

Comprehensive Review ◽

Image Processing Techniques ◽

Processing Techniques ◽

The Impact ◽

Learning Frameworks ◽

Pathology Image

Download Full-text

An annotation-free whole-slide training approach to pathological classification of lung cancer types using deep learning

Nature Communications ◽

10.1038/s41467-021-21467-y ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Chi-Long Chen ◽

Chi-Chung Chen ◽

Wei-Hsiang Yu ◽

Szu-Hua Chen ◽

Yu-Chan Chang ◽

...

Keyword(s):

Lung Cancer ◽

Deep Learning ◽

Characteristic Curve ◽

Digital Pathology ◽

Classification Performance ◽

Multiple Instance Learning ◽

Data Set ◽

Memory Mechanism ◽

Strong Localization ◽

Training Approach

AbstractDeep learning for digital pathology is hindered by the extremely high spatial resolution of whole-slide images (WSIs). Most studies have employed patch-based methods, which often require detailed annotation of image patches. This typically involves laborious free-hand contouring on WSIs. To alleviate the burden of such contouring and obtain benefits from scaling up training with numerous WSIs, we develop a method for training neural networks on entire WSIs using only slide-level diagnoses. Our method leverages the unified memory mechanism to overcome the memory constraint of compute accelerators. Experiments conducted on a data set of 9662 lung cancer WSIs reveal that the proposed method achieves areas under the receiver operating characteristic curve of 0.9594 and 0.9414 for adenocarcinoma and squamous cell carcinoma classification on the testing set, respectively. Furthermore, the method demonstrates higher classification performance than multiple-instance learning as well as strong localization results for small lesions through class activation mapping.

Download Full-text

Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases

Journal of Pathology Informatics ◽

10.4103/2153-3539.186902 ◽

2016 ◽

Vol 7 (1) ◽

pp. 29 ◽

Cited By ~ 375

Author(s):

Andrew Janowczyk ◽

Anant Madabhushi

Keyword(s):

Image Analysis ◽

Deep Learning ◽

Digital Pathology ◽

Use Cases ◽

Pathology Image

Download Full-text

Deep learning in digital pathology image analysis: a survey

Frontiers of Medicine ◽

10.1007/s11684-020-0782-9 ◽

2020 ◽

Vol 14 (4) ◽

pp. 470-487

Author(s):

Shujian Deng ◽

Xin Zhang ◽

Wen Yan ◽

Eric I-Chao Chang ◽

Yubo Fan ◽

...

Keyword(s):

Image Analysis ◽

Deep Learning ◽

Digital Pathology ◽

Pathology Image

Download Full-text