Automatic Lung Segmentation on Chest X-rays Using Self-Attention Deep Neural Network

Sensors ◽  
2021 ◽  
Vol 21 (2) ◽  
pp. 369
Author(s):  
Minki Kim ◽  
Byoung-Dai Lee

Accurate identification of the boundaries of organs or abnormal objects (e.g., tumors) in medical images is important in surgical planning and in the diagnosis and prognosis of diseases. In this study, we propose a deep learning-based method to segment lung areas in chest X-rays. The novel aspect of the proposed method is the self-attention module, where the outputs of the channel and spatial attention modules are combined to generate attention maps, with each highlighting those regions of feature maps that correspond to “what” and “where” to attend in the learning process, respectively. Thereafter, the attention maps are multiplied element-wise with the input feature map, and the intermediate results are added to the input feature map again for residual learning. Using X-ray images collected from public datasets for training and evaluation, we applied the proposed attention modules to U-Net for segmentation of lung areas and conducted experiments while changing the locations of the attention modules in the baseline network. The experimental results showed that our method achieved comparable or better performance than the existing medical image segmentation networks in terms of Dice score when the proposed attention modules were placed in lower layers of both the contracting and expanding paths of U-Net.
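The attention idea described above can be sketched as follows. This is an illustrative simplification in numpy, not the authors' code: the pooling choices and the sigmoid activation are assumptions, and in the real network the attention maps are produced by learned layers.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def channel_attention(x):
    # x: (C, H, W); squeeze spatial dims -> per-channel weights ("what")
    pooled = x.mean(axis=(1, 2))              # (C,)
    return sigmoid(pooled)[:, None, None]     # (C, 1, 1)

def spatial_attention(x):
    # pool across channels -> per-pixel weights ("where")
    pooled = x.mean(axis=0)                   # (H, W)
    return sigmoid(pooled)[None, :, :]        # (1, H, W)

def self_attention_block(x):
    # combine the two attention maps, apply them element-wise,
    # then add the result back to the input (residual learning)
    att = channel_attention(x) * spatial_attention(x)
    return x + x * att

x = np.random.randn(8, 16, 16)
y = self_attention_block(x)   # same shape as x
```

The residual addition at the end is what lets the block be dropped into different depths of U-Net without disrupting the original signal path.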

2021 ◽  
Vol 1 (1) ◽  
pp. 29-31
Author(s):  
Mahmood Haithami ◽  
Amr Ahmed ◽  
Iman Yi Liao ◽  
Hamid Jalab

In this paper, we aim to enhance the segmentation capabilities of DeepLabV3 by employing a gated recurrent unit (GRU). A 1-by-1 convolution in DeepLabV3 was replaced by a GRU after the Atrous Spatial Pyramid Pooling (ASPP) layer to combine the input feature maps. The convolution and the GRU have sharable parameters; the latter, however, has gates that enable or disable the contribution of each input feature map. Experiments on unseen test sets demonstrate that employing a GRU instead of the convolution produces better segmentation results. The datasets used are public datasets provided by the MedAI competition.
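The gating behaviour that distinguishes the GRU from a fixed 1-by-1 convolution can be sketched with a minimal GRU cell that consumes the ASPP branch outputs one at a time. This is an assumed simplification (flattened feature vectors, random weights), not the published implementation:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_combine(branches, Wz, Uz, Wr, Ur, Wh, Uh):
    # branches: list of (D,) feature vectors, e.g. flattened ASPP outputs.
    # The update gate z and reset gate r scale each branch's contribution,
    # whereas a 1-by-1 convolution would mix branches with fixed weights.
    h = np.zeros_like(branches[0])
    for x in branches:
        z = sigmoid(Wz @ x + Uz @ h)          # update gate
        r = sigmoid(Wr @ x + Ur @ h)          # reset gate
        h_tilde = np.tanh(Wh @ x + Uh @ (r * h))
        h = (1 - z) * h + z * h_tilde
    return h

D = 6
rng = np.random.default_rng(0)
params = [rng.standard_normal((D, D)) * 0.1 for _ in range(6)]
branches = [rng.standard_normal(D) for _ in range(4)]  # e.g. 4 ASPP branches
fused = gru_combine(branches, *params)
```

In practice the recurrence runs over feature maps rather than vectors, but the gate arithmetic is the same.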


Electronics ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 348
Author(s):  
Choongsang Cho ◽  
Young Han Lee ◽  
Jongyoul Park ◽  
Sangkeun Lee

Semantic image segmentation has a wide range of applications. In medical image segmentation, accuracy is even more critical than in other areas, because the results provide information directly applicable to disease diagnosis, surgical planning, and history monitoring. The state-of-the-art models in medical image segmentation are variants of the encoder-decoder architecture called U-Net. To effectively reflect the spatial features of the feature maps in an encoder-decoder architecture, we propose a spatially adaptive weighting scheme for medical image segmentation. Specifically, a spatial feature map is estimated from the feature maps, and the learned weighting parameters are obtained from the computed map, since segmentation results are predicted from the feature map through a convolutional layer. In the proposed networks, the convolutional block for extracting the feature map is replaced with widely used convolutional frameworks: VGG, ResNet, and Bottleneck ResNet structures. In addition, a bilinear up-sampling method replaces the up-convolutional layer to increase the resolution of the feature map. For the performance evaluation of the proposed architecture, we used three data sets covering different medical imaging modalities. Experimental results show that the network with the proposed self-spatially adaptive weighting block based on the ResNet framework gave the highest IoU and DICE scores on the three tasks compared to other methods. In particular, the segmentation network combining the proposed self-spatially adaptive block and the ResNet framework recorded the highest improvements, 3.01% in IoU and 2.89% in DICE score, on the Nerve data set. Therefore, we believe that the proposed scheme can be a useful tool for image segmentation tasks based on the encoder-decoder architecture.
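A minimal sketch of the weighting step, under stated assumptions (the spatial map here is a simple channel mean passed through a sigmoid; in the paper the weights are learned): a per-pixel weight map is estimated from the feature maps and applied to them before prediction.

```python
import numpy as np

def spatially_adaptive_weighting(x):
    # x: (C, H, W) feature maps
    spatial = x.mean(axis=0)                   # estimate an (H, W) spatial map
    weights = 1.0 / (1.0 + np.exp(-spatial))   # per-pixel weights (learned in practice)
    return x * weights[None, :, :]             # re-weight every channel per pixel

x = np.random.randn(4, 8, 8)
y = spatially_adaptive_weighting(x)   # same shape, spatially re-weighted
```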


Author(s):  
Hongfeng You ◽  
Long Yu ◽  
Shengwei Tian ◽  
Weiwei Cai

Abstract To obtain more semantic information from small samples for medical image segmentation, this paper proposes a simple and efficient dual-rotation network (DR-Net) that strengthens the quality of both local and global feature maps. The key steps of the DR-Net algorithm are as follows (as shown in Fig. 1). First, the number of channels in each layer is divided into four equal portions. Then, different rotation strategies are used to obtain a rotation feature map in multiple directions for each subimage. Next, multiscale convolution and dilated convolution are used to learn the local and global features of the feature maps. Finally, a residual strategy and an integration strategy are used to fuse the generated feature maps. Experimental results demonstrate that the DR-Net method obtains higher segmentation accuracy on both the CHAOS and BraTS data sets compared to state-of-the-art methods.
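The channel-split-and-rotate step can be sketched as below. The specific rotation angles (multiples of 90 degrees) and the additive residual fusion are illustrative assumptions; the paper's rotation strategies and integration strategy may differ.

```python
import numpy as np

def dual_rotation_block(x):
    # x: (C, H, W) with C divisible by 4 and H == W
    groups = np.split(x, 4, axis=0)               # four equal channel portions
    # rotate each group by a different multiple of 90 degrees
    rotated = [np.rot90(g, k=k, axes=(1, 2)) for k, g in enumerate(groups)]
    fused = np.concatenate(rotated, axis=0)       # re-assemble the channels
    return x + fused                              # residual fusion

x = np.random.randn(8, 6, 6)
y = dual_rotation_block(x)   # same shape as x
```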


Author(s):  
Zhenzhen Yang ◽  
Pengfei Xu ◽  
Yongpeng Yang ◽  
Bing-Kun Bao

The U-Net has become the most popular structure in medical image segmentation in recent years. Although its performance for medical image segmentation is outstanding, a large number of experiments demonstrate that the classical U-Net architecture is insufficient when the size of segmentation targets varies and an imbalance arises between target and background in different forms of segmentation. To improve the U-Net architecture, we develop a new architecture named the densely connected U-Net (DenseUNet) in this article. The proposed DenseUNet adopts a dense block to improve the feature extraction capability and employs a multi-feature fuse block that fuses feature maps of different levels to increase the accuracy of feature extraction. In addition, in view of the advantages of the cross-entropy and Dice loss functions, a new loss function for the DenseUNet network is proposed to deal with the imbalance between target and background. Finally, we tested the proposed DenseUNet network and compared it with the multi-resolutional U-Net (MultiResUNet) and the classic U-Net on three different datasets. The experimental results show that the DenseUNet network performs significantly better than the MultiResUNet and the classic U-Net.
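A combined loss in the spirit described above can be sketched as follows. The equal weighting `alpha=0.5` is an assumption, not the paper's formulation: the cross-entropy term handles per-pixel classification, while the Dice term counteracts the target/background imbalance.

```python
import numpy as np

def bce_dice_loss(pred, target, alpha=0.5, eps=1e-7):
    # pred: probabilities in (0, 1); target: binary ground truth
    pred = np.clip(pred, eps, 1 - eps)
    bce = -np.mean(target * np.log(pred) + (1 - target) * np.log(1 - pred))
    inter = np.sum(pred * target)
    dice = (2 * inter + eps) / (np.sum(pred) + np.sum(target) + eps)
    return alpha * bce + (1 - alpha) * (1 - dice)

pred = np.array([[0.9, 0.1], [0.8, 0.2]])
target = np.array([[1.0, 0.0], [1.0, 0.0]])
loss = bce_dice_loss(pred, target)   # small loss for near-correct predictions
```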


Author(s):  
Francesco Luceri ◽  
Davide Cucchi ◽  
Enrico Rosagrata ◽  
Carlo Eugenio Zaolino ◽  
Alessandra Menon ◽  
...  

Abstract Introduction The coronoid process plays a key role in preserving elbow stability. Currently, there are no radiographic indexes conceived to assess intrinsic elbow stability and joint congruency. The aim of this study is to present new radiological parameters that help assess the intrinsic stability of the ulnohumeral joint, and to define normal values of these indexes in a healthy population. Methods Four independent observers (two orthopaedic surgeons and two radiologists) selected lateral view X-rays of subjects with no history of upper limb disease or surgery. The following radiographic indexes were defined: trochlear depth index (TDI); anterior coverage index (ACI); posterior coverage index (PCI); olecranon–coronoid angle (OCA); radiographic coverage angle (RCA). Inter-observer and intra-observer reproducibility were assessed for each index. Results 126 subjects were included. Standardized lateral elbow radiographs (62 left and 64 right elbows) were obtained and analysed. The mean TDI was 0.46 ± 0.06 (0.3–1.6), the mean ACI was 2.0 ± 0.2 (1.6–3.1) and the mean PCI was 1.3 ± 0.1 (1.0–1.9). The mean RCA was 179.6 ± 8.3° (normalized RCA: 49.9 ± 2.3%) and the mean OCA was 24.6 ± 3.7°. The indexes showed a high grade of inter-observer and intra-observer reliability for each of the four observers. Significantly higher values were found in males for TDI, ACI, PCI and RCA. Conclusion The novel radiological parameters described are simple, reliable and easily reproducible. These features make them a promising tool for radiographic evaluation both for orthopaedic surgeons and for radiologists in the emergency department setting or during outpatient services. Level of evidence Basic Science Study (Case Series). Clinical relevance The novel radiological parameters described are reliable, easily reproducible and useful for orthopaedic surgeons as well as radiologists in daily clinical practice.


2021 ◽  
Vol 13 (11) ◽  
pp. 2171
Author(s):  
Yuhao Qing ◽  
Wenyi Liu ◽  
Liuyan Feng ◽  
Wanjia Gao

Despite significant progress in object detection tasks, remote sensing image target detection is still challenging owing to complex backgrounds, large differences in target sizes, and the uneven distribution of rotating objects. In this study, we consider model accuracy, inference speed, and the detection of objects at any angle. We propose a RepVGG-YOLO network that uses an improved RepVGG model as the backbone feature extraction network, which performs the initial feature extraction from the input image while balancing training accuracy and inference speed. We use an improved feature pyramid network (FPN) and path aggregation network (PANet) to reprocess the features output by the backbone network. The FPN and PANet modules integrate feature maps of different layers, combine context information on multiple scales, accumulate multiple features, and strengthen feature information extraction. Finally, to maximize the detection accuracy of objects of all sizes, we use four target detection scales at the network output to enhance feature extraction from small remote sensing target pixels. To solve the angle problem for arbitrary objects, we improve the classification loss function using circular smooth label technology, turning the angle regression problem into a classification problem and increasing the detection accuracy of objects at any angle. We conducted experiments on two public datasets, DOTA and HRSC2016. Our results show that the proposed method performs better than previous methods.
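The circular smooth label encoding mentioned above can be sketched as below. The bin count, window radius, and Gaussian window are typical choices for this technique, not necessarily those used in the paper: instead of a one-hot angle label, each angle gets a smooth window that wraps around circularly, so angles near the period boundary are no longer penalized as if they were maximally wrong.

```python
import numpy as np

def circular_smooth_label(angle_bin, num_bins=180, radius=6):
    # angle_bin: integer bin index of the ground-truth angle
    bins = np.arange(num_bins)
    d = np.abs(bins - angle_bin)
    d = np.minimum(d, num_bins - d)                 # circular distance
    label = np.exp(-(d ** 2) / (2.0 * radius ** 2)) # Gaussian window
    label[d > radius] = 0.0                         # truncate outside the window
    return label

lbl = circular_smooth_label(2)
# bins 176..179 also receive non-zero weight, because they are
# circularly close to bin 2 -- the point of the encoding.
```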


Sensors ◽  
2021 ◽  
Vol 21 (1) ◽  
pp. 268
Author(s):  
Yeganeh Jalali ◽  
Mansoor Fateh ◽  
Mohsen Rezvani ◽  
Vahid Abolghasemi ◽  
Mohammad Hossein Anisi

Lung CT image segmentation is a key process in many applications such as lung cancer detection. It is considered a challenging problem due to similar image densities in the pulmonary structures and differences in scanner types and scanning protocols. Most current semi-automatic segmentation methods rely on human factors and therefore may suffer from a lack of accuracy. Another shortcoming of these methods is their high false-positive rate. In recent years, several approaches based on deep learning frameworks have been effectively applied in medical image segmentation. Among existing deep neural networks, the U-Net has seen great success in this field. In this paper, we propose a deep neural network architecture to perform automatic lung CT image segmentation. In the proposed method, several extensive preprocessing techniques are applied to the raw CT images. Then, the ground truths corresponding to these images are extracted via morphological operations and manual reforms. Finally, all the prepared images with their corresponding ground truths are fed into a modified U-Net in which the encoder is replaced with a pre-trained ResNet-34 network (referred to as Res BCDU-Net). In this architecture, we employ BConvLSTM (Bidirectional Convolutional Long Short-Term Memory) as an advanced integrator module instead of simple traditional concatenators, to merge the feature maps extracted from the corresponding contracting path into the previous expansion of the up-convolutional layer. Finally, a densely connected convolutional layer is utilized for the contracting path. The results of our extensive experiments on lung CT images (LIDC-IDRI database) confirm the effectiveness of the proposed method, which achieves a Dice coefficient index of 97.31%.
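The reported Dice coefficient index follows the standard definition, which can be computed as below (this is the common metric, not the authors' evaluation code):

```python
import numpy as np

def dice_coefficient(pred_mask, gt_mask, eps=1e-7):
    # pred_mask, gt_mask: binary segmentation masks of the same shape
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    return (2.0 * inter + eps) / (pred.sum() + gt.sum() + eps)

a = np.array([[1, 1, 0], [0, 1, 0]])
b = np.array([[1, 1, 0], [0, 0, 0]])
score = dice_coefficient(a, b)   # 2*2 / (3 + 2) = 0.8
```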


Sensors ◽  
2020 ◽  
Vol 20 (9) ◽  
pp. 2547 ◽  
Author(s):  
Wenxin Dai ◽  
Yuqing Mao ◽  
Rongao Yuan ◽  
Yijing Liu ◽  
Xuemei Pu ◽  
...  

Convolutional neural network (CNN)-based detectors have shown great performance on ship detection in synthetic aperture radar (SAR) images. However, the performance of current models has not been satisfactory for detecting multiscale and small-size ships against complex backgrounds. To address this problem, we propose a novel CNN-based SAR ship detector, which consists of three subnetworks: the Fusion Feature Extractor Network (FFEN), the Region Proposal Network (RPN), and the Refine Detection Network (RDN). Instead of using a single feature map, we fuse feature maps in bottom-up and top-down ways and generate proposals from each fused feature map in the FFEN. Furthermore, we merge the features generated by the region-of-interest (RoI) pooling layer in the RDN. Based on this feature representation strategy, the constructed CNN framework can significantly enhance the location and semantic information for multiscale ships, in particular for small ships. On the other hand, a residual block is introduced to increase the network depth, through which the detection precision can be further improved. The public SAR ship dataset (SSDD) and China Gaofen-3 satellite SAR images are used to validate the proposed method. Our method shows excellent performance in detecting multiscale and small-size ships compared with several competitive models and exhibits high potential in practical applications.
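The top-down half of the fusion described above can be sketched as below. The nearest-neighbour upsampling and additive merge are common simplifications of this pattern, not the exact FFEN: a coarse, semantically rich map is upsampled and added to a finer map, so the fused level carries both location detail and semantics.

```python
import numpy as np

def upsample2x(x):
    # nearest-neighbour upsampling of a (C, H, W) map
    return x.repeat(2, axis=1).repeat(2, axis=2)

def top_down_fuse(fine, coarse):
    # fine: (C, 2H, 2W) shallow map; coarse: (C, H, W) deep map
    return fine + upsample2x(coarse)

fine = np.random.randn(4, 8, 8)    # shallow, high-resolution map
coarse = np.random.randn(4, 4, 4)  # deep, low-resolution map
fused = top_down_fuse(fine, coarse)
```

Proposals would then be generated from each fused level rather than from a single map.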


Sensors ◽  
2018 ◽  
Vol 18 (11) ◽  
pp. 3876 ◽  
Author(s):  
Zhongjian Ma ◽  
Yuanyuan Ding ◽  
Baoqing Li ◽  
Xiaobing Yuan

The pooling layer in Convolutional Neural Networks (CNNs) is designed to reduce dimensions and computational complexity. Unfortunately, a CNN is easily disturbed by noise in images when extracting features from the input. The traditional pooling layer directly samples the input feature maps without considering whether they are affected by noise, which brings accumulated noise into the subsequent feature maps as well as undesirable network outputs. To address this issue, a robust Local Binary Pattern (LBP) Guiding Pooling (G-RLBP) mechanism is proposed in this paper to downsample the input feature maps and lower the noise impact simultaneously. The proposed G-RLBP method calculates, as the final result, the weighted average of all pixels in the sliding window of the pooling layer based on their corresponding probabilities of being affected by noise, thus lowering the noise impact from input images in the first several layers of the CNN. The experimental results show that the carefully designed G-RLBP layer successfully lowers the noise impact and improves the recognition rates of CNN models over the traditional pooling layer. The performance gain of the G-RLBP is quite remarkable when the images are severely affected by noise.
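The weighted-average pooling step can be sketched as below. Here the per-pixel noise probabilities are supplied as an input; in the paper they are derived from local binary patterns, which this sketch does not implement.

```python
import numpy as np

def weighted_pool(x, p_noise, k=2):
    # x: (H, W) feature map; p_noise: (H, W) probability each pixel is noisy;
    # k: pooling window size and stride.
    H, W = x.shape
    out = np.zeros((H // k, W // k))
    for i in range(0, H, k):
        for j in range(0, W, k):
            win = x[i:i + k, j:j + k]
            w = 1.0 - p_noise[i:i + k, j:j + k]   # weight clean pixels more
            out[i // k, j // k] = (win * w).sum() / (w.sum() + 1e-7)
    return out

x = np.arange(16, dtype=float).reshape(4, 4)
p = np.zeros((4, 4))
p[0, 0] = 1.0                  # treat one pixel as certainly noisy
y = weighted_pool(x, p)        # that pixel no longer pollutes its window
```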


Thyroid nodules are among the most common diseases found in adults, and the incidence of thyroid cancer has increased rapidly over the years. Automatic segmentation of ultrasound images is difficult due to their poor quality; hence, several researchers have focused on this problem and observed that U-Net achieves significant performance in medical image segmentation. However, U-Net faces the problem of low resolution, which causes smoothness in the image; hence, in this research work we propose an improvised U-Net that achieves better performance. The main aim of this work is to obtain the probable region of interest through segmentation with better efficiency. To achieve this, the improvised U-Net develops two distinctive feature maps, a high-level feature map and a low-level feature map, to avoid the problem of low resolution. The proposed model is evaluated on a standard dataset using performance metrics such as the Dice coefficient and true positive rate, and it achieves better performance than existing models.
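The pairing of high- and low-level feature maps can be sketched as below. The channel counts and the upsample-then-concatenate step are illustrative assumptions, not the paper's architecture: the low-resolution high-level map is upsampled and concatenated with the high-resolution low-level map, so the decoder sees both semantics and fine detail.

```python
import numpy as np

def fuse_levels(low_level, high_level):
    # low_level: (C1, H, W) fine-detail map;
    # high_level: (C2, H//2, W//2) semantic map.
    up = high_level.repeat(2, axis=1).repeat(2, axis=2)  # nearest-neighbour 2x
    return np.concatenate([low_level, up], axis=0)       # (C1 + C2, H, W)

low = np.random.randn(3, 8, 8)
high = np.random.randn(6, 4, 4)
fused = fuse_levels(low, high)
```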

