Research on training set in image segmentation of terahertz digital holographic reconstructed image based on convolutional neural network

Remote sensing image segmentation based on the fuzzy deep convolutional neural network

International Journal of Remote Sensing ◽

10.1080/01431161.2021.1938738 ◽

2021 ◽

Vol 42 (16) ◽

pp. 6267-6286

Author(s):

Tianyu Zhao ◽

Jindong Xu ◽

Rui Chen ◽

Xiangyue Ma

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Remote Sensing Image ◽

Deep Convolutional Neural Network

Geometric property-based convolutional neural network for indoor object detection

International Journal of Advanced Robotic Systems ◽

10.1177/1729881421993323 ◽

2021 ◽

Vol 18 (1) ◽

pp. 172988142199332

Author(s):

Xintao Ding ◽

Boquan Li ◽

Jinbao Wang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Geometric Property ◽

Ground Truth ◽

Geometric Constraints ◽

Depth Information ◽

Training Set ◽

Object Knowledge ◽

The Mean

Indoor object detection is a demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detectors and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric properties into Faster R-CNN to improve detection performance. In detail, we first use mesh grids formed by the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we use 2D geometric constraints to refine the RPN-RoIs, where the 2D constraint for each class is a convex hull region enclosing the width and height coordinates of the ground-truth boxes in the training set. Comparison experiments are carried out on two indoor datasets, SUN2012 and NYUv2. Since depth information is available in NYUv2, we add depth constraints to GP-Faster and propose a 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster improve the mean average precision.
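
The 2D constraint described in the abstract above can be illustrated with a minimal sketch. It builds the convex hull of the (width, height) pairs of one class's ground-truth boxes and keeps only the RoIs whose size falls inside that hull. The abstract does not say how the refinement is applied, so discarding out-of-hull RoIs is an assumption, and the function names `convex_hull`, `inside_hull`, and `refine_rois` are hypothetical rather than taken from the paper.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of the cross product of vectors o->a and o->b."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points: List[Point]) -> List[Point]:
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # Drop the last point of each chain because it repeats the first of the other.
    return lower[:-1] + upper[:-1]


def inside_hull(hull: List[Point], p: Point) -> bool:
    """True if p lies inside or on the boundary of a counter-clockwise hull."""
    n = len(hull)
    if n < 3:
        return False
    return all(cross(hull[i], hull[(i + 1) % n], p) >= 0 for i in range(n))


def refine_rois(rois: List[Box], hull: List[Point]) -> List[Box]:
    """Assumed refinement rule: keep RoIs whose (width, height) is inside the hull."""
    return [r for r in rois if inside_hull(hull, (r[2] - r[0], r[3] - r[1]))]


# Toy (width, height) pairs standing in for one class's ground-truth boxes.
gt_sizes = [(10, 10), (50, 10), (50, 40), (10, 40), (30, 25)]
hull = convex_hull(gt_sizes)
kept = refine_rois([(0, 0, 30, 20), (0, 0, 100, 5)], hull)
```

In this toy example the 100 x 5 RoI is rejected because no training box of that class has a comparable shape; a depth constraint could be added in the same way by extending the test to a third coordinate.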

Unraveling the deep learning gearbox in optical coherence tomography image segmentation towards explainable artificial intelligence

Communications Biology ◽

10.1038/s42003-021-01697-y ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Peter M. Maloca ◽

Philipp L. Müller ◽

Aaron Y. Lee ◽

Adnan Tufail ◽

Konstantinos Balaskas ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Optical Coherence Tomography ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Learning Algorithm ◽

Ground Truth ◽

Optical Coherence Tomography Image ◽

Optical Coherence ◽

Tomography Image

Machine learning has greatly facilitated the analysis of medical data, but its internal operations usually remain non-transparent. To better comprehend these opaque procedures, a convolutional neural network for optical coherence tomography image segmentation was enhanced with a Traceable Relevance Explainability (T-REX) technique. The proposed application was based on three components: ground truth generation by multiple graders, calculation of Hamming distances among the graders and the machine learning algorithm, and a smart data visualization (‘neural recording’). An overall average variability of 1.75% was found between the human graders and the algorithm, slightly lower than the 2.02% found among the human graders. The ambiguity in the ground truth had a noteworthy impact on the machine learning results, which could be visualized. The convolutional neural network balanced between graders and allowed for modifiable predictions dependent on the compartment. Using the proposed T-REX setup, machine learning processes could be rendered more transparent and understandable, possibly leading to optimized applications.
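
As a rough illustration of the Hamming-distance comparison mentioned in the abstract above, the sketch below computes the fraction of pixels on which two label maps disagree, for every pair of annotators. It assumes each segmentation is a flattened list of integer class labels of equal length; the names `hamming_fraction` and `pairwise_variability` and the toy masks are hypothetical and not the authors' implementation.

```python
from itertools import combinations
from typing import Dict, Sequence, Tuple


def hamming_fraction(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of positions (pixels) where two label maps differ."""
    if len(a) != len(b):
        raise ValueError("label maps must have the same number of pixels")
    if not a:
        raise ValueError("label maps must not be empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def pairwise_variability(
    masks: Dict[str, Sequence[int]]
) -> Dict[Tuple[str, str], float]:
    """Normalized Hamming distance for every unordered pair of annotators."""
    return {
        (m, n): hamming_fraction(masks[m], masks[n])
        for m, n in combinations(masks, 2)
    }


# Toy 8-pixel label maps: two human graders and one network prediction.
masks = {
    "grader_1": [0, 0, 1, 1, 2, 2, 2, 0],
    "grader_2": [0, 1, 1, 1, 2, 2, 2, 0],
    "cnn":      [0, 0, 1, 1, 2, 2, 0, 0],
}
variability = pairwise_variability(masks)
```

Averaging the grader-versus-network entries and the grader-versus-grader entries separately would give two summary figures of the kind the abstract reports as percentages.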

Active Learning with Bayesian UNet for Efficient Semantic Image Segmentation

Journal of Imaging ◽

10.3390/jimaging7020037 ◽

2021 ◽

Vol 7 (2) ◽

pp. 37

Author(s):

Isah Charles Saidu ◽

Lehel Csató

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Active Learning ◽

Convolutional Neural Network ◽

Medical Image ◽

Segmentation Method ◽

Semantic Image Segmentation ◽

Batch Normalization ◽

Set Up ◽

Image Datasets

We present a sample-efficient image segmentation method using active learning, which we call Active Bayesian UNet, or AB-UNet. This is a convolutional neural network using batch normalization and max-pool dropout. The Bayesian setup is achieved by exploiting the probabilistic extension of the dropout mechanism, which makes it possible to use the uncertainty inherently present in the system. We run our experiments on various medical image datasets and show that, with a smaller annotation effort, AB-UNet leads to stable training and better generalization. In addition, we can efficiently choose samples from an unlabelled dataset.
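
The idea of using dropout-induced uncertainty to choose samples from an unlabelled pool can be sketched as follows. The sketch assumes that several stochastic forward passes (dropout left active at inference) have already produced per-pixel class probabilities for each unlabelled image, and it ranks images by mean predictive entropy. The abstract does not name an acquisition function, so predictive entropy is an assumption, and `predictive_entropy` and `select_for_annotation` are hypothetical names.

```python
import math
from typing import Dict, List

# One stochastic pass: a list of pixels, each pixel a list of class probabilities.
Pass = List[List[float]]


def predictive_entropy(mc_probs: List[Pass]) -> float:
    """Mean per-pixel entropy of the class distribution averaged over passes."""
    n_passes = len(mc_probs)
    n_pixels = len(mc_probs[0])
    total = 0.0
    for i in range(n_pixels):
        n_classes = len(mc_probs[0][i])
        # Average the class probabilities of pixel i over all stochastic passes.
        mean = [sum(p[i][c] for p in mc_probs) / n_passes for c in range(n_classes)]
        total += -sum(q * math.log(q) for q in mean if q > 0)
    return total / n_pixels


def select_for_annotation(pool: Dict[str, List[Pass]], k: int) -> List[str]:
    """Return the k unlabelled images with the highest predictive entropy."""
    scores = {name: predictive_entropy(mc) for name, mc in pool.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Toy pool: one-pixel "images", two stochastic passes each.
pool = {
    "confident": [[[1.0, 0.0]], [[1.0, 0.0]]],
    "uncertain": [[[0.9, 0.1]], [[0.1, 0.9]]],
}
chosen = select_for_annotation(pool, k=1)
```

Images on which the stochastic passes disagree receive high entropy and are sent to the annotator first, which is how a smaller annotation effort can still cover the cases the model finds hardest.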

Chicken Image Segmentation via Multi-Scale Attention-Based Deep Convolutional Neural Network

IEEE Access ◽

10.1109/access.2021.3074297 ◽

2021 ◽

pp. 1-1

Author(s):

Wei Li ◽

Yang Xiao ◽

Xibin Song ◽

Na Lv ◽

Xinbo Jiang ◽

...

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Deep Convolutional Neural Network ◽

Multi Scale

Style Transfer Using Convolutional Neural Network and Image Segmentation

TECHART Journal of Arts and Imaging Science ◽

10.15323/techart.2021.5.8.2.5 ◽

2021 ◽

Vol 8 (2) ◽

pp. 5-8

Author(s):

Minyeo Kim ◽

Hyeongseok Choi ◽

Joonki Paik

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Style Transfer

Lw-TISNet: Light-Weight Convolutional Neural Network Incorporating Attention Mechanism and Multiple Supervision Strategy for Tongue Image Segmentation

Sensing and Imaging ◽

10.1007/s11220-021-00375-x ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Xiaodong Huang ◽

Li Zhuo ◽

Hui Zhang ◽

Xiaoguang Li ◽

Jing Zhang

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Attention Mechanism ◽

Light Weight

Evaluation of Power Insulator Detection Efficiency with the Use of Limited Training Dataset

Applied Sciences ◽

10.3390/app10062104 ◽

2020 ◽

Vol 10 (6) ◽

pp. 2104

Author(s):

Michał Tomaszewski ◽

Paweł Michalski ◽

Jakub Osuchowski

Keyword(s):

Neural Network ◽

Neural Networks ◽

Object Detection ◽

Convolutional Neural Network ◽

Deep Neural Networks ◽

Detection Efficiency ◽

Training Data ◽

Training Dataset ◽

Training Set ◽

Convolutional Network

This article presents an analysis of the effectiveness of object detection in digital images when only a limited quantity of input data is available. Using a limited set of learning data was made possible by developing a detailed scenario of the task, which strictly defined the operating conditions of the convolutional neural network detector in the considered case. The described solution uses known deep neural network architectures for learning and object detection. The article compares the detection results of the most popular deep neural networks trained on a limited training set composed of a specific number of selected images from diagnostic video. The analyzed input material was recorded during an inspection flight conducted along high-voltage lines. The object detector was built for a power insulator. The main contribution of the presented paper is the evidence that a limited training set (in our case, just 60 training frames) could be used for object detection, assuming an outdoor scenario with low variability of environmental conditions. Deciding which network will generate the best result for such a limited training set is not a trivial task. The conducted research suggests that deep neural networks achieve different levels of effectiveness depending on the amount of training data. The best results were obtained for two convolutional neural networks: the faster region-convolutional neural network (faster R-CNN) and the region-based fully convolutional network (R-FCN). Faster R-CNN reached the highest AP (average precision), at a level of 0.8 for 60 frames. The R-FCN model achieved a worse AP; however, the number of input samples has a significantly lower influence on its results than for the other CNN models, which, in the authors’ assessment, is a desirable feature in the case of a limited training set.
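
For readers unfamiliar with the AP figure quoted in the abstract above, the sketch below shows one common way to compute average precision for a single class from ranked detections. The abstract does not state which AP variant the authors used; this version assumes all-point interpolation of the precision-recall curve, and the function name `average_precision` and the toy detections are hypothetical.

```python
from typing import List, Tuple


def average_precision(
    detections: List[Tuple[float, bool]], n_ground_truth: int
) -> float:
    """Area under the interpolated precision-recall curve for one class.

    detections: (confidence, is_true_positive) pairs, in any order.
    n_ground_truth: number of annotated objects of this class.
    """
    if n_ground_truth <= 0:
        raise ValueError("n_ground_truth must be positive")
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)
    tp = fp = 0
    precisions: List[float] = []
    recalls: List[float] = []
    for _, is_tp in ranked:
        if is_tp:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))
        recalls.append(tp / n_ground_truth)
    # Precision envelope: make precision non-increasing as recall grows.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    # Sum the rectangles between consecutive recall values.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


# Toy example: three detections, two annotated insulators.
ap = average_precision([(0.9, True), (0.8, False), (0.7, True)], n_ground_truth=2)
```

Whether a detection counts as a true positive is decided beforehand, typically by an overlap threshold against the ground-truth boxes; that matching step is outside this sketch.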
