Overwater Image Dehazing via Cycle-Consistent Generative Adversarial Network

Shunyuan Zheng; Jiamin Sun; Qinglin Liu; Yuankai Qi; Jianen Yan

doi:10.3390/electronics9111877

Overwater Image Dehazing via Cycle-Consistent Generative Adversarial Network

Electronics ◽

10.3390/electronics9111877 ◽

2020 ◽

Vol 9 (11) ◽

pp. 1877

Author(s):

Shunyuan Zheng ◽

Jiamin Sun ◽

Qinglin Liu ◽

Yuankai Qi ◽

Jianen Yan

Keyword(s):

Neural Network ◽

Image Quality Assessment ◽

State Of The Art ◽

Qualitative Evaluation ◽

Image Dehazing ◽

Generative Adversarial Network ◽

Test Dataset ◽

Adversarial Network ◽

Synthetic Test ◽

Content Preservation

In contrast to images taken on land scenes, images taken over water are more prone to degradation due to the influence of the haze. However, existing image dehazing methods are mainly developed for land-scene images and perform poorly when applied to overwater images. To address this problem, we collect the first overwater image dehazing dataset and propose a Generative Adversial Network (GAN)-based method called OverWater Image Dehazing GAN (OWI-DehazeGAN). Due to the difficulties of collecting paired hazy and clean images, the dataset contains unpaired hazy and clean images taken over water. The proposed OWI-DehazeGAN is composed of an encoder–decoder framework, supervised by a forward-backward translation consistency loss for self-supervision and a perceptual loss for content preservation. In addition to qualitative evaluation, we design an image quality assessment neural network to rank the dehazed images. Experimental results on both real and synthetic test data demonstrate that the proposed method performs superiorly against several state-of-the-art land dehazing methods. Compared with the state-of-the-art, our method gains a significant improvement by 1.94% for SSIM, 7.13% for PSNR and 4.00% for CIEDE2000 on the synthetic test dataset.

Download Full-text

Generative Model of Brain Microbleeds for MRI Detection of Vascular Marker of Neurodegenerative Diseases

Frontiers in Neuroscience ◽

10.3389/fnins.2021.778767 ◽

2021 ◽

Vol 15 ◽

Author(s):

Saba Momeni ◽

Amir Fazlollahi ◽

Leo Lebrat ◽

Paul Yates ◽

Christopher Rowe ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

State Of The Art ◽

Three Dimensional ◽

Ground Truth ◽

Imaging Features ◽

Neural Network Classifier ◽

Generative Adversarial Network ◽

Adversarial Network ◽

High Diversity

Cerebral microbleeds (CMB) are increasingly present with aging and can reveal vascular pathologies associated with neurodegeneration. Deep learning-based classifiers can detect and quantify CMB from MRI, such as susceptibility imaging, but are challenging to train because of the limited availability of ground truth and many confounding imaging features, such as vessels or infarcts. In this study, we present a novel generative adversarial network (GAN) that has been trained to generate three-dimensional lesions, conditioned by volume and location. This allows one to investigate CMB characteristics and create large training datasets for deep learning-based detectors. We demonstrate the benefit of this approach by achieving state-of-the-art CMB detection of real CMB using a convolutional neural network classifier trained on synthetic CMB. Moreover, we showed that our proposed 3D lesion GAN model can be applied on unseen dataset, with different MRI parameters and diseases, to generate synthetic lesions with high diversity and without needing laboriously marked ground truth.

Download Full-text

Adversarial Gaussian Denoiser for Multiple-Level Image Denoising

Sensors ◽

10.3390/s21092998 ◽

2021 ◽

Vol 21 (9) ◽

pp. 2998

Author(s):

Aamir Khan ◽

Weidong Jin ◽

Amir Haider ◽

MuhibUr Rahman ◽

Desheng Wang

Keyword(s):

Neural Network ◽

Image Processing ◽

Computer Vision ◽

Image Denoising ◽

Theoretical Study ◽

State Of The Art ◽

Multiple Level ◽

Generative Adversarial Network ◽

Adversarial Learning ◽

Adversarial Network

Image denoising is a challenging task that is essential in numerous computer vision and image processing problems. This study proposes and applies a generative adversarial network-based image denoising training architecture to multiple-level Gaussian image denoising tasks. Convolutional neural network-based denoising approaches come across a blurriness issue that produces denoised images blurry on texture details. To resolve the blurriness issue, we first performed a theoretical study of the cause of the problem. Subsequently, we proposed an adversarial Gaussian denoiser network, which uses the generative adversarial network-based adversarial learning process for image denoising tasks. This framework resolves the blurriness problem by encouraging the denoiser network to find the distribution of sharp noise-free images instead of blurry images. Experimental results demonstrate that the proposed framework can effectively resolve the blurriness problem and achieve significant denoising efficiency than the state-of-the-art denoising methods.

Download Full-text

Disentangled generative adversarial network for low-dose CT

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00749-z ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Wenchao Du ◽

Hu Chen ◽

Hongyu Yang ◽

Yi Zhang

Keyword(s):

Network Architecture ◽

Low Dose ◽

Noise Suppression ◽

State Of The Art ◽

Visual Quality ◽

Ct Images ◽

Generative Adversarial Network ◽

Low Dose Ct ◽

Adversarial Network ◽

Suppression Method

AbstractGenerative adversarial network (GAN) has been applied for low-dose CT images to predict normal-dose CT images. However, the undesired artifacts and details bring uncertainty to the clinical diagnosis. In order to improve the visual quality while suppressing the noise, in this paper, we mainly studied the two key components of deep learning based low-dose CT (LDCT) restoration models—network architecture and adversarial loss, and proposed a disentangled noise suppression method based on GAN (DNSGAN) for LDCT. Specifically, a generator network, which contains the noise suppression and structure recovery modules, is proposed. Furthermore, a multi-scaled relativistic adversarial loss is introduced to preserve the finer structures of generated images. Experiments on simulated and real LDCT datasets show that the proposed method can effectively remove noise while recovering finer details and provide better visual perception than other state-of-the-art methods.

Download Full-text

3D Convolutional Neural Network for Hyperspectral Image Classification Using Generative Adversarial Network

10.1109/icicta51737.2020.00065 ◽

2020 ◽

Author(s):

QiRui Yang ◽

Yu Liu ◽

Tong Zhou ◽

YuanXi Peng ◽

YuHua Tang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

AN AI-BASED APPROACH TO ENHANCED FRACTURE RESOLUTION IN IMAGE LOGS

10.30632/spwla-2021-0081 ◽

2021 ◽

Author(s):

James Howard ◽

◽

Joe Tracey ◽

Mike Shen ◽

Shawn Zhang ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Nearest Neighbor ◽

Rock Fracture ◽

Short Interval ◽

Acoustic Properties ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Deep Learning Neural Network ◽

Borehole Image

Borehole image logs are used to identify the presence and orientation of fractures, both natural and induced, found in reservoir intervals. The contrast in electrical or acoustic properties of the rock matrix and fluid-filled fractures is sufficiently large enough that sub-resolution features can be detected by these image logging tools. The resolution of these image logs is based on the design and operation of the tools, and generally is in the millimeter per pixel range. Hence the quantitative measurement of actual width remains problematic. An artificial intelligence (AI) -based workflow combines the statistical information obtained from a Machine-Learning (ML) segmentation process with a multiple-layer neural network that defines a Deep Learning process that enhances fractures in a borehole image. These new images allow for a more robust analysis of fracture widths, especially those that are sub-resolution. The images from a BHTV log were first segmented into rock and fluid-filled fractures using a ML-segmentation tool that applied multiple image processing filters that captured information to describe patterns in fracture-rock distribution based on nearest-neighbor behavior. The robust ML analysis was trained by users to identify these two components over a short interval in the well, and then the regression model-based coefficients applied to the remaining log. Based on the training, each pixel was assigned a probability value between 1.0 (being a fracture) and 0.0 (pure rock), with most of the pixels assigned one of these two values. Intermediate probabilities represented pixels on the edge of rock-fracture interface or the presence of one or more sub-resolution fractures within the rock. The probability matrix produced a map or image of the distribution of probabilities that determined whether a given pixel in the image was a fracture or partially filled with a fracture. The Deep Learning neural network was based on a Conditional Generative Adversarial Network (cGAN) approach where the probability map was first encoded and combined with a noise vector that acted as a seed for diverse feature generation. This combination was used to generate new images that represented the BHTV response. The second layer of the neural network, the adversarial or discriminator portion, determined whether the generated images were representative of the actual BHTV by comparing the generated images with actual images from the log and producing an output probability of whether it was real or fake. This probability was then used to train the generator and discriminator models that were then applied to the entire log. Several scenarios were run with different probability maps. The enhanced BHTV images brought out fractures observed in the core photos that were less obvious in the original BTHV log through enhanced continuity and improved resolution on fracture widths.

Download Full-text

Learning a Generative Model for Fusing Infrared and Visible Images via Conditional Generative Adversarial Network with Dual Discriminators

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/549 ◽

2019 ◽

Cited By ~ 12

Author(s):

Han Xu ◽

Pengwei Liang ◽

Wei Yu ◽

Junjun Jiang ◽

Jiayi Ma

Keyword(s):

Probability Distribution ◽

State Of The Art ◽

Infrared Image ◽

Infrared Images ◽

Generative Adversarial Network ◽

Visible Image ◽

Qualitative And Quantitative ◽

Adversarial Network ◽

Fused Image ◽

Visible Images

In this paper, we propose a new end-to-end model, called dual-discriminator conditional generative adversarial network (DDcGAN), for fusing infrared and visible images of different resolutions. Unlike the pixel-level methods and existing deep learning-based methods, the fusion task is accomplished through the adversarial process between a generator and two discriminators, in addition to the specially designed content loss. The generator is trained to generate real-like fused images to fool discriminators. The two discriminators are trained to calculate the JS divergence between the probability distribution of downsampled fused images and infrared images, and the JS divergence between the probability distribution of gradients of fused images and gradients of visible images, respectively. Thus, the fused images can compensate for the features that are not constrained by the single content loss. Consequently, the prominence of thermal targets in the infrared image and the texture details in the visible image can be preserved or even enhanced in the fused image simultaneously. Moreover, by constraining and distinguishing between the downsampled fused image and the low-resolution infrared image, DDcGAN can be preferably applied to the fusion of different resolution images. Qualitative and quantitative experiments on publicly available datasets demonstrate the superiority of our method over the state-of-the-art.

Download Full-text

Convolutional Neural Network Audio Classifier for Alarm Sound Detection

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f8866.088619 ◽

2019 ◽

Vol 8 (6) ◽

pp. 4554-4557

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Sound Recognition ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Differential Network ◽

Sound Detection ◽

Long Short Term Memory ◽

Lstm Network

Neural Networks (ANN) has evolved through many stages in the last three decades with many researchers contributing in this challenging field. With the power of math complex problems can also be solved by ANNs. ANNs like Convolutional Neural Network (CNN), Deep Neural network, Generative Adversarial Network (GAN), Long Short Term Memory (LSTM) network, Recurrent Neural Network (RNN), Ordinary Differential Network etc., are playing promising roles in many MNCs and IT industries for their predictions and accuracy. In this paper, Convolutional Neural Network is used for prediction of Beep sounds in high noise levels. Based on Supervised Learning, the research is developed the best CNN architecture for Beep sound recognition in noisy situations. The proposed method gives better results with an accuracy of 96%. The prototype is tested with few architectures for the training and test data out of which a two layer CNN classifier predictions were the best.

Download Full-text

Application of deep neural network and generative adversarial network to industrial maintenance: A case study of induction motor fault detection

2017 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2017.8258307 ◽

2017 ◽

Cited By ~ 21

Author(s):

Yong Oh Lee ◽

Jun Jo ◽

Jongwoon Hwang

Keyword(s):

Neural Network ◽

Fault Detection ◽

Induction Motor ◽

Deep Neural Network ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Industrial Maintenance

Download Full-text

Precise No-Reference Image Quality Evaluation Based on Distortion Identification

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3468872 ◽

2021 ◽

Vol 17 (3s) ◽

pp. 1-21

Author(s):

Chenggang Yan ◽

Tong Teng ◽

Yutao Liu ◽

Yongbing Zhang ◽

Haoqian Wang ◽

...

Keyword(s):

Neural Network ◽

Image Quality ◽

Quality Assessment ◽

Large Scale ◽

Quality Evaluation ◽

Image Quality Assessment ◽

State Of The Art ◽

Gaussian White Noise ◽

The State ◽

Reference Image

The difficulty of no-reference image quality assessment (NR IQA) often lies in the lack of knowledge about the distortion in the image, which makes quality assessment blind and thus inefficient. To tackle such issue, in this article, we propose a novel scheme for precise NR IQA, which includes two successive steps, i.e., distortion identification and targeted quality evaluation. In the first step, we employ the well-known Inception-ResNet-v2 neural network to train a classifier that classifies the possible distortion in the image into the four most common distortion types, i.e., Gaussian white noise (WN), Gaussian blur (GB), jpeg compression (JPEG), and jpeg2000 compression (JP2K). Specifically, the deep neural network is trained on the large-scale Waterloo Exploration database, which ensures the robustness and high performance of distortion classification. In the second step, after determining the distortion type of the image, we then design a specific approach to quantify the image distortion level, which can estimate the image quality specially and more precisely. Extensive experiments performed on LIVE, TID2013, CSIQ, and Waterloo Exploration databases demonstrate that (1) the accuracy of our distortion classification is higher than that of the state-of-the-art distortion classification methods, and (2) the proposed NR IQA method outperforms the state-of-the-art NR IQA methods in quantifying the image quality.

Download Full-text