Image-to-Image Translation Using Identical-Pair Adversarial Networks

Thai Leang Sung; Hyo Jong Lee

doi:10.3390/app9132668

Applied Sciences ◽

10.3390/app9132668 ◽

2019 ◽

Vol 9 (13) ◽

pp. 2668 ◽

Cited By ~ 2

Author(s):

Thai Leang Sung ◽

Hyo Jong Lee

Keyword(s):

Loss Function ◽

Network Architectures ◽

Perceptual Similarity ◽

Image Transformation ◽

Identical Pair ◽

Similarity Network ◽

Adversarial Networks ◽

Image Translation ◽

Minimax Game

We propose Identical-pair Adversarial Networks (iPANs) to solve image-to-image translation problems such as aerial-to-map, edge-to-photo, de-raining, and night-to-daytime. iPANs rely mainly on the effectiveness of their adversarial loss function and network architectures. They consist of two main networks: an image transformation network T and a discriminative network D. We use U-Net for the transformation network T and, for network D, a perceptual similarity network with two streams of VGG16 that share the same weights. Our proposed adversarial losses play a minimax game against each other based on a real identical-pair and a fake identical-pair distinguished by the discriminative network D; that is, D considers two inputs a real pair only when they are identical, and a fake pair otherwise. Meanwhile, the transformation network T tries to persuade D that the fake pair is a real pair. We experimented on several image-to-image translation problems and achieved results comparable to those of existing approaches such as pix2pix and PAN.
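The identical-pair idea in the abstract above can be illustrated with a minimal sketch. It assumes a standard cross-entropy form of the minimax game, which may differ from the losses actually used in the paper. `toy_discriminator` is a hypothetical stand-in for the two-stream VGG16 similarity network D, and the lists `target`, `poor` and `good` are made-up data standing in for a ground-truth image and two outputs of T.

```python
import math

def toy_discriminator(a, b):
    """Hypothetical stand-in for D: score in (0, 1], equal to 1 only when a == b."""
    mean_abs_diff = sum(abs(p - q) for p, q in zip(a, b)) / len(a)
    return math.exp(-mean_abs_diff)

def discriminator_loss(d_real, d_fake, eps=1e-7):
    # D is rewarded for scoring the real pair (y, y) near 1
    # and the fake pair (T(x), y) near 0.
    return -math.log(d_real + eps) - math.log(1.0 - d_fake + eps)

def transformer_loss(d_fake, eps=1e-7):
    # T is rewarded when D scores the fake pair (T(x), y) near 1.
    return -math.log(d_fake + eps)

# Made-up 4-pixel "images": a target y and two candidate outputs of T.
target = [0.2, 0.8, 0.5, 0.1]
poor = [0.9, 0.1, 0.0, 0.7]
good = [0.21, 0.79, 0.5, 0.1]

d_real = toy_discriminator(target, target)      # real identical pair (y, y)
d_fake_poor = toy_discriminator(poor, target)   # fake pair, far from y
d_fake_good = toy_discriminator(good, target)   # fake pair, close to y

print("T loss (poor, good):", transformer_loss(d_fake_poor), transformer_loss(d_fake_good))
print("D loss (poor, good):", discriminator_loss(d_real, d_fake_poor),
      discriminator_loss(d_real, d_fake_good))
```

In this toy setting the two losses pull in opposite directions: as the output of T approaches the target, the loss of T falls while the loss of D rises.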

Enforcing perceptual consistency on Generative Adversarial Networks by using the Normalised Laplacian Pyramid Distance

Proceedings of the Northern Lights Deep Learning Workshop ◽

10.7557/18.5124 ◽

2020 ◽

Vol 1 ◽

pp. 6

Author(s):

Alexander Hepburn ◽

Valero Laparra ◽

Ryan McConville ◽

Raul Santos-Rodriguez

Keyword(s):

Visual Inspection ◽

Human Perception ◽

Generative Adversarial Networks ◽

Perceptual Similarity ◽

Image Generation ◽

Laplacian Pyramid ◽

Training Process ◽

Adversarial Networks ◽

Image Translation

In recent years there has been a growing interest in image generation through deep learning. While an important part of the evaluation of the generated images usually involves visual inspection, the inclusion of human perception as a factor in the training process is often overlooked. In this paper we propose an alternative perceptual regulariser for image-to-image translation using conditional generative adversarial networks (cGANs). To do so automatically (avoiding visual inspection), we use the Normalised Laplacian Pyramid Distance (NLPD) to measure the perceptual similarity between the generated image and the original image. The NLPD is based on the principle of normalising the value of coefficients with respect to a local estimate of mean energy at different scales and has already been successfully tested in different experiments involving human perception. We compare this regulariser with the originally proposed L1 distance and note that when using NLPD the generated images contain more realistic values for both local and global contrast.
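As a sketch of where such a regulariser sits, the objective below follows the usual conditional-GAN formulation with an added distance term. The symbols G, D, x, y and the weight \lambda are notation assumed here, not taken from the abstract, and the generator's noise input is omitted for brevity.

```latex
G^{*} = \arg\min_{G}\max_{D}\; \mathcal{L}_{\mathrm{cGAN}}(G, D) + \lambda\,\mathcal{L}_{\mathrm{reg}}(G),
\qquad
\mathcal{L}_{\mathrm{reg}}(G) =
\begin{cases}
\mathbb{E}_{x,y}\big[\lVert y - G(x)\rVert_{1}\big] & \text{(L1 baseline)}\\[4pt]
\mathbb{E}_{x,y}\big[\mathrm{NLPD}\big(y, G(x)\big)\big] & \text{(perceptual regulariser)}
\end{cases}
```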

Co-DGAN: cooperating discriminator generative adversarial networks for unpaired image-to-image translation

Soft Computing ◽

10.1007/s00500-020-05475-6 ◽

2021 ◽

Author(s):

Huajun Liu ◽

Ziyan Wang ◽

Haigang Sui ◽

Qing Zhu ◽

Shubo Liu ◽

...

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

Image Translation

Generative Adversarial Networks (GANs)

ACM Computing Surveys ◽

10.1145/3446374 ◽

2021 ◽

Vol 54 (3) ◽

pp. 1-42

Author(s):

Divya Saxena ◽

Jiannong Cao

Keyword(s):

Optimization Technique ◽

Generative Models ◽

Generative Adversarial Networks ◽

Network Architectures ◽

Research Directions ◽

Research Issues ◽

Design And Optimization ◽

Adversarial Networks ◽

Comprehensive Survey

Generative Adversarial Networks (GANs) are a novel class of deep generative models that has recently gained significant attention. GANs learn complex and high-dimensional distributions implicitly over images, audio, and data. However, there exist major challenges in training GANs, i.e., mode collapse, non-convergence, and instability, due to inappropriate design of network architecture, use of objective function, and selection of optimization algorithm. Recently, to address these challenges, several solutions for better design and optimization of GANs have been investigated based on techniques of re-engineered network architectures, new objective functions, and alternative optimization algorithms. To the best of our knowledge, there is no existing survey that has particularly focused on the broad and systematic developments of these solutions. In this study, we perform a comprehensive survey of the advancements in GAN design and optimization solutions proposed to handle these challenges. We first identify key research issues within each design and optimization technique and then propose a new taxonomy to structure solutions by key research issues. In accordance with the taxonomy, we provide a detailed discussion of the different GAN variants proposed within each solution and their relationships. Finally, based on the insights gained, we present promising research directions in this rapidly growing field.

GAN meets chemometrics: segmenting spectral images with pixel2pixel image translation with conditional generative adversarial networks

Chemometrics and Intelligent Laboratory Systems ◽

10.1016/j.chemolab.2021.104362 ◽

2021 ◽

pp. 104362

Author(s):

Puneet Mishra ◽

Ittai Herrmann

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

Image Translation

Frequency-domain loss function for deep exposure correction of dark images

Signal Image and Video Processing ◽

10.1007/s11760-021-01915-4 ◽

2021 ◽

Author(s):

Ojasvi Yadav ◽

Koustav Ghosal ◽

Sebastian Lutz ◽

Aljosa Smolic

Keyword(s):

Loss Function ◽

Correct Choice ◽

Frequency Space ◽

Multi Scale ◽

Quantitative Metrics ◽

Image Translation ◽

Domain Loss ◽

In The Wild ◽

End To End ◽

Classical Image

We address the problem of exposure correction of dark, blurry and noisy images captured in low-light conditions in the wild. Classical image-denoising filters work well in the frequency space but are constrained by several factors such as the correct choice of thresholds and frequency estimates. On the other hand, traditional deep networks are trained end to end in the RGB space by formulating this task as an image translation problem. However, that is done without any explicit constraints on the inherent noise of the dark images and thus produces noisy and blurry outputs. To this end, we propose a DCT/FFT-based multi-scale loss function which, when combined with traditional losses, trains a network to translate the important features for visually pleasing output. Our loss function is end-to-end differentiable, scale-agnostic and generic; i.e., it can be applied to both RAW and JPEG images in most existing frameworks without additional overhead. Using this loss function, we report significant improvements over the state of the art using quantitative metrics and subjective tests.
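A multi-scale frequency-domain loss of this kind can be sketched as follows. This is an illustration under simplifying assumptions, not the authors' formulation: it uses an unnormalised DCT-II computed naively, 2x2 average pooling between scales, and an L1 distance between coefficients. The function names and the `scales` parameter are hypothetical.

```python
import math

def dct_1d(v):
    """Naive unnormalised DCT-II of a 1-D sequence."""
    n = len(v)
    return [sum(v[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
            for k in range(n)]

def dct_2d(img):
    """Separable 2-D DCT: transform rows first, then columns."""
    rows = [dct_1d(r) for r in img]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]

def downsample(img):
    """Halve the resolution with 2x2 average pooling."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
             for x in range(0, w - 1, 2)]
            for y in range(0, h - 1, 2)]

def frequency_loss(pred, target, scales=2):
    """Sum over scales of the mean absolute difference between DCT coefficients.

    Assumes both images have the same size and that each side is divisible
    by 2 ** (scales - 1).
    """
    total = 0.0
    for _ in range(scales):
        fp, ft = dct_2d(pred), dct_2d(target)
        n = len(fp) * len(fp[0])
        total += sum(abs(a - b) for rp, rt in zip(fp, ft) for a, b in zip(rp, rt)) / n
        pred, target = downsample(pred), downsample(target)
    return total

# Made-up 4x4 images: a clean target and a copy with one perturbed pixel.
clean = [[0.1, 0.2, 0.3, 0.4],
         [0.2, 0.3, 0.4, 0.5],
         [0.3, 0.4, 0.5, 0.6],
         [0.4, 0.5, 0.6, 0.7]]
noisy = [row[:] for row in clean]
noisy[1][2] += 0.3

print("loss(clean, clean):", frequency_loss(clean, clean))
print("loss(noisy, clean):", frequency_loss(noisy, clean))
```

In training, a term like this would be added to a conventional pixel-space loss, as the abstract describes, rather than used alone.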

Aircraft Type Recognition in Remote Sensing Images Based on Feature Learning with Conditional Generative Adversarial Networks

Remote Sensing ◽

10.3390/rs10071123 ◽

2018 ◽

Vol 10 (7) ◽

pp. 1123 ◽

Cited By ~ 8

Author(s):

Yuhang Zhang ◽

Hao Sun ◽

Jiawei Zuo ◽

Hongqi Wang ◽

Guangluan Xu ◽

...

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Loss Function ◽

Extraction Method ◽

Generative Adversarial Networks ◽

Support Vector ◽

Svm Classifier ◽

Feature Extraction Method ◽

Adversarial Networks ◽

Aircraft Type

Aircraft type recognition plays an important role in remote sensing image interpretation. Traditional methods suffer from poor generalization performance, while deep learning methods require large amounts of data with type labels, which are expensive and time-consuming to obtain. To overcome these problems, we propose an aircraft type recognition framework based on conditional generative adversarial networks (GANs). First, we design a new method to precisely detect aircraft keypoints, which are used to generate aircraft masks and locate the positions of the aircraft. Second, a conditional GAN with a region of interest (ROI)-weighted loss function is trained on unlabeled aircraft images and their corresponding masks. Third, an ROI feature extraction method is carefully designed to extract multi-scale features from the GAN in the aircraft regions. After that, a linear support vector machine (SVM) classifier is adopted to classify each sample using its features. Benefiting from the GAN, we can learn features that are strong enough to represent aircraft based on a large unlabeled dataset. Additionally, the ROI-weighted loss function and the ROI feature extraction method make the features more related to the aircraft than to the background, which improves the quality of the features and increases the recognition accuracy significantly. Thorough experiments were conducted on a challenging dataset, and the results demonstrate the effectiveness of the proposed aircraft type recognition framework.
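The effect of an ROI-weighted loss can be shown with a small sketch. It assumes a weighted L1 reconstruction loss in which pixels inside the mask receive a larger weight than background pixels. The function name, the `roi_weight` value and the flattened four-pixel images are all hypothetical, and the paper's actual weighting scheme may differ.

```python
def roi_weighted_l1(pred, target, mask, roi_weight=5.0):
    """Weighted mean absolute error: ROI pixels (mask == 1) count roi_weight times."""
    weighted_error = 0.0
    weight_sum = 0.0
    for p, t, m in zip(pred, target, mask):
        w = roi_weight if m else 1.0
        weighted_error += w * abs(p - t)
        weight_sum += w
    return weighted_error / weight_sum

# Flattened made-up images; the two middle pixels belong to the aircraft mask.
target = [0.0, 1.0, 1.0, 0.0]
mask = [0, 1, 1, 0]
error_in_roi = [0.0, 0.6, 1.0, 0.0]         # same-size error, placed inside the ROI
error_in_background = [0.4, 1.0, 1.0, 0.0]  # same-size error, placed in the background

loss_roi = roi_weighted_l1(error_in_roi, target, mask)
loss_background = roi_weighted_l1(error_in_background, target, mask)
print("error inside ROI:", loss_roi)
print("error in background:", loss_background)
```

An error of the same size costs more inside the mask, which pushes the network to spend its capacity on the aircraft rather than on the background.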

Semi-paired Image-to-Image Translation using Neighbor-based Generative Adversarial Networks

10.1109/ijcnn52387.2021.9534353 ◽

2021 ◽

Author(s):

Le Xu ◽

Weiling Cai ◽

Honghan Zhou

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

Image Translation

Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation

2018 ACM Multimedia Conference - MM '18 ◽

10.1145/3240508.3240716 ◽

2018 ◽

Cited By ~ 7

Author(s):

Xuewen Yang ◽

Dongliang Xie ◽

Xin Wang

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

Image Translation

Best Selection of Generative Adversarial Networks Hyper-Parameters using Genetic Algorithm

10.21203/rs.3.rs-95571/v1 ◽

2020 ◽

Author(s):

Fajr Alarsan ◽

Mamoon Younes

Keyword(s):

Genetic Algorithm ◽

Programming Language ◽

Loss Function ◽

Generative Adversarial Networks ◽

Accuracy Function ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Parameters Selection

Generative Adversarial Networks (GANs) are among the most popular generative frameworks and have achieved compelling performance. They follow an adversarial approach in which two deep models, a generator and a discriminator, compete with each other. In this paper, we propose a Generative Adversarial Network with best hyper-parameter selection: the generator produces fake images of the digits 1 to 9, and the discriminator is trained to decide whether the generated images are fake or real. Because a Genetic Algorithm (GA) is used to adapt the GAN hyper-parameters, the final method is named GANGA: Generative Adversarial Network with Genetic Algorithm. The work was implemented in Python with the TensorFlow library in an Anaconda environment, and validated on the MNIST dataset. The proposed method lets the genetic algorithm choose the best hyper-parameter values by minimizing a cost function, such as a loss function, or by maximizing an accuracy function. The GA was used to select values for the learning rate, batch normalization, the number of neurons, and the parameter of the dropout layer.
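The selection loop described above can be sketched with a small genetic algorithm. This is a toy illustration, not the GANGA implementation: `toy_cost` is a hypothetical stand-in for training a GAN and measuring its loss, and the search ranges, population size, truncation selection and mutation rate are all assumptions.

```python
import math
import random

def random_individual(rng):
    """One candidate set of hyper-parameters (ranges are assumptions)."""
    return {
        "learning_rate": 10 ** rng.uniform(-5, -1),
        "dropout": rng.uniform(0.0, 0.8),
        "neurons": rng.choice([32, 64, 128, 256, 512]),
        "batch_norm": rng.choice([True, False]),
    }

def toy_cost(ind):
    """Hypothetical stand-in for 'train the GAN and return its loss'."""
    return ((math.log10(ind["learning_rate"]) + 3) ** 2
            + (ind["dropout"] - 0.3) ** 2
            + (math.log2(ind["neurons"]) - 7) ** 2
            + (0.0 if ind["batch_norm"] else 0.5))

def crossover(a, b, rng):
    # Uniform crossover: each gene comes from one of the two parents.
    return {k: (a[k] if rng.random() < 0.5 else b[k]) for k in a}

def mutate(ind, rng, rate=0.2):
    # Each gene is replaced by a freshly sampled value with probability `rate`.
    fresh = random_individual(rng)
    return {k: (fresh[k] if rng.random() < rate else v) for k, v in ind.items()}

def evolve(generations=20, pop_size=12, seed=0):
    rng = random.Random(seed)
    population = [random_individual(rng) for _ in range(pop_size)]
    initial_best = min(toy_cost(p) for p in population)
    for _ in range(generations):
        population.sort(key=toy_cost)
        parents = population[: pop_size // 2]  # the fitter half survives unchanged
        children = [mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    best = min(population, key=toy_cost)
    return best, toy_cost(best), initial_best

best, best_cost, initial_best = evolve()
print("best hyper-parameters:", best)
print("cost went from", initial_best, "to", best_cost)
```

Because the fitter half of each generation is carried over unchanged, the best cost never gets worse from one generation to the next. In a real setting each cost evaluation would be a full training run, so population size and generation count are limited by compute.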

Perceptual Adversarial Networks With a Feature Pyramid for Image Translation

IEEE Computer Graphics and Applications ◽

10.1109/mcg.2019.2914426 ◽

2019 ◽

Vol 39 (4) ◽

pp. 68-77

Author(s):

Zhuorong Li ◽

Minghui Wu ◽

Jianwei Zheng ◽

Hongchuan Yu

Keyword(s):

Adversarial Networks ◽

Image Translation ◽

Feature Pyramid
