Dialectical GAN for SAR Image Translation: From Sentinel-1 to TerraSAR-X

2018 ◽  
Vol 10 (10) ◽  
pp. 1597 ◽  
Author(s):  
Dongyang Ao ◽  
Corneliu Octavian Dumitru ◽  
Gottfried Schwarz ◽  
Mihai Datcu

Contrary to optical images, Synthetic Aperture Radar (SAR) images lie in a different part of the electromagnetic spectrum, one to which the human visual system is not accustomed. Thus, with more and more SAR applications, the demand for enhanced, high-quality SAR images has increased considerably. However, high-quality SAR images entail high costs, due to the limitations of current SAR devices and their image processing resources. To improve the quality of SAR images and to reduce the costs of their generation, we propose a Dialectical Generative Adversarial Network (Dialectical GAN) to generate high-quality SAR images. The method is based on the analysis of hierarchical SAR information and the “dialectical” structure of GAN frameworks. As a demonstration, a typical example is shown in which a low-resolution SAR image (e.g., a Sentinel-1 image) with large ground coverage is translated into a high-resolution SAR image (e.g., a TerraSAR-X image). Three traditional algorithms are compared, and a new algorithm is proposed based on a network framework that combines conditional WGAN-GP (Wasserstein Generative Adversarial Network with Gradient Penalty) loss functions and Spatial Gram matrices under the rule of dialectics. Experimental results show that the SAR image translation works very well when the results of the proposed method are compared with those of the selected traditional methods.
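The abstract mentions Spatial Gram matrices as a texture descriptor. As a rough illustration, here is the standard style-transfer Gram matrix in NumPy; the paper's spatial variant additionally encodes layout information, which this minimal sketch does not reproduce, so treat it as an assumption-laden stand-in rather than the authors' implementation:

```python
import numpy as np

def gram_matrix(features):
    """Gram matrix of one layer's feature maps, as in neural style
    transfer: G[i, j] correlates channels i and j over all spatial
    positions. `features` has shape (C, H, W)."""
    c, h, w = features.shape
    f = features.reshape(c, h * w)       # flatten the spatial dimensions
    return (f @ f.T) / (c * h * w)       # normalize by layer size

# A constant feature map yields a constant Gram matrix:
g = gram_matrix(np.ones((2, 3, 3)))      # every entry is 9 / 18 = 0.5
```

Matching such Gram matrices between generated and target images is what drives texture similarity in style-transfer-based losses.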



2021 ◽  
Vol 11 (4) ◽  
pp. 1464
Author(s):  
Chang Wook Seo ◽  
Yongduek Seo

There are various challenging issues in automating line art colorization. In this paper, we propose a GAN approach that incorporates semantic segmentation image data. Our GAN-based method, named Seg2pix, can automatically generate high-quality colorized images, aiming at computerizing one of the most tedious and repetitive jobs performed by coloring workers in the webtoon industry. The network structure of Seg2pix is largely a modification of the Pix2pix architecture, a convolution-based generative adversarial network for image-to-image translation. Through this method, we can generate high-quality colorized images of a particular character with only a small amount of training data. Seg2pix is designed to reproduce a segmented image, which becomes the suggestion data for line art colorization. The segmented image is automatically generated through a generative network from a line art image and a segmentation ground truth. In the next step, this generative network creates a colorized image from the line art and the segmented image generated in the former step. To summarize, only one line art image is required for testing the generative model, while an original colorized image and a segmented image are additionally required as the ground truth for training. The generation of the segmented image and the colorized image proceeds in an end-to-end manner, sharing the same loss functions. With this method, we produce better qualitative results for automatic colorization of a particular character’s line art. The improvement can also be measured quantitatively with a Learned Perceptual Image Patch Similarity (LPIPS) comparison. We believe this may help artists exercise their creative expertise mainly in areas where computerization is not yet capable.


2021 ◽  
Vol 13 (24) ◽  
pp. 5055
Author(s):  
Shihong Wang ◽  
Jiayi Guo ◽  
Yueting Zhang ◽  
Yuxin Hu ◽  
Chibiao Ding ◽  
...  

SAR tomography (TomoSAR) is an important technology for the three-dimensional (3D) reconstruction of buildings from multiple coherent SAR images. In order to obtain a sufficient signal-to-noise ratio (SNR), typical TomoSAR applications often require dozens of SAR images. However, limited by time and cost, only 3–5 images are often available in practice, which prevents the traditional TomoSAR technique from producing satisfactory SNR and elevation resolution. To tackle this problem, a conditional generative adversarial network (CGAN) is proposed to improve TomoSAR 3D reconstruction by learning prior information about buildings; moreover, the number of tracks required can be reduced to three. First, a TomoSAR 3D super-resolution dataset is constructed using high-quality data from the airborne array and low-quality data obtained from a small number of tracks sampled from all observations. Then, the CGAN model is trained to estimate the corresponding high-quality result from the low-quality input. Airborne data experiments prove that the reconstruction results are improved in areas with and without overlap, both qualitatively and quantitatively. Furthermore, the network pretrained on the airborne dataset is applied directly to the spaceborne dataset without any tuning and generates satisfactory results, proving the effectiveness and robustness of our method. A comparative experiment with a nonlocal algorithm also shows that the proposed method gives better height estimation and higher time efficiency.
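The dataset construction described above can be sketched as a track-subsampling step: a few tracks drawn from the full multi-baseline stack form the low-quality input, while the full airborne stack supplies the high-quality target. The stack dimensions below are illustrative; only the 3-track count comes from the abstract:

```python
import numpy as np

rng = np.random.default_rng(seed=0)
n_tracks, h, w = 30, 64, 64              # illustrative stack size

# Complex-valued SAR image stack: one coherent image per track.
stack = (rng.normal(size=(n_tracks, h, w))
         + 1j * rng.normal(size=(n_tracks, h, w)))

# Low-quality input: only 3 tracks sampled from all observations,
# matching the paper's reduced-track setting.
subset = rng.choice(n_tracks, size=3, replace=False)
low_quality_input = stack[subset]

# High-quality target: reconstruction from the full airborne array.
high_quality_target = stack
```

The CGAN then learns the mapping from reconstructions based on `low_quality_input` to those based on `high_quality_target`.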


In semantic image-to-image translation, the goal is to learn a mapping between an input image and an output image. A model of the semantic image-to-image translation problem using the CycleGAN algorithm is proposed. Given a set of paired or unpaired images, a transformation is learned to translate the input image into the specified domain. The dataset considered is the Cityscapes dataset, in which semantic label images are converted into photographic images. A Generative Adversarial Network variant, the CycleGAN algorithm with a cycle consistency loss, is used to transform the semantic image into a photographic (real) image. The cycle consistency loss compares the real image with the output of the second generator and contributes to the loss functions. In this paper, the model shows that with more training time we obtain more accurate results and improved image quality. The model can be used when images from one domain need to be converted into another domain in order to obtain high-quality images.
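The cycle-consistency idea can be sketched in a few lines: map an image to the other domain and back, then penalize the L1 difference from the original. The toy generators below are placeholders, not the paper's networks; the weight of 10 follows the original CycleGAN paper:

```python
import numpy as np

def cycle_consistency_loss(x, x_cycled, lam=10.0):
    """L1 cycle loss lam * ||F(G(x)) - x||_1 from CycleGAN."""
    return lam * np.mean(np.abs(x_cycled - x))

# Toy generators between domains A and B (stand-ins for networks).
G = lambda x: 2.0 * x     # A -> B
F = lambda y: y / 2.0     # B -> A

x = np.random.default_rng(1).normal(size=(3, 8, 8))
loss = cycle_consistency_loss(x, F(G(x)))   # exact inverse -> 0.0
```

During training this term is added to the adversarial losses of both generators, discouraging mappings that lose information about the input.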


Sensors ◽  
2019 ◽  
Vol 19 (4) ◽  
pp. 871 ◽  
Author(s):  
Chu He ◽  
Dehui Xiong ◽  
Qingyi Zhang ◽  
Mingsheng Liao

Thanks to the availability of large-scale data, deep Convolutional Neural Networks (CNNs) have witnessed success in various applications of computer vision. However, the performance of CNNs on Synthetic Aperture Radar (SAR) image classification is unsatisfactory due to the lack of well-labeled SAR data, as well as the differences in imaging mechanisms between SAR images and optical images. Therefore, this paper addresses the problem of SAR image classification by employing the Generative Adversarial Network (GAN) to produce more labeled SAR data. We propose special GANs for generating SAR images to be used in the training process. First, we incorporate the quadratic operation into the GAN, extending the convolution to make the discriminator better represent the SAR data; second, the statistical characteristics of SAR images are integrated into the GAN to make its value function more reasonable; finally, two types of parallel connected GANs are designed, one of which we call PWGAN, combining the Deep Convolutional GAN (DCGAN) and Wasserstein GAN with Gradient Penalty (WGAN-GP) together in the structure, and the other, which we call CNN-PGAN, applying a pre-trained CNN as a discriminator to the parallel GAN. Both PWGAN and CNN-PGAN consist of a number of discriminators and generators according to the number of target categories. Experimental results on the TerraSAR-X single polarization dataset demonstrate the effectiveness of the proposed method.
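The WGAN-GP objective used in PWGAN augments the Wasserstein critic loss with a gradient penalty, E[(||∇D(x̂)||₂ − 1)²], evaluated at points x̂ interpolated between real and generated samples. For a toy linear critic the gradient is known in closed form, so the formula can be checked without autograd; this is a sketch of the loss only, not of the paper's networks:

```python
import numpy as np

w = np.array([0.6, 0.8])                 # toy linear critic D(x) = w @ x, ||w|| = 1

def critic(x):
    return x @ w

def wgan_gp_loss(real, fake, lam=10.0):
    """Critic loss: E[D(fake)] - E[D(real)] + lam * gradient penalty.
    For a linear critic, grad_x D(x) = w at every point, so the
    penalty reduces to (||w||_2 - 1)^2 for any interpolate x_hat."""
    eps = np.random.default_rng(0).uniform(size=(len(real), 1))
    x_hat = eps * real + (1.0 - eps) * fake   # random interpolates
                                              # (gradient is constant here)
    grad_norm = np.linalg.norm(w)
    penalty = (grad_norm - 1.0) ** 2
    return critic(fake).mean() - critic(real).mean() + lam * penalty

loss = wgan_gp_loss(np.ones((4, 2)), np.zeros((4, 2)))   # 0 - 1.4 + 0 = -1.4
```

With a neural critic the per-sample gradient at each `x_hat` must be obtained by automatic differentiation, which is what the `x_hat` interpolation step is for.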


2021 ◽  
Vol 9 (7) ◽  
pp. 691
Author(s):  
Kai Hu ◽  
Yanwen Zhang ◽  
Chenghang Weng ◽  
Pengsheng Wang ◽  
Zhiliang Deng ◽  
...  

When underwater vehicles operate, the images they capture are often degraded by light absorption and by scattering and diffusion from suspended particles. The generative adversarial network (GAN) is widely used in underwater image enhancement tasks because it can perform image-style conversion with high efficiency and high quality. Although a GAN converts low-quality underwater images into high-quality ones (truth images), the quality of the generated images is limited by the truth images in the dataset; since these truth images are themselves not fully enhanced, the generated images can be unsatisfactory. This paper therefore proposes adding the natural image quality evaluation (NIQE) index to the GAN so that the generated images have higher contrast, agree better with human visual perception, and can even surpass the truth images in the existing dataset. Several groups of experiments are compared, and both subjective evaluations and objective evaluation indicators verify that the images enhanced by this algorithm are better than the truth images in the existing dataset.
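The modification described above amounts to adding a no-reference quality term to the generator objective. The sketch below uses a placeholder `niqe_score` function and a hypothetical weight `beta`, since the abstract does not give the exact formulation; a real NIQE implementation would replace the placeholder:

```python
import numpy as np

def generator_loss(adversarial_loss, generated_image, niqe_score, beta=0.1):
    """Adversarial loss plus a weighted NIQE term. Lower NIQE means
    better perceptual quality, so minimizing this combined loss pushes
    the generator toward images the NIQE metric rates highly.
    `niqe_score` and `beta` are illustrative assumptions, not taken
    from the paper."""
    return adversarial_loss + beta * niqe_score(generated_image)

# Placeholder scorer: pretends every image gets a NIQE score of 5.0.
fake_niqe = lambda img: 5.0
loss = generator_loss(1.0, np.zeros((8, 8)), fake_niqe)   # 1.0 + 0.1 * 5.0 = 1.5
```

Because NIQE needs no reference image, this term can reward quality beyond what the dataset's truth images exhibit.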


Optik ◽  
2021 ◽  
Vol 227 ◽  
pp. 166060
Author(s):  
Yangdi Hu ◽  
Zhengdong Cheng ◽  
Xiaochun Fan ◽  
Zhenyu Liang ◽  
Xiang Zhai

Proceedings ◽  
2021 ◽  
Vol 77 (1) ◽  
pp. 17
Author(s):  
Andrea Giussani

In the last decade, advances in statistical modeling and computer science have boosted the production of machine-generated content in different fields: from language to image generation, the quality of the generated outputs is remarkably high, sometimes better than what a human being produces. Modern technological advances such as OpenAI’s GPT-2 (and, recently, GPT-3) permit automated systems to dramatically alter reality with synthetic outputs, so that humans are not able to distinguish the real copy from its synthetic counterpart. One example is an article entirely written by GPT-2, but many others exist. In the field of computer vision, Nvidia’s Generative Adversarial Network, commonly known as StyleGAN (Karras et al. 2018), has become the de facto reference point for the production of a huge number of fake human face portraits; additionally, recent algorithms have been developed to create both musical scores and mathematical formulas. This presentation aims to bring participants up to date on the state-of-the-art results in this field: we will cover both GANs and language modeling with recent applications. The novelty here is that we apply a transformer-based machine learning technique, namely RoBERTa (Liu et al. 2019), to the detection of human-produced versus machine-produced text in the context of fake news detection. RoBERTa builds on the well-known Bidirectional Encoder Representations from Transformers algorithm, known as BERT (Devlin et al. 2018), a bidirectional transformer for natural language processing developed by Google and pre-trained over a huge amount of unlabeled textual data to learn embeddings. We then use these representations as the input of our classifier to detect real versus machine-produced text. The application is demonstrated in the presentation.

