Zernike Coefficient Prediction Technique for Interference Based on Generation Adversarial Network

Allen Jong-Woei Whang; Yi-Yung Chen; Tsai-Hsien Yang; Cheng-Tse Lin; Zhi-Jia Jian; Chun-Han Chou

doi:10.3390/app11156933

Zernike Coefficient Prediction Technique for Interference Based on Generation Adversarial Network

Applied Sciences ◽

10.3390/app11156933 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6933

Author(s):

Allen Jong-Woei Whang ◽

Yi-Yung Chen ◽

Tsai-Hsien Yang ◽

Cheng-Tse Lin ◽

Zhi-Jia Jian ◽

...

Keyword(s):

Ground Truth ◽

The Novel ◽

Simulated Image ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Mosaic Image ◽

Image Translation ◽

Prediction Technique ◽

Simulated Images ◽

The Ideal

In the paper, we propose a novel prediction technique to predict Zernike coefficients from interference fringes based on Generative Adversarial Network (GAN). In general, the task of GAN is image-to-image translation, but we design GAN for image-to-number translation. In the GAN model, the Generator’s input is the interference fringe image, and its output is a mosaic image. Moreover, each piece of the mosaic image links to the number of Zernike coefficients. Root Mean Square Error (RMSE) is our criterion for quantifying the ground truth and prediction coefficients. After training the GAN model, we use two different methods: the formula (ideal images) and optics simulation (simulated images) to estimate the GAN model. As a result, the RMSE is about 0.0182 ± 0.0035λ with the ideal image case and the RMSE is about 0.101 ± 0.0263λ with the simulated image case. Since the outcome in the simulated image case is poor, we use the transfer learning method to improve the RMSE to about 0.0586 ± 0.0035λ. The prediction technique applies not only to the ideal case but also to the actual interferometer. In addition, the novel prediction technique makes predicting Zernike coefficients more accurate than our previous research.

Download Full-text

Seg2pix: Few Shot Training Line Art Colorization with Segmented Image Data

Applied Sciences ◽

10.3390/app11041464 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1464

Author(s):

Chang Wook Seo ◽

Yongduek Seo

Keyword(s):

Image Data ◽

Ground Truth ◽

Semantic Segmentation ◽

Training Data ◽

High Quality ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Segmented Image ◽

Image Translation ◽

Segmentation Image

There are various challenging issues in automating line art colorization. In this paper, we propose a GAN approach incorporating semantic segmentation image data. Our GAN-based method, named Seg2pix, can automatically generate high quality colorized images, aiming at computerizing one of the most tedious and repetitive jobs performed by coloring workers in the webtoon industry. The network structure of Seg2pix is mostly a modification of the architecture of Pix2pix, which is a convolution-based generative adversarial network for image-to-image translation. Through this method, we can generate high quality colorized images of a particular character with only a few training data. Seg2pix is designed to reproduce a segmented image, which becomes the suggestion data for line art colorization. The segmented image is automatically generated through a generative network with a line art image and a segmentation ground truth. In the next step, this generative network creates a colorized image from the line art and segmented image, which is generated from the former step of the generative network. To summarize, only one line art image is required for testing the generative model, and an original colorized image and segmented image are additionally required as the ground truth for training the model. These generations of the segmented image and colorized image proceed by an end-to-end method sharing the same loss functions. By using this method, we produce better qualitative results for automatic colorization of a particular character’s line art. This improvement can also be measured by quantitative results with Learned Perceptual Image Patch Similarity (LPIPS) comparison. We believe this may help artists exercise their creative expertise mainly in the area where computerization is not yet capable.

Download Full-text

Equivariant Adversarial Network for Image-to-image Translation

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3458280 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-14

Author(s):

Masoumeh Zareapoor ◽

Jie Yang

Keyword(s):

State Of The Art ◽

Generative Models ◽

Generative Model ◽

Target Domain ◽

Adversarial Network ◽

Proposed Model ◽

Image Translation ◽

Great Performance ◽

Representative Model ◽

The Ideal

Image-to-Image translation aims to learn an image from a source domain to a target domain. However, there are three main challenges, such as lack of paired datasets, multimodality, and diversity, that are associated with these problems and need to be dealt with. Convolutional neural networks (CNNs), despite of having great performance in many computer vision tasks, they fail to detect the hierarchy of spatial relationships between different parts of an object and thus do not form the ideal representative model we look for. This article presents a new variation of generative models that aims to remedy this problem. We use a trainable transformer, which explicitly allows the spatial manipulation of data within training. This differentiable module can be augmented into the convolutional layers in the generative model, and it allows to freely alter the generated distributions for image-to-image translation. To reap the benefits of proposed module into generative model, our architecture incorporates a new loss function to facilitate an effective end-to-end generative learning for image-to-image translation. The proposed model is evaluated through comprehensive experiments on image synthesizing and image-to-image translation, along with comparisons with several state-of-the-art algorithms.

Download Full-text

LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images

2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros45743.2020.9341556 ◽

2020 ◽

Author(s):

Shan Lin ◽

Fangbo Qin ◽

Yangming Li ◽

Randall A. Bly ◽

Kris S. Moe ◽

...

Keyword(s):

Generative Adversarial Network ◽

Endoscopic Images ◽

Adversarial Network ◽

Image Translation

Download Full-text

SGAN4AbSum: A Semantic-Enhanced Generative Adversarial Network for Abstractive Text Summarization

10.21203/rs.3.rs-648146/v1 ◽

2021 ◽

Author(s):

Tham Vo

Keyword(s):

Ground Truth ◽

Text Summarization ◽

Generative Adversarial Network ◽

Convolutional Network ◽

Training Strategy ◽

Adversarial Network ◽

Deep Recurrent Neural Network ◽

Benchmark Datasets ◽

Latent Representations ◽

Abstractive Summarization

Abstract In abstractive summarization task, most of proposed models adopt the deep recurrent neural network (RNN)-based encoder-decoder architecture to learn and generate meaningful summary for a given input document. However, most of recent RNN-based models always suffer the challenges related to the involvement of much capturing high-frequency/reparative phrases in long documents during the training process which leads to the outcome of trivial and generic summaries are generated. Moreover, the lack of thorough analysis on the sequential and long-range dependency relationships between words within different contexts while learning the textual representation also make the generated summaries unnatural and incoherent. To deal with these challenges, in this paper we proposed a novel semantic-enhanced generative adversarial network (GAN)-based approach for abstractive text summarization task, called as: SGAN4AbSum. We use an adversarial training strategy for our text summarization model in which train the generator and discriminator to simultaneously handle the summary generation and distinguishing the generated summary with the ground-truth one. The input of generator is the jointed rich-semantic and global structural latent representations of training documents which are achieved by applying a combined BERT and graph convolutional network (GCN) textual embedding mechanism. Extensive experiments in benchmark datasets demonstrate the effectiveness of our proposed SGAN4AbSum which achieve the competitive ROUGE-based scores in comparing with state-of-the-art abstractive text summarization baselines.

Download Full-text

Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation

Sensors ◽

10.3390/s19132919 ◽

2019 ◽

Vol 19 (13) ◽

pp. 2919 ◽

Cited By ~ 2

Author(s):

Wangyong He ◽

Zhongzhao Xie ◽

Yongbo Li ◽

Xinmei Wang ◽

Wendi Cai

Keyword(s):

Pose Estimation ◽

Ground Truth ◽

Training Image ◽

Training Data ◽

Generative Adversarial Network ◽

Style Transfer ◽

Visual Appearance ◽

Hand Pose Estimation ◽

Adversarial Network ◽

Hand Pose

Hand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand pose, the developed method can generate depth hand images. To be specific, a ground truth can be 3D hand poses with the hand structure contained, while the synthesized image has an identical size to that of the training image and a similar visual appearance to the training set. The developed method, inspired by the progress in the generative adversarial network (GAN) and image-style transfer, helps model the latent statistical relationship between the ground-truth hand pose and the corresponding depth hand image. The images synthesized using the developed method are demonstrated to be feasible for enhancing performance. On public hand pose datasets (NYU, MSRA, ICVL), comprehensive experiments prove that the developed method outperforms the existing works.

Download Full-text

Joint Entity and Event Extraction with Generative Adversarial Imitation Learning

Data Intelligence ◽

10.1162/dint_a_00014 ◽

2019 ◽

Vol 1 (2) ◽

pp. 99-120 ◽

Cited By ~ 6

Author(s):

Tongtao Zhang ◽

Heng Ji ◽

Avirup Sil

Keyword(s):

State Of The Art ◽

Ground Truth ◽

Event Extraction ◽

Imitation Learning ◽

Learning Method ◽

Inverse Reinforcement Learning ◽

Generative Adversarial Network ◽

Adversarial Network ◽

The Difference ◽

New Framework

We propose a new framework for entity and event extraction based on generative adversarial imitation learning—an inverse reinforcement learning method using a generative adversarial network (GAN). We assume that instances and labels yield to various extents of difficulty and the gains and penalties (rewards) are expected to be diverse. We utilize discriminators to estimate proper rewards according to the difference between the labels committed by the ground-truth (expert) and the extractor (agent). Our experiments demonstrate that the proposed framework outperforms state-of-the-art methods.

Download Full-text

A NEPHOGRAM PREDICTION METHOD BASED ON GENERATIVE ADVERSARIAL NETWORK

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-w10-925-2020 ◽

2020 ◽

Vol XLII-3/W10 ◽

pp. 925-930

Author(s):

Y. Xun ◽

W. Q. Yu

Keyword(s):

Loss Function ◽

Internal Representation ◽

Prediction Method ◽

Ground Truth ◽

Generative Adversarial Network ◽

Multi Scale ◽

Adversarial Network ◽

Basic Characteristics ◽

Cloud Evolution

Abstract. As one of the important sources of meteorological information, satellite nephogram is playing an increasingly important role in the detection and forecast of disastrous weather. The predictions about the movement and transformation of cloud with certain timeliness can enhance the practicability of satellite nephogram. Based on the generative adversarial network in unsupervised learning, we propose a prediction model of time series nephogram, which construct the internal representation of cloud evolution accurately and realize nephogram prediction for the next several hours. We improve the traditional generative adversarial network by constructing the generator and discriminator used the multi-scale convolution network. After the scale transform process, different scales operate convolutions in parallel and then merge the features. This structure can solve the problem of long-term dependence in the traditional network, and both global and detailed features are considered. Then according to the network structure and practical application, we define a new loss function combined with adversarial loss function to accelerate the convergence of model and sharpen predictions which keeps the effectivity of predictions further. Our method has no need to carry out the stack mathematics calculation and the manual operations, has greatly enhanced the feasibility and the efficiency. The results show that this model can reasonably describe the basic characteristics and evolution trend of cloud cluster, the prediction nephogram has very high similarity to the ground-truth nephogram.

Download Full-text

Fast and accurate learned multiresolution dynamical downscaling for precipitation

10.5194/gmd-2020-412 ◽

2021 ◽

Author(s):

Jiali Wang ◽

Zhengchun Liu ◽

Ian Foster ◽

Won Chang ◽

Rajkumar Kettimuthu ◽

...

Keyword(s):

Neural Network ◽

High Resolution ◽

Dynamical Downscaling ◽

Computational Cost ◽

Super Resolution ◽

Ground Truth ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Spatial And Temporal Distributions ◽

Data Variability

Abstract. This study develops a neural network-based approach for emulating high-resolution modeled precipitation data with comparable statistical properties but at greatly reduced computational cost. The key idea is to use combination of low- and high- resolution simulations to train a neural network to map from the former to the latter. Specifically, we define two types of CNNs, one that stacks variables directly and one that encodes each variable before stacking, and we train each CNN type both with a conventional loss function, such as mean square error (MSE), and with a conditional generative adversarial network (CGAN), for a total of four CNN variants.We compare the four new CNN-derived high-resolution precipitation results with precipitation generated from original high resolution simulations, a bilinear interpolater and the state-of-the-art CNN-based super-resolution (SR) technique. Results show that the SR technique produces results similar to those of the bilinear interpolator with smoother spatial and temporal distributions and smaller data variabilities and extremes than the high resolution simulations. While the new CNNs trained by MSE generate better results over some regions than the interpolator and SR technique do, their predictions are still not as close as ground truth. The CNNs trained by CGAN generate more realistic and physically reasonable results, better capturing not only data variability in time and space but also extremes such as intense and long-lasting storms. The new proposed CNN-based downscaling approach can downscale precipitation from 50 km to 12 km in 14 min for 30 years once the network is trained (training takes 4 hours using 1 GPU), while the conventional dynamical downscaling would take 1 months using 600 CPU cores to generate simulations at the resolution of 12 km over contiguous United States.

Download Full-text

An Adversarial and Densely Dilated Network for Connectomes Segmentation

Symmetry ◽

10.3390/sym10100467 ◽

2018 ◽

Vol 10 (10) ◽

pp. 467 ◽

Cited By ~ 2

Author(s):

Ke Chen ◽

Dandan Zhu ◽

Jianwei Lu ◽

Ye Luo

Keyword(s):

State Of The Art ◽

Piriform Cortex ◽

Receptive Fields ◽

Ground Truth ◽

Training Data ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Boundary Map ◽

Segmentation Task ◽

The Brain

Automatic reconstructing of neural circuits in the brain is one of the most crucial studies in neuroscience. Connectomes segmentation plays an important role in reconstruction from electron microscopy (EM) images; however, it is rather challenging due to highly anisotropic shapes with inferior quality and various thickness. In our paper, we propose a novel connectomes segmentation framework called adversarial and densely dilated network (ADDN) to address these issues. ADDN is based on the conditional Generative Adversarial Network (cGAN) structure which is the latest advance in machine learning with power to generate images similar to the ground truth especially when the training data is limited. Specifically, we design densely dilated network (DDN) as the segmentor to allow a deeper architecture and larger receptive fields for more accurate segmentation. Discriminator is trained to distinguish generated segmentation from manual segmentation. During training, such adversarial loss function is optimized together with dice loss. Extensive experimental results demonstrate that our ADDN is effective for such connectomes segmentation task, helping to retrieve more accurate segmentation and attenuate the blurry effects of generated boundary map. Our method obtains state-of-the-art performance while requiring less computation on ISBI 2012 EM dataset and mouse piriform cortex dataset.

Download Full-text

Asymmetric Encoder-Decoder Structured FCN Based LiDAR to Color Image Generation

Sensors ◽

10.3390/s19214818 ◽

2019 ◽

Vol 19 (21) ◽

pp. 4818 ◽

Cited By ~ 2

Author(s):

Hyun-Koo Kim ◽

Kook-Yeol Yoo ◽

Ju H. Park ◽

Ho-Youl Jung

Keyword(s):

Color Image ◽

Ground Truth ◽

Sensor Data ◽

Image Generation ◽

Generative Adversarial Network ◽

Convolutional Network ◽

Adversarial Network ◽

Ground Truth Image ◽

And Performance ◽

Reflection Intensity

In this paper, we propose a method of generating a color image from light detection and ranging (LiDAR) 3D reflection intensity. The proposed method is composed of two steps: projection of LiDAR 3D reflection intensity into 2D intensity, and color image generation from the projected intensity by using a fully convolutional network (FCN). The color image should be generated from a very sparse projected intensity image. For this reason, the FCN is designed to have an asymmetric network structure, i.e., the layer depth of the decoder in the FCN is deeper than that of the encoder. The well-known KITTI dataset for various scenarios is used for the proposed FCN training and performance evaluation. Performance of the asymmetric network structures are empirically analyzed for various depth combinations for the encoder and decoder. Through simulations, it is shown that the proposed method generates fairly good visual quality of images while maintaining almost the same color as the ground truth image. Moreover, the proposed FCN has much higher performance than conventional interpolation methods and generative adversarial network based Pix2Pix. One interesting result is that the proposed FCN produces shadow-free and daylight color images. This result is caused by the fact that the LiDAR sensor data is produced by the light reflection and is, therefore, not affected by sunlight and shadow.

Download Full-text