Improving Multi-Agent Generative Adversarial Nets with Variational Latent Representation

Huan Zhao; Tingting Li; Yufeng Xiao; Yu Wang

doi:10.3390/e22091055

Improving Multi-Agent Generative Adversarial Nets with Variational Latent Representation

Entropy ◽

10.3390/e22091055 ◽

2020 ◽

Vol 22 (9) ◽

pp. 1055

Author(s):

Huan Zhao ◽

Tingting Li ◽

Yufeng Xiao ◽

Yu Wang

Keyword(s):

Large Scale ◽

Random Noise ◽

Qualitative Evaluation ◽

Training Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Proposed Model ◽

Multi Agent ◽

Latent Representations ◽

Real World Datasets

Generative adversarial networks (GANs), which are a promising type of deep generative network, have recently drawn considerable attention and made impressive progress. However, GAN models suffer from the well-known problem of mode collapse. This study focuses on this challenge and introduces a new model design, called the encoded multi-agent generative adversarial network (E-MGAN), which tackles the mode collapse problem by introducing the variational latent representations learned from a variable auto-encoder (VAE) to a multi-agent GAN. The variational latent representations are extracted from training data to replace the random noise input of the general multi-agent GANs. The generator in E-MGAN employs multiple generators and is penalized by a classifier. This integration guarantees that the proposed model not only enhances the quality of generated samples but also improves the diversity of generated samples to avoid the mode collapse problem. Moreover, extensive experiments are conducted on both a synthetic dataset and two large-scale real-world datasets. The generated samples are visualized for qualitative evaluation. The inception score (IS) and Fréchet inception distance (FID) are adopted to measure the performance of the model for quantitative assessment. The results confirmed that the proposed model achieves outstanding performances compared to other state-of-the-art GAN variants.

Download Full-text

COVID-GAN+: Estimating Human Mobility Responses to COVID-19 through Spatio-temporal Generative Adversarial Networks with Enhanced Features

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3481617 ◽

2022 ◽

Vol 13 (2) ◽

pp. 1-23

Author(s):

Han Bao ◽

Xun Zhou ◽

Yiqun Xie ◽

Yingxue Zhang ◽

Yanhua Li

Keyword(s):

Spatial Heterogeneity ◽

Real World ◽

Large Scale ◽

Census Data ◽

Human Mobility ◽

Training Data ◽

Estimation Accuracy ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Spatio Temporal

Estimating human mobility responses to the large-scale spreading of the COVID-19 pandemic is crucial, since its significance guides policymakers to give Non-pharmaceutical Interventions, such as closure or reopening of businesses. It is challenging to model due to complex social contexts and limited training data. Recently, we proposed a conditional generative adversarial network (COVID-GAN) to estimate human mobility response under a set of social and policy conditions integrated from multiple data sources. Although COVID-GAN achieves a good average estimation accuracy under real-world conditions, it produces higher errors in certain regions due to the presence of spatial heterogeneity and outliers. To address these issues, in this article, we extend our prior work by introducing a new spatio-temporal deep generative model, namely, COVID-GAN+. COVID-GAN+ deals with the spatial heterogeneity issue by introducing a new spatial feature layer that utilizes the local Moran statistic to model the spatial heterogeneity strength in the data. In addition, we redesign the training objective to learn the estimated mobility changes from historical average levels to mitigate the effects of spatial outliers. We perform comprehensive evaluations using urban mobility data derived from cell phone records and census data. Results show that COVID-GAN+ can better approximate real-world human mobility responses than prior methods, including COVID-GAN.

Download Full-text

Combining Variational Autoencoders & Generative Adversarial Networks to Improve Image Quality

10.31219/osf.io/8bmdu ◽

2019 ◽

Author(s):

Atin Sakkeer Hussain

Keyword(s):

Image Quality ◽

Random Noise ◽

Training Data ◽

Generative Adversarial Networks ◽

Improve Image Quality ◽

Adversarial Networks ◽

Proposed Model ◽

Variational Autoencoder ◽

Proper Training ◽

Better Than

Generative Adversarial Networks(GAN) are trained to generate images from random noise vectors, but often these images turn out poorly due to any of several reasons such as model collapse, lack of proper training data, lack of training, etc. To combat this issue this paper, makes use of a Variational Autoencoder(VAE). The VAE is trained on a combination of the training & generated data, after this the VAE can be used to map images generated by the GAN to better versions of it. (This is similar to Denoising, but with few variations in the image). In addition to improving quality the proposed model is shown to work better than normal WGAN’s on sparse datasets with higher variety, in equal number of training epochs.

Download Full-text

Linear electromagnetic inverse scattering via generative adversarial networks

International Journal of Microwave and Wireless Technologies ◽

10.1017/s1759078721001331 ◽

2021 ◽

pp. 1-9

Author(s):

Huilin Zhou ◽

Huimin Zheng ◽

Qiegen Liu ◽

Jian Liu ◽

Yuhao Wang

Keyword(s):

Inverse Scattering ◽

Optimization Methods ◽

Training Data ◽

Generative Adversarial Networks ◽

Scattering Problems ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Highly Nonlinear ◽

Electromagnetic Inverse Scattering

Abstract Electromagnetic inverse-scattering problems (ISPs) are concerned with determining the properties of an unknown object using measured scattered fields. ISPs are often highly nonlinear, causing the problem to be very difficult to address. In addition, the reconstruction images of different optimization methods are distorted which leads to inaccurate reconstruction results. To alleviate these issues, we propose a new linear model solution of generative adversarial network-based (LM-GAN) inspired by generative adversarial networks (GAN). Two sub-networks are trained alternately in the adversarial framework. A linear deep iterative network as a generative network captures the spatial distribution of the data, and a discriminative network estimates the probability of a sample from the training data. Numerical results validate that LM-GAN has admirable fidelity and accuracy when reconstructing complex scatterers.

Download Full-text

MetaTP

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies ◽

10.1145/3478083 ◽

2021 ◽

Vol 5 (3) ◽

pp. 1-28

Author(s):

Weida Zhong ◽

Qiuling Suo ◽

Abhishek Gupta ◽

Xiaowei Jia ◽

Chunming Qiao ◽

...

Keyword(s):

Time Series ◽

Large Scale ◽

Multivariate Time Series ◽

Modern Society ◽

Training Data ◽

Traffic Prediction ◽

Temporal Prediction ◽

Reference Space ◽

Meta Learning ◽

Real World Datasets

With the popularity of smartphones, large-scale road sensing data is being collected to perform traffic prediction, which is an important task in modern society. Due to the nature of the roving sensors on smartphones, the collected traffic data which is in the form of multivariate time series, is often temporally sparse and unevenly distributed across regions. Moreover, different regions can have different traffic patterns, which makes it challenging to adapt models learned from regions with sufficient training data to target regions. Given that many regions may have very sparse data, it is also impossible to build individual models for each region separately. In this paper, we propose a meta-learning based framework named MetaTP to overcome these challenges. MetaTP has two key parts, i.e., basic traffic prediction network (base model) and meta-knowledge transfer. In base model, a two-layer interpolation network is employed to map original time series onto uniformly-spaced reference time points, so that temporal prediction can be effectively performed in the reference space. The meta-learning framework is employed to transfer knowledge from source regions with a large amount of data to target regions with a few data examples via fast adaptation, in order to improve model generalizability on target regions. Moreover, we use two memory networks to capture the global patterns of spatial and temporal information across regions. We evaluate the proposed framework on two real-world datasets, and experimental results show the effectiveness of the proposed framework.

Download Full-text

An E2GPGP-GASA-Based Multi-Agent Job Shop Scheduling System

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.505.65 ◽

2012 ◽

Vol 505 ◽

pp. 65-74

Author(s):

Lin Lin Lu ◽

Xin Ma ◽

Ya Xuan Wang

Keyword(s):

Large Scale ◽

Job Shop ◽

Job Shop Scheduling ◽

Simulation Software ◽

Prototype System ◽

Shop Scheduling ◽

Model Combining ◽

Proposed Model ◽

Multi Agent ◽

Global Planning

In this paper, a job shop scheduling model combining MAS (Multi-Agent System) with GASA (Simulated Annealing-Genetic Algorithm) is presented. The proposed model is based on the E2GPGP (extended extended generalized partial global planning) mechanism and utilizes the advantages of static intelligence algorithms with dynamic MAS. A scheduling process from ‘initialized macro-scheduling’ to ‘repeated micro-scheduling’ is designed for large-scale complex problems to enable to implement an effective and widely applicable prototype system for the job shop scheduling problem (JSSP). Under a set of theoretic strategies in the GPGP which is summarized in detail, E2GPGP is also proposed further. The GPGP-cooperation-mechanism is simulated by using simulation software DECAF for the JSSP. The results show that the proposed model based on the E2GPGP-GASA not only improves the effectiveness, but also reduces the resource cost.

Download Full-text

Three-Dimensional Liver Image Segmentation Using Generative Adversarial Networks Based on Feature Restoration

Frontiers in Medicine ◽

10.3389/fmed.2021.794969 ◽

2022 ◽

Vol 8 ◽

Author(s):

Runnan He ◽

Shiqi Xu ◽

Yashu Liu ◽

Qince Li ◽

Yang Liu ◽

...

Keyword(s):

Medical Imaging ◽

Random Noise ◽

Three Dimensional ◽

Poor Quality ◽

Training Data ◽

Generative Adversarial Networks ◽

Liver Segmentation ◽

Deep Convolutional Neural Networks ◽

Adversarial Networks ◽

Liver Region

Medical imaging provides a powerful tool for medical diagnosis. In the process of computer-aided diagnosis and treatment of liver cancer based on medical imaging, accurate segmentation of liver region from abdominal CT images is an important step. However, due to defects of liver tissue and limitations of CT imaging procession, the gray level of liver region in CT image is heterogeneous, and the boundary between the liver and those of adjacent tissues and organs is blurred, which makes the liver segmentation an extremely difficult task. In this study, aiming at solving the problem of low segmentation accuracy of the original 3D U-Net network, an improved network based on the three-dimensional (3D) U-Net, is proposed. Moreover, in order to solve the problem of insufficient training data caused by the difficulty of acquiring labeled 3D data, an improved 3D U-Net network is embedded into the framework of generative adversarial networks (GAN), which establishes a semi-supervised 3D liver segmentation optimization algorithm. Finally, considering the problem of poor quality of 3D abdominal fake images generated by utilizing random noise as input, deep convolutional neural networks (DCNN) based on feature restoration method is designed to generate more realistic fake images. By testing the proposed algorithm on the LiTS-2017 and KiTS19 dataset, experimental results show that the proposed semi-supervised 3D liver segmentation method can greatly improve the segmentation performance of liver, with a Dice score of 0.9424 outperforming other methods.

Download Full-text

Multi-agent Attentional Activity Recognition

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/186 ◽

2019 ◽

Cited By ~ 3

Author(s):

Kaixuan Chen ◽

Lina Yao ◽

Dalin Zhang ◽

Bin Guo ◽

Zhiwen Yu

Keyword(s):

Activity Recognition ◽

State Of The Art ◽

Body Part ◽

Body Parts ◽

Temporal Attention ◽

Attention Model ◽

Proposed Model ◽

Collective Motions ◽

Multi Agent ◽

Real World Datasets

Multi-modality is an important feature of sensor based activity recognition. In this work, we consider two inherent characteristics of human activities, the spatially-temporally varying salience of features and the relations between activities and corresponding body part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism helps intelligently select informative modalities and their active periods. And the multiple agents in the proposed model represent activities with collective motions across body parts by independently selecting modalities associated with single motions. With a joint recognition goal, the agents share gained information and coordinate their selection policies to learn the optimal recognition model. The experimental results on four real-world datasets demonstrate that the proposed model outperforms the state-of-the-art methods.

Download Full-text

INFRASTRUCTURE DEGRADATION AND POST-DISASTER DAMAGE DETECTION USING ANOMALY DETECTING GENERATIVE ADVERSARIAL NETWORKS

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-v-2-2020-573-2020 ◽

2020 ◽

Vol V-2-2020 ◽

pp. 573-582 ◽

Cited By ~ 1

Author(s):

S. M. Tilon ◽

F. Nex ◽

D. Duarte ◽

N. Kerle ◽

G. Vosselman

Keyword(s):

Deep Learning ◽

Damage Detection ◽

Training Data ◽

Generative Adversarial Networks ◽

Generative Adversarial Network ◽

Essential Information ◽

Urban Scenes ◽

Adversarial Network ◽

Collapsed Buildings ◽

Post Disaster

Abstract. Degradation and damage detection provides essential information to maintenance workers in routine monitoring and to first responders in post-disaster scenarios. Despite advance in Earth Observation (EO), image analysis and deep learning techniques, the quality and quantity of training data for deep learning is still limited. As a result, no robust method has been found yet that can transfer and generalize well over a variety of geographic locations and typologies of damages. Since damages can be seen as anomalies, occurring sparingly over time and space, we propose to use an anomaly detecting Generative Adversarial Network (GAN) to detect damages. The main advantages of using GANs are that only healthy unannotated images are needed, and that a variety of damages, including the never before seen damage, can be detected. In this study we aimed to investigate 1) the ability of anomaly detecting GANs to detect degradation (potholes and cracks) in asphalt road infrastructures using Mobile Mapper imagery and building damage (collapsed buildings, rubble piles) using post-disaster aerial imagery, and 2) the sensitivity of this method against various types of pre-processing. Our results show that we can detect damages in urban scenes at satisfying levels but not on asphalt roads. Future work will investigate how to further classify the found damages and how to improve damage detection for asphalt roads.

Download Full-text

Learning Representations of Inorganic Materials from Generative Adversarial Networks

Symmetry ◽

10.3390/sym12111889 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1889

Author(s):

Tiantian Hu ◽

Hui Song ◽

Tao Jiang ◽

Shaobo Li

Keyword(s):

Large Scale ◽

Inorganic Materials ◽

Molecular Composition ◽

Materials Characterization ◽

Generative Adversarial Networks ◽

Feature Engineering ◽

Material Research ◽

Generative Adversarial Network ◽

Characterization Of Materials

The two most important aspects of material research using deep learning (DL) or machine learning (ML) are the characteristics of materials data and learning algorithms, where the proper characterization of materials data is essential for generating accurate models. At present, the characterization of materials based on the molecular composition includes some methods based on feature engineering, such as Magpie and One-hot. Although these characterization methods have achieved significant results in materials research, these methods based on feature engineering cannot guarantee the integrity of materials characterization. One possible approach is to learn the materials characterization via neural networks using the chemical knowledge and implicit composition rules shown in large-scale known materials. This article chooses an adversarial method to learn the composition of atoms using the Generative Adversarial Network (GAN), which makes sense for data symmetry. The total loss value of the discriminator on the test set is reduced from 4.1e13 to 0.3194, indicating that the designed GAN network can well capture the combination of atoms in real materials. We then use the trained discriminator weights for material characterization and predict bandgap, formation energy, critical temperature (Tc) of superconductors on the Open Quantum Materials Database (OQMD), Materials Project (MP), and SuperCond datasets. Experiments show that when using the same predictive model, our proposed method performs better than One-hot and Magpie. This article provides an effective method for characterizing materials based on molecular composition in addition to Magpie, One-hot, etc. In addition, the generator learned in this study generates hypothetical materials with the same distribution as known materials, and these hypotheses can be used as a source for new material discovery.

Download Full-text

Inferring Concept Prerequisite Relations from Online Educational Resources

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019589 ◽

2019 ◽

Vol 33 ◽

pp. 9589-9594 ◽

Cited By ~ 2

Author(s):

Sudeshna Roy ◽

Meghana Madhyastha ◽

Sheril Lawrence ◽

Vaibhav Rajan

Keyword(s):

Network Architecture ◽

Large Scale ◽

Latent Dirichlet Allocation ◽

Training Data ◽

Allocation Model ◽

Latent Dirichlet Allocation Model ◽

Technology Applications ◽

Benchmark Datasets ◽

Latent Representations ◽

Online Educational Resources

The Internet has rich and rapidly increasing sources of high quality educational content. Inferring prerequisite relations between educational concepts is required for modern large-scale online educational technology applications such as personalized recommendations and automatic curriculum creation. We present PREREQ, a new supervised learning method for inferring concept prerequisite relations. PREREQ is designed using latent representations of concepts obtained from the Pairwise Latent Dirichlet Allocation model, and a neural network based on the Siamese network architecture. PREREQ can learn unknown concept prerequisites from course prerequisites and labeled concept prerequisite data. It outperforms state-of-the-art approaches on benchmark datasets and can effectively learn from very less training data. PREREQ can also use unlabeled video playlists, a steadily growing source of training data, to learn concept prerequisites, thus obviating the need for manual annotation of course prerequisites.

Download Full-text