Structured variational inference for simulating populations of radio galaxies

2021 ◽  
Vol 503 (3) ◽  
pp. 3351-3370
Author(s):  
David J Bastien ◽  
Anna M M Scaife ◽  
Hongming Tang ◽  
Micah Bowles ◽  
Fiona Porter

ABSTRACT We present a model for generating postage stamp images of synthetic Fanaroff–Riley Class I and Class II radio galaxies suitable for use in simulations of future radio surveys such as those being developed for the Square Kilometre Array. This model uses a fully connected neural network to implement structured variational inference through a variational autoencoder and decoder architecture. In order to optimize the dimensionality of the latent space for the autoencoder, we introduce the radio morphology inception score (RAMIS), a quantitative method for assessing the quality of generated images, and discuss in detail how data pre-processing choices can affect the value of this measure. We examine the 2D latent space of the VAEs and discuss how this can be used to control the generation of synthetic populations, whilst also cautioning how it may lead to biases when used for data augmentation.
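The generative core described here, a decoder sampling from a learned latent space, rests on the reparameterization trick and a KL regularizer. The following is a minimal NumPy sketch of those two pieces only; the 2-D latent dimensionality mirrors the latent space the authors visualise, while the function names and values are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps: the reparameterization trick that
    makes sampling from the approximate posterior differentiable."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_divergence(mu, log_var):
    """Closed-form KL(q(z|x) || N(0, I)) per sample, the regularizer
    in the VAE objective."""
    return -0.5 * np.sum(1 + log_var - mu**2 - np.exp(log_var), axis=-1)

# A 2-D latent space, as used for visualising and controlling the
# generated population; these two example posteriors are hypothetical.
mu = np.array([[0.0, 0.0], [1.0, -1.0]])
log_var = np.array([[0.0, 0.0], [-2.0, -2.0]])

z = reparameterize(mu, log_var, rng)
kl = kl_divergence(mu, log_var)
```

A posterior matching the prior (first row) incurs zero KL cost, while a confident, shifted posterior (second row) is penalised, which is what keeps the latent space smooth enough to sample synthetic sources from.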

2019 ◽  
Author(s):  
Sadegh Mohammadi ◽  
Bing O'Dowd ◽  
Christian Paulitz-Erdmann ◽  
Linus Goerlitz

Variational autoencoders have emerged as one of the most common approaches for automating molecular generation. We seek to learn a cross-domain latent space capturing chemical and biological information simultaneously. To do so, we introduce the Penalized Variational Autoencoder, which operates directly on SMILES, a linear string representation of molecules, with a weight penalty term in the decoder to address the imbalance in the character distribution of SMILES strings. We find that this greatly improves upon previous variational autoencoder approaches in the quality of the latent space and its ability to generalize to new chemistry. Next, we organize the latent space according to chemical and biological properties by jointly training the Penalized Variational Autoencoder with linear units. Extensive experiments on a range of tasks, including reconstruction, validity, and transferability, demonstrate that the proposed methods substantially outperform previous SMILES- and graph-based methods, and introduce a new way to generate molecules from a set of desired properties without prior knowledge of a chemical structure.
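One common way to realise a character-imbalance penalty of this kind is inverse-frequency weighting of the per-character reconstruction loss. The sketch below is a hypothetical NumPy illustration of that general idea; the toy vocabulary, counts, and normalisation are assumptions, not the paper's actual penalty term:

```python
import numpy as np

def char_weights(counts):
    """Inverse-frequency weights, normalised to average 1; rare SMILES
    characters (e.g. ring and branch symbols) are up-weighted."""
    freqs = counts / counts.sum()
    w = 1.0 / freqs
    return w / w.mean()

def weighted_nll(log_probs, targets, weights):
    """Per-character negative log-likelihood, scaled by each target
    character's weight and averaged over the sequence."""
    per_char = -log_probs[np.arange(len(targets)), targets]
    return np.mean(weights[targets] * per_char)

# Toy 4-character vocabulary {C, O, (, )} with imbalanced counts.
counts = np.array([80.0, 10.0, 5.0, 5.0])
w = char_weights(counts)

# A uniform decoder output over a 2-character target sequence.
log_probs = np.log(np.full((2, 4), 0.25))
targets = np.array([0, 2])
loss = weighted_nll(log_probs, targets, w)
```

Under this scheme, mispredicting a rare branch character costs more than mispredicting the ubiquitous carbon symbol, pushing the decoder to learn the full SMILES grammar rather than just the frequent characters.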


2020 ◽  
Author(s):  
Xiaoxiang Zhu ◽  
Mengshu Hou ◽  
Xiaoyang Zeng ◽  
Hao Zhu

Most supervised systems for the event detection (ED) task rely heavily on manual annotations and incur high-cost human effort when applied to new event types. To tackle this general problem, we turn our attention to few-shot learning (FSL). As a typical solution to FSL, frameworks based on cross-modal feature generation achieve promising performance on image classification, which inspires us to extend this approach to the ED task. In this work, we propose a model that extracts latent semantic features from event mentions, type structures, and type names; these three modalities are then mapped into a shared low-dimensional latent space by a modality-specific aligned variational autoencoder enhanced by adversarial training. We evaluate the quality of our latent representations by training a CNN classifier to perform the ED task. Experiments conducted on the ACE2005 dataset show an improvement of 12.67% in F1-score when introducing adversarial training to the VAE model, and our method is comparable with an existing transfer-learning framework for ED.
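One way to picture the shared latent space is a per-modality encoder plus a loss pulling matched mention and type-name latents together; the paper's adversarial training plays a comparable aligning role. The toy NumPy sketch below uses linear encoders and a squared-distance loss as illustrative simplifications; none of these names or shapes come from the paper:

```python
import numpy as np

def encode(x, W):
    """Modality-specific linear encoder into the shared latent space
    (a stand-in for each aligned VAE's encoder mean)."""
    return x @ W

def alignment_loss(z_mention, z_name):
    """Mean squared distance pulling an event mention's latent toward
    its event-type name's latent in the shared space."""
    return np.mean(np.sum((z_mention - z_name) ** 2, axis=-1))

rng = np.random.default_rng(0)
x_mention = rng.standard_normal((4, 8))  # toy event-mention features
x_name = x_mention.copy()                # matched type-name features
W_m = rng.standard_normal((8, 3))        # mention encoder
W_n = W_m.copy()                         # perfectly aligned name encoder

loss_aligned = alignment_loss(encode(x_mention, W_m), encode(x_name, W_n))

W_bad = W_m + 0.5                        # a mis-aligned name encoder
loss_misaligned = alignment_loss(encode(x_mention, W_m), encode(x_name, W_bad))
```

Once the modalities land in one space, a classifier trained on type-name latents can score unseen event mentions, which is what makes the few-shot transfer possible.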


2021 ◽  
Author(s):  
Paolo Tirotta ◽  
Stefano Lodi

Transfer learning through large pre-trained models has changed the landscape of current applications in natural language processing (NLP). Recently, Optimus, a variational autoencoder (VAE) that combines two pre-trained models, BERT and GPT-2, has been released, and its combination with generative adversarial networks (GANs) has been shown to produce novel yet very human-looking text. The Optimus and GANs combination avoids the troublesome application of GANs to the discrete domain of text and prevents the exposure bias of standard maximum likelihood methods. We combine the training of GANs in the latent space with the fine-tuning of the Optimus decoder for single-word generation. This approach lets us model both the high-level features of the sentences and the low-level word-by-word generation. We fine-tune using reinforcement learning (RL) by exploiting the structure of GPT-2 and by adding entropy-based intrinsically motivated rewards to balance quality and diversity. We benchmark the VAE-GAN model and show the improvements brought by our RL fine-tuning on three widely used datasets for text generation, with results that greatly surpass the current state of the art for the quality of the generated text.
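An entropy-based intrinsic reward of the kind described can be sketched as a quality term plus a scaled Shannon-entropy bonus on the model's next-token distribution. The reward shape and the β coefficient below are assumptions for illustration, not the paper's exact formulation:

```python
import numpy as np

def entropy(p):
    """Shannon entropy of a next-token probability distribution."""
    p = p[p > 0]
    return -np.sum(p * np.log(p))

def intrinsic_reward(quality, token_probs, beta=0.1):
    """Quality reward plus a scaled entropy bonus, trading fluency
    against diversity during RL fine-tuning."""
    return quality + beta * entropy(token_probs)

peaked = np.array([0.97, 0.01, 0.01, 0.01])   # low-diversity policy
uniform = np.full(4, 0.25)                    # high-diversity policy

r_peaked = intrinsic_reward(1.0, peaked)
r_uniform = intrinsic_reward(1.0, uniform)
```

With the bonus, a policy that collapses onto one token earns less reward than one that keeps its distribution spread, which counteracts the mode collapse that pure quality rewards tend to induce.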




Author(s):  
Xiaofeng Liu ◽  
Yang Zou ◽  
Lingsheng Kong ◽  
Zhihui Diao ◽  
Junliang Yan ◽  
...  

2021 ◽  
Author(s):  
Yuen Ler Chow ◽  
Shantanu Singh ◽  
Anne E Carpenter ◽  
Gregory P. Way

A variational autoencoder (VAE) is a machine learning algorithm useful for generating a compressed and interpretable latent space. These representations have been generated from various biomedical data types and can be used to produce realistic-looking simulated data. However, standard vanilla VAEs suffer from entangled and uninformative latent spaces, which can be mitigated by other VAE variants such as the β-VAE and MMD-VAE. In this project, we evaluated the ability of VAEs to learn cell morphology characteristics derived from cell images. We trained and evaluated three VAE variants (vanilla VAE, β-VAE, and MMD-VAE) on cell morphology readouts and explored the generative capacity of each model to predict compound polypharmacology (the interactions of a drug with more than one target) using an approach called latent space arithmetic (LSA). To test the generalizability of the strategy, we also trained these VAEs using gene expression data of the same compound perturbations and found that gene expression provides complementary information. We found that the β-VAE and MMD-VAE disentangle morphology signals and reveal a more interpretable latent space. We reliably simulated morphology and gene expression readouts from certain compounds, thereby predicting cell states perturbed with compounds of known polypharmacology. Inferring cell state for specific drug mechanisms could aid researchers in developing and identifying targeted therapeutics and categorizing off-target effects in the future.
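Latent space arithmetic as described combines latent codes additively relative to a control state. The following is a minimal NumPy sketch of that recipe; the 3-D vectors and target signatures are hypothetical stand-ins for real encoded morphology profiles:

```python
import numpy as np

def latent_space_arithmetic(z_a, z_b, z_control):
    """LSA recipe: add two compounds' latent codes and subtract the
    control's, predicting the latent code of a compound that hits
    both targets (polypharmacology)."""
    return z_a + z_b - z_control

z_control = np.zeros(3)                # untreated-cell latent code
z_drug_a = np.array([1.0, 0.0, 0.0])   # hypothetical target-A signature
z_drug_b = np.array([0.0, 2.0, 0.0])   # hypothetical target-B signature

z_poly = latent_space_arithmetic(z_drug_a, z_drug_b, z_control)
```

Decoding `z_poly` would then yield the simulated morphology readout for the dual-target compound; the arithmetic only works well when the latent space is disentangled, which is why the β-VAE and MMD-VAE variants matter here.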

