Context-Aware Content Generation for Virtual Environments

Volume 1B: 36th Computers and Information in Engineering Conference ◽

10.1115/detc2016-59997 ◽

2016 ◽

Author(s):

Andrew Brock ◽

Theodore Lim ◽

J. M. Ritchie ◽

Nick Weston

Keyword(s):

Large Scale ◽

Dimensional Space ◽

Context Aware ◽

3 Dimensional ◽

Latent Space ◽

Variational Autoencoder ◽

Computationally Intensive ◽

Expert Input ◽

Low Dimensional ◽

Content Generation

Large scale scene generation is a computationally intensive operation, and added complexities arise when dynamic content generation is required. We propose a system capable of generating virtual content from non-expert input. The proposed system uses a 3-dimensional variational autoencoder to interactively generate new virtual objects by interpolating between extant objects in a learned low-dimensional space, as well as by randomly sampling in that space. We present an interface that allows a user to intuitively explore the latent manifold, taking advantage of the network’s ability to perform algebra in the latent space to help infer context and generalize to previously unseen inputs.
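The latent-space operations named in this abstract (interpolating between existing objects, random sampling, and latent algebra) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the latent codes are plain NumPy vectors, linear interpolation and a standard normal prior are assumptions, and the trained 3D encoder and decoder that would produce and consume these codes are not shown.

```python
import numpy as np


def interpolate_latents(z_a, z_b, steps=5):
    """Return `steps` latent codes evenly spaced on the line from z_a to z_b."""
    z_a = np.asarray(z_a, dtype=float)
    z_b = np.asarray(z_b, dtype=float)
    weights = np.linspace(0.0, 1.0, steps)
    # Each row is a convex combination of the two endpoint codes.
    return np.stack([(1.0 - t) * z_a + t * z_b for t in weights])


def latent_analogy(z_a, z_b, z_c):
    """Latent 'algebra': apply the offset (z_b - z_a) to z_c."""
    z_a, z_b, z_c = (np.asarray(z, dtype=float) for z in (z_a, z_b, z_c))
    return z_c + (z_b - z_a)


def sample_prior(n, dim, seed=0):
    """Draw n random codes from a standard normal prior (an assumption here)."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((n, dim))


# Usage with toy 2-dimensional codes standing in for encoder outputs.
path = interpolate_latents([0.0, 0.0], [2.0, 4.0], steps=3)
shifted = latent_analogy([0.0, 0.0], [1.0, 1.0], [2.0, 3.0])
random_codes = sample_prior(4, 8)
```

In a full system, each row of `path` or `random_codes` would be passed to the trained decoder to produce a new object.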

Projection Analysis Optimization for Human Transition Motion Estimation

International Journal of Digital Multimedia Broadcasting ◽

10.1155/2019/6816453 ◽

2019 ◽

Vol 2019 ◽

pp. 1-9

Author(s):

Wanyi Li ◽

Feifei Zhang ◽

Qiang Chen ◽

Qian Zhang

Keyword(s):

Video Game ◽

Human Motion ◽

High Dimensional ◽

Dynamical Models ◽

Excellent Performance ◽

3 Dimensional ◽

Latent Space ◽

Projection Analysis ◽

Low Dimensional ◽

Working Efficiency

Estimating human transition motion is difficult without specialized software. Three-dimensional (3D) human motion animation is widely used in video games, movies, and similar media, and producing such animation requires human transition motion. A method that can generate the transition motion would shorten production time and improve working efficiency. Thus a new method called latent space optimization based on projection analysis (LSOPA) is proposed to estimate human transition motion. LSOPA is carried out with the assistance of Gaussian process dynamical models (GPDM); it builds an objective function to optimize the data in the low-dimensional (LD) space, and the optimized LD data are used to generate the human transition motion. LSOPA enables the GPDM to learn the high-dimensional (HD) data in order to estimate the needed transition motion. The performance of LSOPA is evaluated experimentally.

Collective dynamics of repeated inference in variational autoencoder rapidly find cluster structure

Scientific Reports ◽

10.1038/s41598-020-72593-4 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Yoshihiro Nagano ◽

Ryo Karakida ◽

Masato Okada

Keyword(s):

Activity Pattern ◽

Latent Variables ◽

Cluster Structure ◽

Generative Models ◽

Cluster Center ◽

Specific Data ◽

Global Cluster ◽

Latent Space ◽

Variational Autoencoder ◽

Low Dimensional

Deep neural networks are good at extracting low-dimensional subspaces (latent spaces) that represent the essential features inside a high-dimensional dataset. Deep generative models represented by variational autoencoders (VAEs) can generate and infer high-quality datasets, such as images. In particular, VAEs can eliminate the noise contained in an image by repeating the mapping between latent and data space. To clarify the mechanism of such denoising, we numerically analyzed how the activity pattern of trained networks changes in the latent space during inference. We considered the time development of the activity pattern for specific data as one trajectory in the latent space and investigated the collective behavior of these inference trajectories for many data. Our study revealed that when a cluster structure exists in the dataset, the trajectory rapidly approaches the center of the cluster. This behavior was qualitatively consistent with the concept retrieval reported in associative memory models. Additionally, the larger the noise contained in the data, the closer the trajectory was to a more global cluster. It was demonstrated that by increasing the number of latent variables, the tendency to approach a cluster center can be enhanced, and the generalization ability of the VAE can be improved.

A theory of multineuronal dimensionality, dynamics and measurement

10.1101/214262 ◽

2017 ◽

Cited By ~ 38

Author(s):

Peiran Gao ◽

Eric Trautmann ◽

Byron Yu ◽

Gopal Santhanam ◽

Stephen Ryu ◽

...

Keyword(s):

Experimental Design ◽

Large Scale ◽

Task Complexity ◽

Dimensional Space ◽

Neural Dynamics ◽

Firing Rates ◽

Control Behavior ◽

Reduction Methods ◽

Low Dimensional

In many experiments, neuroscientists tightly control behavior, record many trials, and obtain trial-averaged firing rates from hundreds of neurons in circuits containing billions of behaviorally relevant neurons. Dimensionality reduction methods reveal a striking simplicity underlying such multi-neuronal data: they can be reduced to a low-dimensional space, and the resulting neural trajectories in this space yield a remarkably insightful dynamical portrait of circuit computation. This simplicity raises profound and timely conceptual questions. What are its origins and its implications for the complexity of neural dynamics? How would the situation change if we recorded more neurons? When, if at all, can we trust dynamical portraits obtained from measuring an infinitesimal fraction of task relevant neurons? We present a theory that answers these questions, and test it using physiological recordings from reaching monkeys. This theory reveals conceptual insights into how task complexity governs both neural dimensionality and accurate recovery of dynamic portraits, thereby providing quantitative guidelines for future large-scale experimental design.

A learned embedding for efficient joint analysis of millions of mass spectra

10.1101/483263 ◽

2018 ◽

Cited By ~ 4

Author(s):

Damon H. May ◽

Jeffrey Bilmes ◽

William S. Noble

Keyword(s):

Mass Spectra ◽

Large Scale ◽

Dimensional Space ◽

Software Implementation ◽

Mass Spectrometry Data ◽

Joint Analysis ◽

Clustering Methods ◽

Peptide Mass ◽

Public Repositories ◽

Low Dimensional

Despite an explosion of data in public repositories, peptide mass spectra are usually analyzed by each laboratory in isolation, treating each experiment as if it has no relationship to any others. This approach fails to exploit the wealth of existing, previously analyzed mass spectrometry data. Others have jointly analyzed many mass spectra, often using clustering. However, mass spectra are not necessarily best summarized as clusters, and although new spectra can be added to existing clusters, clustering methods previously applied to mass spectra do not allow new clusters to be defined without completely re-clustering. As an alternative, we propose to train a deep neural network, called “GLEAMS,” to learn an embedding of spectra into a low-dimensional space in which spectra generated by the same peptide are close to one another. We demonstrate empirically the utility of this learned embedding by propagating annotations from labeled to unlabeled spectra. We further use GLEAMS to detect groups of unidentified, proximal spectra representing the same peptide, and we show how to use these spectral communities to reveal misidentified spectra and to characterize frequently observed but consistently unidentified molecular species. We provide a software implementation of our approach, along with a tool to quickly embed additional spectra using a pre-trained model, to facilitate large-scale analyses.

Deciphering protein evolution and fitness landscapes with latent space models

Nature Communications ◽

10.1038/s41467-019-13633-0 ◽

2019 ◽

Vol 10 (1) ◽

Cited By ~ 4

Author(s):

Xinqiang Ding ◽

Zhengting Zou ◽

Charles L. Brooks III

Keyword(s):

Protein Evolution ◽

Fitness Landscape ◽

Dimensional Space ◽

Gaussian Process Regression ◽

Protein Sequences ◽

Fitness Landscapes ◽

Space Representation ◽

Latent Space ◽

Low Dimensional ◽

Latent Space Models

Protein sequences contain rich information about protein evolution, fitness landscapes, and stability. Here we investigate how latent space models trained using variational auto-encoders can infer these properties from sequences. Using both simulated and real sequences, we show that the low dimensional latent space representation of sequences, calculated using the encoder model, captures both evolutionary and ancestral relationships between sequences. Together with experimental fitness data and Gaussian process regression, the latent space representation also enables learning the protein fitness landscape in a continuous low dimensional space. Moreover, the model is also useful in predicting protein mutational stability landscapes and quantifying the importance of stability in shaping protein evolution. Overall, we illustrate that the latent space models learned using variational auto-encoders provide a mechanism for exploration of the rich data contained in protein sequences regarding evolution, fitness and stability and hence are well-suited to help guide protein engineering efforts.

Geometric anomaly detection in data

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2001741117 ◽

2020 ◽

Vol 117 (33) ◽

pp. 19664-19669

Author(s):

Bernadette J. Stolz ◽

Jared Tanner ◽

Heather A. Harrington ◽

Vidit Nanda

Keyword(s):

Social Sciences ◽

Dimensional Space ◽

High Dimensional ◽

3 Dimensional ◽

Modeling Techniques ◽

Low Dimensional ◽

Local Topology ◽

Systematic Framework ◽

Dominant Paradigm ◽

Local Nature

The quest for low-dimensional models which approximate high-dimensional data is pervasive across the physical, natural, and social sciences. The dominant paradigm underlying most standard modeling techniques assumes that the data are concentrated near a single unknown manifold of relatively small intrinsic dimension. Here, we present a systematic framework for detecting interfaces and related anomalies in data which may fail to satisfy the manifold hypothesis. By computing the local topology of small regions around each data point, we are able to partition a given dataset into disjoint classes, each of which can be individually approximated by a single manifold. Since these manifolds may have different intrinsic dimensions, local topology discovers singular regions in data even when none of the points have been sampled precisely from the singularities. We showcase this method by identifying the intersection of two surfaces in the 24-dimensional space of cyclo-octane conformations and by locating all of the self-intersections of a Henneberg minimal surface immersed in 3-dimensional space. Due to the local nature of the topological computations, the algorithmic burden of performing such data stratification is readily distributable across several processors.

Deep Learning-Based Action Recognition Using 3D Skeleton Joints Information

Inventions ◽

10.3390/inventions5030049 ◽

2020 ◽

Vol 5 (3) ◽

pp. 49

Author(s):

Nusrat Tasnim ◽

Md. Mahbubul Islam ◽

Joong-Hwan Baek

Keyword(s):

Action Recognition ◽

Large Scale ◽

Dimensional Space ◽

Human Action Recognition ◽

Human Action ◽

Human Machine Interaction ◽

Human Actions ◽

3 Dimensional ◽

3D Skeleton ◽

Color Depth

Human action recognition has become one of the most attractive and demanding fields of research in computer vision and pattern recognition because it enables easy, smart, and comfortable human-machine interaction. Following substantial research progress in recent years, several methods have been proposed for discriminating between types of human actions using color, depth, inertial, and skeleton information. Although several action identification methods use different modalities, classifying human actions from skeleton joint information in 3-dimensional space is still a challenging problem. In this paper, we propose an effective method for action recognition using 3D skeleton data. First, large-scale 3D skeleton joint information was analyzed and pre-processed. Then, a simple, straightforward deep convolutional neural network (DCNN) was designed to classify the desired actions in order to evaluate the effectiveness of the proposed system. We also evaluated existing DCNN models such as ResNet18 and MobileNetV2, which outperform existing systems that use human skeleton joint information.

Deep soft K-means clustering with self-training for single-cell RNA sequence data

NAR Genomics and Bioinformatics ◽

10.1093/nargab/lqaa039 ◽

2020 ◽

Vol 2 (2) ◽

Cited By ~ 2

Author(s):

Liang Chen ◽

Weinan Wang ◽

Yuyao Zhai ◽

Minghua Deng

Keyword(s):

Deep Learning ◽

Single Cell ◽

Large Scale ◽

Sequence Data ◽

Dimensional Space ◽

Expression Profiles ◽

Single Cells ◽

Clustering Algorithms ◽

Training Procedure ◽

Latent Space

Single-cell RNA sequencing (scRNA-seq) allows researchers to study cell heterogeneity at the cellular level. A crucial step in analyzing scRNA-seq data is to cluster cells into subpopulations to facilitate subsequent downstream analysis. However, frequent dropout events and the increasing size of scRNA-seq data make clustering such high-dimensional, sparse and massive transcriptional expression profiles challenging. Although some existing deep learning-based clustering algorithms for single cells combine dimensionality reduction with clustering, they either ignore the distance and affinity constraints between similar cells or make additional latent space assumptions, such as a Gaussian mixture distribution, and so fail to learn a cluster-friendly low-dimensional space. Therefore, in this paper, we combine deep learning with a denoising autoencoder to characterize scRNA-seq data, and we propose a soft self-training K-means algorithm to cluster the cell population in the learned latent space. The self-training procedure can effectively aggregate similar cells and pursue a more cluster-friendly latent space. Our method, called ‘scziDesk’, alternately performs data compression, data reconstruction and soft clustering, and the results exhibit excellent compatibility and robustness in both simulated and real data. Moreover, our proposed method scales well with the number of cells on large-scale datasets.
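As a rough illustration of what soft clustering in a learned latent space can look like, the sketch below implements a generic soft K-means step. It is not the scziDesk algorithm: the softmax over negative squared distances, the sharpness parameter `alpha`, and both function names are assumptions made for this example, and the denoising autoencoder and self-training procedure described in the abstract are not shown.

```python
import numpy as np


def soft_assign(z, centers, alpha=1.0):
    """Soft cluster responsibilities: softmax over negative squared distances."""
    # Pairwise squared Euclidean distances, shape (n_points, n_clusters).
    d2 = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    logits = -alpha * d2
    # Subtract the row maximum for numerical stability before exponentiating.
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True)


def update_centers(z, resp):
    """Move each center to the responsibility-weighted mean of the points."""
    return (resp.T @ z) / resp.sum(axis=0)[:, None]


# Usage on toy 2-dimensional latent codes forming two well-separated groups.
z = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
centers = np.array([[0.0, 0.5], [10.0, 10.5]])
resp = soft_assign(z, centers)
new_centers = update_centers(z, resp)
```

Larger values of `alpha` make the assignments closer to hard K-means; smaller values spread each point's weight across more clusters.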

Physical model of the genotype-to-phenotype map of proteins

10.1101/069039 ◽

2016 ◽

Author(s):

Tsvi Tlusty ◽

Albert Libchaber ◽

Jean-Pierre Eckmann

Keyword(s):

Shear Band ◽

Physical Model ◽

Protein Function ◽

Large Scale ◽

Dimensional Space ◽

Binary Sequences ◽

Basic Question ◽

Mechanical Basis ◽

Low Dimensional ◽

Scale Motion

How DNA is mapped to functional proteins is a basic question of living matter. We introduce and study a physical model of protein evolution which suggests a mechanical basis for this map. Many proteins rely on large-scale motion to function. We therefore treat protein as learning amorphous matter that evolves towards such a mechanical function: Genes are binary sequences that encode the connectivity of the amino acid network that makes a protein. The gene is evolved until the network forms a shear band across the protein, which allows for long-range, soft modes required for protein function. The evolution reduces the high-dimensional sequence space to a low-dimensional space of mechanical modes, in accord with the observed dimensional reduction between genotype and phenotype of proteins. Spectral analysis of the space of 10^6 solutions shows a strong correspondence between localization around the shear band of both mechanical modes and the sequence structure. Specifically, our model shows how mutations are correlated among amino acids whose interactions determine the functional mode. PACS numbers: 87.14.E-, 87.15.-v, 87.10.-e

Learning Cross-Aligned Latent Embeddings for Zero-Shot Cross-Modal Retrieval

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6817 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11515-11522

Author(s):

Kaiyi Lin ◽

Xing Xu ◽

Lianli Gao ◽

Zheng Wang ◽

Heng Tao Shen

Keyword(s):

Large Scale ◽

Semantic Space ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Latent Space ◽

Multimodal Features ◽

Benchmark Datasets ◽

Class Labels ◽

Low Dimensional ◽

Sketch Retrieval

Zero-Shot Cross-Modal Retrieval (ZS-CMR) is an emerging research hotspot that aims to retrieve data of new classes across different modalities. It is challenging due not only to the heterogeneous distributions across different modalities, but also to the inconsistent semantics across seen and unseen classes. A handful of recently proposed methods typically borrow the idea from zero-shot learning, i.e., exploiting word embeddings of class labels (i.e., class-embeddings) as a common semantic space, and using a generative adversarial network (GAN) to capture the underlying multimodal data structures, as well as to strengthen relations between input data and semantic space to generalize across seen and unseen classes. In this paper, we propose a novel method termed Learning Cross-Aligned Latent Embeddings (LCALE) as an alternative to these GAN-based methods for ZS-CMR. Rather than using the class-embeddings as the semantic space, our method seeks a shared low-dimensional latent space of input multimodal features and class-embeddings by means of modality-specific variational autoencoders. Notably, we align the distributions learned from multimodal input features and from class-embeddings to construct latent embeddings that contain the essential cross-modal correlation associated with unseen classes. Effective cross-reconstruction and cross-alignment criteria are further developed to preserve class-discriminative information in latent space, which benefits retrieval efficiency and enables knowledge transfer to unseen classes. We evaluate our model using four benchmark datasets on image-text retrieval tasks and one large-scale dataset on image-sketch retrieval tasks. The experimental results show that our method establishes new state-of-the-art performance for both tasks on all datasets.
