Logistic Normal Distribution: Recently Published Documents


TOTAL DOCUMENTS: 13 (last five years: 0)
H-INDEX: 4 (last five years: 0)

2020 · Vol. 10 (1) · pp. 42-86
Author(s): Chundra A. Cathcart

Abstract This paper uses a novel data-driven probabilistic approach to address the century-old Inner-Outer hypothesis of Indo-Aryan. I develop a Bayesian hierarchical mixed-membership model to assess the validity of this hypothesis using a large data set of automatically extracted sound changes operating between Old Indo-Aryan and Modern Indo-Aryan speech varieties. I employ different prior distributions in order to model sound change, one of which, the Logistic Normal distribution, has not received much attention in linguistics outside of Natural Language Processing, despite its many attractive features. I find evidence for cohesive dialect groups that have made their imprint on contemporary Indo-Aryan languages, and find that when a Logistic Normal prior is used, the distribution of dialect components across languages is largely compatible with a core-periphery pattern similar to that proposed under the Inner-Outer hypothesis.
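The logistic normal prior mentioned above places a multivariate Gaussian on an unconstrained vector and maps it to the probability simplex via the softmax function; unlike the Dirichlet, its covariance matrix can encode correlations between components. A minimal sketch of sampling from such a prior (illustrative parameter values, not taken from the paper):

```python
import numpy as np

def sample_logistic_normal(mu, cov, rng=None):
    """Draw one sample from a logistic normal distribution:
    eta ~ N(mu, cov), then softmax(eta) lands on the simplex."""
    rng = np.random.default_rng(0) if rng is None else rng
    eta = rng.multivariate_normal(mu, cov)
    e = np.exp(eta - eta.max())  # subtract max for numerical stability
    return e / e.sum()           # nonnegative weights summing to 1

# The off-diagonal covariance entries let components co-vary,
# which the Dirichlet cannot express.
mu = np.zeros(3)
cov = np.array([[1.0,  0.8, -0.5],
                [0.8,  1.0, -0.3],
                [-0.5, -0.3, 1.0]])
theta = sample_logistic_normal(mu, cov)
print(theta)
```

The draw `theta` is a point on the probability simplex, e.g. a vector of dialect-component weights for one language.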


Author(s): Guangxu Xun · Yaliang Li · Wayne Xin Zhao · Jing Gao · Aidong Zhang

Conventional correlated topic models capture correlation structure among latent topics by replacing the Dirichlet prior with the logistic normal distribution. Word embeddings have been shown to capture semantic regularities in language, so the semantic relatedness and correlations between words can be computed directly in the word embedding space, for example via cosine similarity. In this paper, we propose a novel correlated topic model using word embeddings. The proposed model enables us to exploit the additional word-level correlation information in word embeddings and directly model topic correlation in the continuous word embedding space. In the model, words in documents are replaced with meaningful word embeddings, topics are modeled as multivariate Gaussian distributions over the word embeddings, and topic correlations are learned among the continuous Gaussian topics. A Gibbs sampling solution with data augmentation is given to perform inference. We evaluate our model on the 20 Newsgroups and Reuters-21578 datasets, qualitatively and quantitatively. The experimental results show the effectiveness of the proposed model.
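The word-level correlation the abstract refers to is typically measured as the cosine of the angle between embedding vectors. A minimal sketch with toy 4-dimensional embeddings (the vectors are illustrative values, not output of a trained model):

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two embedding vectors, in [-1, 1]."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Toy embeddings: semantically related words get nearby vectors.
emb = {
    "king":  np.array([0.90, 0.10, 0.40, 0.20]),
    "queen": np.array([0.85, 0.15, 0.45, 0.25]),
    "car":   np.array([0.10, 0.90, 0.00, 0.30]),
}
print(cosine_similarity(emb["king"], emb["queen"]))  # near 1: related
print(cosine_similarity(emb["king"], emb["car"]))    # much smaller
```

In the model described above, this geometry is exploited directly: a Gaussian topic in embedding space assigns high density to clusters of mutually similar word vectors.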


2016 · Vol. 45 (4) · pp. 3-23
Author(s): John Bear · Dean Billheimer

The usual candidate distributions for modeling compositions, the Dirichlet and the logistic normal distribution, do not include zero components in their support. Methods have been developed and refined for dealing with zeros that are rounded or that fall below a detection limit, and for zeros in compositions arising from count data. However, essential zeros (cases where a component is truly absent) in continuous compositions remain a problem. The most promising approach extends the logistic normal distribution to model essential zeros using a mixture of additive logistic normal distributions of different dimensions, related by common parameters. We continue this approach and, by imposing an additional constraint, develop a likelihood and show ways of estimating parameters for location and dispersion. The proposed likelihood, conditional on parameters for the probability of zeros, is a mixture of additive logistic normal distributions of different dimensions whose location and dispersion parameters are projections of a common location or dispersion parameter. For some simple special cases, we contrast the relative efficiency of different location estimators.
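The additive logistic normal distribution underlying this construction is defined through the additive log-ratio (alr) transform, which maps a D-part composition to an unconstrained vector in R^(D-1); the transform is undefined whenever a part equals zero, which is exactly why essential zeros force the mixture-of-dimensions treatment described above. A minimal sketch of the transform and its inverse:

```python
import numpy as np

def alr(x):
    """Additive log-ratio transform: maps a strictly positive D-part
    composition (sums to 1) to R^(D-1), using the last part as reference.
    Undefined if any part is zero."""
    x = np.asarray(x, dtype=float)
    return np.log(x[:-1] / x[-1])

def alr_inv(y):
    """Inverse alr: maps a point in R^(D-1) back to the open simplex."""
    y = np.asarray(y, dtype=float)
    e = np.append(np.exp(y), 1.0)  # reference part has coordinate 0
    return e / e.sum()

comp = np.array([0.2, 0.5, 0.3])
y = alr(comp)            # [log(0.2/0.3), log(0.5/0.3)]
back = alr_inv(y)        # recovers the original composition
print(y, back)
```

Placing a multivariate normal on the transformed coordinates `y` yields the additive logistic normal distribution on the simplex; a composition with an essential zero has no alr image, so it must be handled by a lower-dimensional mixture component.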


2016 · Vol. 10 (2) · pp. 311-335
Author(s): Altemir da Silva Braga · Gauss M. Cordeiro · Edwin M. M. Ortega · José Nilton da Cruz

2012 · Vol. 7 (4) · pp. 997-1034
Author(s): John Paisley · Chong Wang · David M. Blei
