Gaussian Mixture Models and Probabilistic Decision-Based Neural Networks for Pattern Classification: A Comparative Study

K. K. Yiu; M. W. Mak; C. K. Li

doi:10.1007/s005210050026

BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation

PeerJ ◽

10.7717/peerj.3035 ◽

2017 ◽

Vol 5 ◽

pp. e3035 ◽

Cited By ~ 58

Author(s):

Elaina D. Graham ◽

John F. Heidelberg ◽

Benjamin J. Tully

Keyword(s):

Neural Networks ◽

Microbial Community ◽

Mixture Models ◽

Clustering Algorithm ◽

Gaussian Mixture Models ◽

Gc Content ◽

Gaussian Mixture ◽

Affinity Propagation ◽

Adjusted Rand Index ◽

Low Coverage

Metagenomics has become an integral part of defining microbial diversity in various environments. Many ecosystems have characteristically low biomass and few cultured representatives. Linking potential metabolisms to phylogeny in environmental microorganisms is important for interpreting microbial community functions and the impacts these communities have on geochemical cycles. However, metagenomic studies face the computational hurdle of ‘binning’ contigs into phylogenetically related units or putative genomes. Binning methods have been implemented with varying approaches such as k-means clustering, Gaussian mixture models, hierarchical clustering, neural networks, and two-way clustering; however, many of these suffer from biases against low coverage/abundance organisms and closely related taxa/strains. We introduce a new binning method, BinSanity, that uses the clustering algorithm affinity propagation (AP) to cluster assemblies by coverage, followed by composition-based refinement (tetranucleotide frequency and percent GC content) to optimize bins containing multiple source organisms. Separating composition-based and coverage-based clustering in this way reduces bias against closely related taxa. BinSanity was developed and tested on artificial metagenomes varying in size and complexity. Results indicate that BinSanity has higher precision, recall, and Adjusted Rand Index than five commonly implemented methods. When tested on a previously published environmental metagenome, BinSanity generated high-completion, low-redundancy bins corresponding to the published metagenome-assembled genomes.
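The abstract above reports the Adjusted Rand Index (ARI) as one of its evaluation measures. As a minimal sketch of what that score computes, and not the evaluation code used by the BinSanity authors, the standard pair-counting form of ARI can be written in plain Python. The function name `adjusted_rand_index` and the toy labelings in the usage note are illustrative assumptions.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand Index between two labelings of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("labelings must have the same length")
    n = len(labels_true)
    if n < 2:
        # No pairs of items to compare; treat the labelings as identical.
        return 1.0

    # Contingency counts: items sharing each (reference, predicted) label pair.
    pair_counts = Counter(zip(labels_true, labels_pred))
    true_counts = Counter(labels_true)
    pred_counts = Counter(labels_pred)

    # Number of item pairs placed together in both, in the reference, and in the prediction.
    sum_cells = sum(comb(c, 2) for c in pair_counts.values())
    sum_true = sum(comb(c, 2) for c in true_counts.values())
    sum_pred = sum(comb(c, 2) for c in pred_counts.values())

    # Agreement expected by chance, and the maximum attainable agreement.
    expected = sum_true * sum_pred / comb(n, 2)
    max_index = (sum_true + sum_pred) / 2
    if max_index == expected:
        # Degenerate case, e.g. both labelings use a single cluster.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)
```

In a binning setting, `labels_true` would hold the source genome of each contig and `labels_pred` the bin it was assigned to. Cluster names do not need to match: `adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])` scores a perfect agreement, while labelings that agree no better than chance score near zero.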

Feature selection for pattern classification with Gaussian mixture models: A new objective criterion

Pattern Recognition Letters ◽

10.1016/0167-8655(96)00047-5 ◽

1996 ◽

Vol 17 (8) ◽

pp. 803-809 ◽

Cited By ~ 14

Author(s):

S. Krishnan ◽

K. Samudravijaya ◽

P.V.S. Rao

Keyword(s):

Feature Selection ◽

Mixture Models ◽

Pattern Classification ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Objective Criterion

Automatic Genre Classification of TV Programmes Using Gaussian Mixture Models and Neural Networks

18th International Conference on Database and Expert Systems Applications (DEXA 2007) ◽

10.1109/dexa.2007.92 ◽

2007 ◽

Cited By ~ 9

Author(s):

Maurizio Montagnuolo ◽

Alberto Messina

Keyword(s):

Neural Networks ◽

Mixture Models ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Genre Classification

ACCURACY OF NONPARAMETRIC DENSITY ESTIMATION FOR UNIVARIATE GAUSSIAN MIXTURE MODELS: A COMPARATIVE STUDY

Mathematical Modelling and Analysis ◽

10.3846/mma.2020.10505 ◽

2020 ◽

Vol 25 (4) ◽

pp. 622-641

Author(s):

Jurgita Arnastauskaitė ◽

Tomas Ruzgas

Keyword(s):

Comparative Study ◽

Municipal Solid Waste ◽

Solid Waste ◽

Mixture Models ◽

Density Estimation ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Nonparametric Density Estimation ◽

Estimation Methods ◽

Density Estimators

Flexible and reliable probability density estimation is fundamental in unsupervised learning and classification. Finite Gaussian mixture models are commonly used for this purpose. However, the parametric form of the distribution is not always known, and in that case non-parametric density estimation methods are used. These methods usually become computationally demanding as the number of components increases. In this paper, the accuracy of several nonparametric density estimators is compared by means of simulation. The following approaches are considered: an adaptive bandwidth kernel estimator, a projection pursuit estimator, a logspline estimator, and a k-nearest neighbor estimator. It was concluded that data clustering as a pre-processing step improves the estimation of mixture densities. However, when the data do not have clearly defined clusters, the pre-processing step offers little advantage. The application of density estimators is illustrated using municipal solid waste data collected in Kaunas (Lithuania). The data distribution is similar (i.e., with kurtotic unimodal density) to the benchmark distribution introduced by Marron and Wand. Based on the homogeneity tests, it can be concluded that the distributions of the municipal solid waste fractions in Kutaisi (Georgia), Saint-Petersburg (Russia), and Boryspil (Ukraine) do not differ statistically from the distribution of waste fractions in Kaunas. The distribution of waste data collected in Kaunas (Lithuania) follows the general observations introduced by Marron and Wand (i.e., has one mode and certain kurtosis).
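To make the kind of simulation described above concrete, the sketch below draws a sample from a known univariate Gaussian mixture, fits a kernel density estimate, and measures the integrated squared error against the true density. It is an illustration only: it uses a fixed-bandwidth Gaussian kernel with a normal-reference rule-of-thumb bandwidth, not the adaptive bandwidth estimator or any other estimator studied in the paper, and all function names and mixture parameters are assumptions chosen for the example.

```python
import math
import random


def normal_pdf(x, mean, std):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def mixture_pdf(x, weights, means, stds):
    """True density of a finite univariate Gaussian mixture."""
    return sum(w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, stds))


def sample_mixture(n, weights, means, stds, rng):
    """Draw n points: pick a component by weight, then sample from it."""
    samples = []
    for _ in range(n):
        k = rng.choices(range(len(weights)), weights=weights)[0]
        samples.append(rng.gauss(means[k], stds[k]))
    return samples


def rule_of_thumb_bandwidth(data):
    """Normal-reference bandwidth: 1.06 * sample std * n^(-1/5)."""
    n = len(data)
    mean = sum(data) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))
    return 1.06 * std * n ** (-0.2)


def kde(x, data, bandwidth):
    """Fixed-bandwidth Gaussian kernel density estimate at x."""
    return sum(normal_pdf(x, xi, bandwidth) for xi in data) / len(data)


def integrated_squared_error(f, g, lo, hi, steps=400):
    """Midpoint-rule approximation of the integral of (f - g)^2 over [lo, hi]."""
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * dx
        total += (f(x) - g(x)) ** 2
    return total * dx


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical two-component mixture used only for this demonstration.
    weights, means, stds = [0.5, 0.5], [-2.0, 2.0], [1.0, 1.0]
    data = sample_mixture(500, weights, means, stds, rng)
    h = rule_of_thumb_bandwidth(data)
    ise = integrated_squared_error(
        lambda x: kde(x, data, h),
        lambda x: mixture_pdf(x, weights, means, stds),
        -10.0, 10.0,
    )
    print(f"bandwidth={h:.3f}  ISE={ise:.5f}")
```

Repeating the run over many seeds and sample sizes, and swapping `kde` for another estimator, gives a simple accuracy comparison of the same general shape as a simulation study.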

A comparative study of foreground detection using Gaussian mixture models-novice to novel

2016 16th International Conference on Control, Automation and Systems (ICCAS) ◽

10.1109/iccas.2016.7832485 ◽

2016 ◽

Cited By ~ 1

Author(s):

Ajmal Shahbaz ◽

Laksono Kurnianggoro ◽

Kang-Hyun Jo

Keyword(s):

Comparative Study ◽

Mixture Models ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Foreground Detection

Determination of the Representative Socioeconomic Level by BSA in the Mexican Republic

Revista Perspectiva Empresarial ◽

10.16967/rpe.v5n2a6 ◽

2018 ◽

Vol 5 (2) ◽

pp. 83-100

Author(s):

María Dolores Luquín-García ◽

Edith Cecilia Macedo Ruíz ◽

Omar Rojas-Altamirano ◽

Carlos López-Hernández

Keyword(s):

Neural Networks ◽

Mixture Models ◽

Market Research ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Socioeconomic Level ◽

Statistical Area

The aim of this article is to determine the socioeconomic level (SEL) at the level of the Basic Statistical Area (BSA) in the Mexican Republic. The methodology used is the one established by the Mexican Association of Market Research Agencies (AMAI) together with the National Institute of Statistics and Geography (INEGI). The BSAs were clustered according to variables contained in the Population and Housing Census of 2010, using Gaussian mixture models and learning neural networks, and finally by defining the labels corresponding to each SEL. We found a representative SEL for each BSA. In addition, the definition of each socioeconomic level shows good results, with an average of 90.86% of elements correctly labeled.

Speech emotion recognition based on Gaussian Mixture Models and Deep Neural Networks

2017 Information Theory and Applications Workshop (ITA) ◽

10.1109/ita.2017.8023477 ◽

2017 ◽

Cited By ~ 5

Author(s):

Ivan J. Tashev ◽

Zhong-Qiu Wang ◽

Keith Godin

Keyword(s):

Neural Networks ◽

Emotion Recognition ◽

Mixture Models ◽

Deep Neural Networks ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Speech Emotion Recognition

A Comparative Study on Microcalcification Detection Methods with Posterior Probability Estimation based on Gaussian Mixture Models

2005 IEEE Engineering in Medicine and Biology 27th Annual Conference ◽

10.1109/iembs.2005.1616339 ◽

2005 ◽

Cited By ~ 1

Author(s):

P. Casaseca-de-la-Higuera ◽

J.I. Arribas ◽

E. Munoz-Moreno ◽

C. Alberola-Lopez

Keyword(s):

Comparative Study ◽

Mixture Models ◽

Posterior Probability ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Detection Methods ◽

Probability Estimation ◽

Microcalcification Detection

Automatic Genre Classification of TV Programmes Using Gaussian Mixture Models and Neural Networks

18th International Conference on Database and Expert Systems Applications (DEXA 2007) ◽

10.1109/dexa.2007.4312865 ◽

2007 ◽

Cited By ~ 2

Author(s):

Maurizio Montagnuolo ◽

Alberto Messina

Keyword(s):

Neural Networks ◽

Mixture Models ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Genre Classification

Deep neural networks with auxiliary Gaussian mixture models for real-time speech recognition

2013 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2013.6639148 ◽

2013 ◽

Cited By ~ 8

Author(s):

Xin Lei ◽

Hui Lin ◽

Georg Heigold

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Real Time ◽

Mixture Models ◽

Deep Neural Networks ◽

Gaussian Mixture Models ◽

Gaussian Mixture
