Phenotypic Factor Analysis of Family Data: Correction of the Bias Due to Dependency

Irene Rebollo; Marleen H. M. de Moor; Conor V. Dolan; Dorret I. Boomsma

doi:10.1375/twin.9.3.367

Phenotypic Factor Analysis of Family Data: Correction of the Bias Due to Dependency

Twin Research and Human Genetics ◽

10.1375/twin.9.3.367 ◽

2006 ◽

Vol 9 (3) ◽

pp. 367-376 ◽

Cited By ~ 35

Author(s):

Irene Rebollo ◽

Marleen H. M. de Moor ◽

Conor V. Dolan ◽

Dorret I. Boomsma

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Likelihood Estimation ◽

Family Resemblance ◽

Family Data ◽

Standard Errors ◽

Shared Environment ◽

Phenotypic Analysis ◽

Chi Square ◽

Phenotypic Data

AbstractTwin registries form an exceptionally rich source of information that is largely unexploited for phenotypic analyses. One obstacle to straightforward phenotypic statistical analysis is the inherent dependency, which is due to the clustering of cases within families. The present simulation study gauges the degree of the bias produced by the dependency of family data on the estimates of standard errors and chi-squared, when they are treated as independent observations in a phenotypic model, and assesses the efficiency of an estimator, which corrects for dependency. When family-clustered data are used for phenotypic analysis, in treating individuals as independent, and using standard maximum likelihood estimation, there is a tendency for the chi-square statistic to be overestimated, and the standard errors of the parameters to be underestimated. The bias increases with family resemblance, due to heritability or shared environment. The source of family resemblance — either heritability (h2) and/or shared environment (c2) — interacts with the composition of the sample. In the absence of c2, samples with twins, parents and spouses show the lowest bias, whereas in the presence of c2 samples with only twins show the lowest bias. In all conditions the bias remained below 15%. The use of the ‘complex option’ available in Mplus (clustering corrected robust maximum likelihood estimation) reduces the bias to the levels observed when only independent cases are considered. Thus with the use of robust estimates the bias due to family dependency becomes practically negligible in all conditions of dependency. In conclusion, the present study shows that the bias due to dependency in family data does not form a serious obstacle to phenotypic data analysis.

Download Full-text

Estimating Genotypic Correlations and Their Standard Errors Using Multivariate Restricted Maximum Likelihood Estimation with SAS Proc MIXED

Crop Science ◽

10.2135/cropsci2005.0191 ◽

2006 ◽

Vol 46 (2) ◽

pp. 642-654 ◽

Cited By ~ 207

Author(s):

James B. Holland

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Likelihood Estimation ◽

Restricted Maximum Likelihood ◽

Standard Errors ◽

Genotypic Correlations ◽

Restricted Maximum Likelihood Estimation

Download Full-text

Maximum likelihood estimation of components of variance and correlations in the analysis of family data

Annals of Human Genetics ◽

10.1111/j.1469-1809.1987.tb00878.x ◽

1987 ◽

Vol 51 (3) ◽

pp. 259-264 ◽

Cited By ~ 9

Author(s):

A. BENER ◽

S. HUDA

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Likelihood Estimation ◽

Family Data ◽

Components Of Variance

Download Full-text

Standard-error Correction in Two-stage Optimization Models: A Quasi–maximum Likelihood Estimation Approach

The Stata Journal Promoting communications on statistics and Stata ◽

10.1177/1536867x1801800113 ◽

2018 ◽

Vol 18 (1) ◽

pp. 206-222 ◽

Cited By ~ 1

Author(s):

Fernando Rios-Avila ◽

Gustavo Canavire-Bacarreza

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Error Correction ◽

Likelihood Estimation ◽

Optimization Models ◽

Standard Errors ◽

Two Stage ◽

Maximum Likelihood Approach ◽

Likelihood Approach ◽

Quasi Maximum Likelihood

Following Wooldridge (2014, Journal of Econometrics 182: 226–234), we discuss and implement in Stata an efficient maximum-likelihood approach to the estimation of corrected standard errors of two-stage optimization models. Specifically, we compare the robustness and efficiency of the proposed method with routines already implemented in Stata to deal with selection and endogeneity problems. This strategy is an alternative to the use of bootstrap methods and has the advantage that it can be easily applied for the estimation of two-stage optimization models for which already built-in programs are not yet available. It could be of particular use for addressing endogeneity in a nonlinear framework.

Download Full-text

Two Equivalent Discrepancy Functions for Maximum Likelihood Estimation: Do Their Test Statistics Follow a Non-Central Chi-Square Distribution under Model Misspecification?

Sociological Methods & Research ◽

10.1177/0049124103258131 ◽

2004 ◽

Vol 32 (4) ◽

pp. 453-500 ◽

Cited By ~ 24

Author(s):

Ulf Henning Olsson ◽

Tron Foss ◽

Einar Breivik

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Model Misspecification ◽

Likelihood Estimation ◽

Test Statistics ◽

Chi Square

Download Full-text

A Comparison of Minimum Logit Chi-Square Estimation and Maximum Likelihood Estimation in 2×2×2 and 3×2×2 Contingency Tables: Tests for Interaction

Journal of the American Statistical Association ◽

10.1080/01621459.1970.10481192 ◽

1970 ◽

Vol 65 (332) ◽

pp. 1617-1631 ◽

Cited By ~ 2

Author(s):

Charles L. Odoroff

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Contingency Tables ◽

Likelihood Estimation ◽

Chi Square

Download Full-text

Three-marker phenotypic analysis of lymphocytes based on two-color immunofluorescence using a multinomial model for flow cytometric counts and maximum likelihood estimation

Cytometry ◽

10.1002/cyto.990140210 ◽

1993 ◽

Vol 14 (2) ◽

pp. 179-187 ◽

Cited By ~ 2

Author(s):

W. L. J. van Putten ◽

J. Kortboyer ◽

R. L. H. Bolhuis ◽

J. W. Gratama

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Likelihood Estimation ◽

Multinomial Model ◽

Phenotypic Analysis ◽

Flow Cytometric

Download Full-text

Comparison of maximum likelihood estimation and chi-square statistics applied to counting experiments

Nuclear Instruments and Methods in Physics Research Section A Accelerators Spectrometers Detectors and Associated Equipment ◽

10.1016/s0168-9002(00)00756-7 ◽

2001 ◽

Vol 457 (1-2) ◽

pp. 384-401 ◽

Cited By ~ 60

Author(s):

T. Hauschild ◽

M. Jentschel

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Likelihood Estimation ◽

Chi Square

Download Full-text

A Comparison of Minimum Logit Chi-Square Estimation and Maximum Likelihood Estimation in 2 | times 2 | times 2 and 3 | times 2 | times 2 Contingency Tables: Tests for Interaction

Journal of the American Statistical Association ◽

10.2307/2284345 ◽

1970 ◽

Vol 65 (332) ◽

pp. 1617 ◽

Cited By ~ 5

Author(s):

Charles L. Odoroff

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Contingency Tables ◽

Likelihood Estimation ◽

Chi Square

Download Full-text

Scalable Bias-corrected Linkage Disequilibrium Estimation Under Genotype Uncertainty

10.1101/2021.02.08.430270 ◽

2021 ◽

Author(s):

David Gerard

Keyword(s):

Linkage Disequilibrium ◽

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Maximum Likelihood Estimators ◽

Likelihood Estimation ◽

Real Data ◽

Standard Errors ◽

Posterior Distributions ◽

Posterior Mean ◽

Genome Wide

AbstractLinkage disequilibrium (LD) estimates are often calculated genome-wide for use in many tasks, such as SNP pruning and LD decay estimation. However, in the presence of genotype uncertainty, naive approaches to calculating LD have extreme attenuation biases, incorrectly suggesting that SNPs are less dependent than in reality. These biases are particularly strong in polyploid organisms, which often exhibit greater levels of genotype uncertainty than diploids. A principled approach using maximum likelihood estimation with genotype likelihoods can reduce this bias, but is prohibitively slow for genome-wide applications. Here, we present scalable moment-based adjustments to LD estimates based on the marginal posterior distributions of the genotypes. We demonstrate, on both simulated and real data, that these moment-based estimators are as accurate as maximum likelihood estimators, and are almost as fast as naive approaches based only on posterior mean genotypes. This opens up bias-corrected LD estimation to genome-wide applications. Additionally, we provide standard errors for these moment-based estimators. All methods are implemented in the ldsep package on the Comprehensive R Archive Network https://cran.r-project.org/package=ldsep.

Download Full-text

Maximum Likelihood Estimation of Structural Equation Models for Continuous Data: Standard Errors and Goodness of Fit

Structural Equation Modeling A Multidisciplinary Journal ◽

10.1080/10705511.2016.1269606 ◽

2017 ◽

Vol 24 (3) ◽

pp. 383-394 ◽

Cited By ~ 58

Author(s):

Alberto Maydeu-Olivares

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Estimation ◽

Structural Equation ◽

Goodness Of Fit ◽

Structural Equation Models ◽

Likelihood Estimation ◽

Standard Errors ◽

Continuous Data ◽

Data Standard

Download Full-text