scholarly journals Phenotypic Factor Analysis of Family Data: Correction of the Bias Due to Dependency

2006 ◽  
Vol 9 (3) ◽  
pp. 367-376 ◽  
Author(s):  
Irene Rebollo ◽  
Marleen H. M. de Moor ◽  
Conor V. Dolan ◽  
Dorret I. Boomsma

AbstractTwin registries form an exceptionally rich source of information that is largely unexploited for phenotypic analyses. One obstacle to straightforward phenotypic statistical analysis is the inherent dependency, which is due to the clustering of cases within families. The present simulation study gauges the degree of the bias produced by the dependency of family data on the estimates of standard errors and chi-squared, when they are treated as independent observations in a phenotypic model, and assesses the efficiency of an estimator, which corrects for dependency. When family-clustered data are used for phenotypic analysis, in treating individuals as independent, and using standard maximum likelihood estimation, there is a tendency for the chi-square statistic to be overestimated, and the standard errors of the parameters to be underestimated. The bias increases with family resemblance, due to heritability or shared environment. The source of family resemblance — either heritability (h2) and/or shared environment (c2) — interacts with the composition of the sample. In the absence of c2, samples with twins, parents and spouses show the lowest bias, whereas in the presence of c2 samples with only twins show the lowest bias. In all conditions the bias remained below 15%. The use of the ‘complex option’ available in Mplus (clustering corrected robust maximum likelihood estimation) reduces the bias to the levels observed when only independent cases are considered. Thus with the use of robust estimates the bias due to family dependency becomes practically negligible in all conditions of dependency. In conclusion, the present study shows that the bias due to dependency in family data does not form a serious obstacle to phenotypic data analysis.

Author(s):  
Fernando Rios-Avila ◽  
Gustavo Canavire-Bacarreza

Following Wooldridge (2014, Journal of Econometrics 182: 226–234), we discuss and implement in Stata an efficient maximum-likelihood approach to the estimation of corrected standard errors of two-stage optimization models. Specifically, we compare the robustness and efficiency of the proposed method with routines already implemented in Stata to deal with selection and endogeneity problems. This strategy is an alternative to the use of bootstrap methods and has the advantage that it can be easily applied for the estimation of two-stage optimization models for which already built-in programs are not yet available. It could be of particular use for addressing endogeneity in a nonlinear framework.


2021 ◽  
Author(s):  
David Gerard

AbstractLinkage disequilibrium (LD) estimates are often calculated genome-wide for use in many tasks, such as SNP pruning and LD decay estimation. However, in the presence of genotype uncertainty, naive approaches to calculating LD have extreme attenuation biases, incorrectly suggesting that SNPs are less dependent than in reality. These biases are particularly strong in polyploid organisms, which often exhibit greater levels of genotype uncertainty than diploids. A principled approach using maximum likelihood estimation with genotype likelihoods can reduce this bias, but is prohibitively slow for genome-wide applications. Here, we present scalable moment-based adjustments to LD estimates based on the marginal posterior distributions of the genotypes. We demonstrate, on both simulated and real data, that these moment-based estimators are as accurate as maximum likelihood estimators, and are almost as fast as naive approaches based only on posterior mean genotypes. This opens up bias-corrected LD estimation to genome-wide applications. Additionally, we provide standard errors for these moment-based estimators. All methods are implemented in the ldsep package on the Comprehensive R Archive Network https://cran.r-project.org/package=ldsep.


Sign in / Sign up

Export Citation Format

Share Document