Specification of Generalized Linear Mixed Models for Family Data using Markov Chain Monte Carlo MethodsKris M Jamsen1*, Sophie G Zaloumis1,2, Katrina J Scurrah1,2 and Lyle C Gurrin1
- *Corresponding Author:
- Kris M Jamsen
Centre for Molecular, Environmental, Genetic and Analytic Epidemiology
Level 3, 207 Bouverie St, The University of Melbourne
Victoria 3010, Australia
Tel: +61 (0)3 8344 0700
Fax: +61 (0)3 9349 5815
E-mail: [email protected]
Received date: December 21, 2011; Accepted date: February 15, 2012; Published date: February 21, 2012
Citation: Jamsen KM, Zaloumis SG, Scurrah KJ, Gurrin LC (2012) Specification of Generalized Linear Mixed Models for Family Data using Markov Chain Monte Carlo Methods. J Biomet Biostat S1:003. doi: 10.4172/2155-6180.S1-003
Copyright: © 2012 Jamsen KM, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Statistical models imposed on family data can be used to partition phenotypic variation into components due to sharing of both genetic and environmental risk factors for disease. Generalized linear mixed models (GLMMs) are useful tools for the analysis of family data, but it is not always clear how to specify individual-level regression equations so that the resulting within-family variance-covariance matrix of the phenotype reflects the correlation implied by the relatedness of individuals within families. This is particularly challenging when families are of varying sizes and compositions. In this paper we propose a general approach to specifying GLMMs for family data that uses a decomposition of the within-family variance-covariance matrix of the phenotype to set up a series of regression equations with fixed and random effects that corresponds to an appropriate genetic model. This “mechanistic” specification is particularly suited to estimation and evaluation of models within a Markov chain Monte Carlo (MCMC) framework. The proposed approach was assessed with simulated data to demonstrate the accuracy of estimation of the variance components. We analyzed data from the Victorian Family Heart Study (families with two generations over-sampled for those with monozygotic and dizygotic twins) and for a binary phenotype (hypertension) that resulted in substantially reduced computation time in the MCMC framework (via WinBUGS) compared with a maximum likelihood approach (via Stata). The proposed approach to model specification in this paper is easily implemented using standard software and can accommodate prior information on the magnitude of fixed or random effects.