QIC program and model selection in GEE analyses.

James JiSheng Cui

Research output: Contribution to journalArticleResearchpeer-review

304 Citations (Scopus)


The generalized estimating equation (GEE) approach is a widely used statistical method in the analysis of longitudinal data in clinical and epidemiological studies. It is an extension of the generalized linear model (GLM) method to correlated data such that valid standard errors of the parameter estimates can be drawn. Unlike the GLM method, which is based on the maximum likelihood theory for independent observations, the GEE method is based on the quasilikelihood theory and no assumption is made about the distribution of response observations. Therefore, Akaike s information criterion, a widely used method for model selection in GLM, is not applicable to GEE directly. However, Pan (Biometrics 2001; 57: 120-125) proposed a model-selection method for GEE and termed it quasilikelihood under the independence model criterion. This criterion can also be used to select the best-working correlation structure. From Pan s methods, I developed a general Stata program, qic, that accommodates all the distribution and link functions and correlation structures available in Stata version 9. In this paper, I introduce this program and demonstrate how to use it to select the best working correlation structure and the best subset of covariates through two examples in longitudinal studies. quasilikelihood under the independence model criterion c 2007 StataCorp LP st0126
Original languageEnglish
Pages (from-to)209 - 220
Number of pages12
JournalThe Stata Journal
Issue number2
Publication statusPublished - 2007

Cite this