TY - JOUR
T1 - Making an unknown unknown a known unknown
T2 - Missing data in longitudinal neuroimaging studies
AU - Matta, Tyler H.
AU - Flournoy, John C.
AU - Byrne, Michelle L.
PY - 2018/10
Y1 - 2018/10
N2 - The analysis of longitudinal neuroimaging data within the massively univariate framework provides the opportunity to study empirical questions about neurodevelopment. Missing outcome data are an all-to-common feature of any longitudinal study, a feature that, if handled improperly, can reduce statistical power and lead to biased parameter estimates. The goal of this paper is to provide conceptual clarity of the issues and non-issues that arise from analyzing incomplete data in longitudinal studies with particular focus on neuroimaging data. This paper begins with a review of the hierarchy of missing data mechanisms and their relationship to likelihood-based methods, a review that is necessary not just for likelihood-based methods, but also for multiple-imputation methods. Next, the paper provides a series of simulation studies with designs common in longitudinal neuroimaging studies to help illustrate missing data concepts regardless of interpretation. Finally, two applied examples are used to demonstrate the sensitivity of inferences under different missing data assumptions and how this may change the substantive interpretation. The paper concludes with a set of guidelines for analyzing incomplete longitudinal data that can improve the validity of research findings in developmental neuroimaging research.
AB - The analysis of longitudinal neuroimaging data within the massively univariate framework provides the opportunity to study empirical questions about neurodevelopment. Missing outcome data are an all-to-common feature of any longitudinal study, a feature that, if handled improperly, can reduce statistical power and lead to biased parameter estimates. The goal of this paper is to provide conceptual clarity of the issues and non-issues that arise from analyzing incomplete data in longitudinal studies with particular focus on neuroimaging data. This paper begins with a review of the hierarchy of missing data mechanisms and their relationship to likelihood-based methods, a review that is necessary not just for likelihood-based methods, but also for multiple-imputation methods. Next, the paper provides a series of simulation studies with designs common in longitudinal neuroimaging studies to help illustrate missing data concepts regardless of interpretation. Finally, two applied examples are used to demonstrate the sensitivity of inferences under different missing data assumptions and how this may change the substantive interpretation. The paper concludes with a set of guidelines for analyzing incomplete longitudinal data that can improve the validity of research findings in developmental neuroimaging research.
KW - Likelihood
KW - Longitudinal data
KW - Missing data
KW - Neuroimaging
UR - http://www.scopus.com/inward/record.url?scp=85033368867&partnerID=8YFLogxK
U2 - 10.1016/j.dcn.2017.10.001
DO - 10.1016/j.dcn.2017.10.001
M3 - Review Article
C2 - 29129673
AN - SCOPUS:85033368867
SN - 1878-9293
VL - 33
SP - 83
EP - 98
JO - Developmental Cognitive Neuroscience
JF - Developmental Cognitive Neuroscience
ER -