TY - JOUR
T1 - Identifying dietary patterns using a normal mixture model
T2 - Application to the EPIC study
AU - Fahey, Michael T.
AU - Ferrari, Pietro
AU - Slimani, Nadia
AU - Vermunt, Jeroen K.
AU - White, Ian R.
AU - Hoffmann, Kurt
AU - Wirfält, Elisabet
AU - Bamia, Christina
AU - Touvier, Mathilde
AU - Linseisen, Jakob
AU - Rodríguez-Barranco, Miguel
AU - Tumino, Rosario
AU - Lund, Eiliv Eylin
AU - Overvad, Kim
AU - de Mesquita, Bas Bueno
AU - Bingham, Sheila
AU - Riboli, Elio B
PY - 2012/1
Y1 - 2012/1
N2 - Background: Finite mixture models posit the existence of a latent categorical variable and can be used for probabilistic classification. The authors illustrate the use of mixture models for dietary pattern analysis. An advantage of this approach is taking classification uncertainty into account. Methods: Participants were a random sample of women from the European Prospective Investigation into Cancer. Food consumption was measured using dietary questionnaires. Mixture models identified latent classes in food consumption data, which were interpreted as dietary patterns. Results: Among various assumptions examined, models allowing the variance of foods to vary within and between classes fit better than alternatives assuming constant variance (the K-means method of cluster analysis also makes the latter assumption). An eight-class model was best fitting and five patterns validated well in a second random sample. Patterns with lower classification uncertainty tended to be better validated. One pattern showed low consumption of foods despite being associated with moderate body mass index. Conclusion: Mixture modelling for dietary pattern analysis has advantages over both factor and cluster analysis. In contrast to these other methods, it is easy to estimate pattern prevalence, to describe patterns and to use patterns to predict disease taking classification uncertainty into account. Owing to substantial error in food consumptions, any analysis will usually find some patterns that cannot be well validated. While knowledge of classification uncertainty may aid pattern evaluation, any method will better identify patterns from food consumptions measured with less error. Mixture models may be useful to identify individuals who under-report food consumption.
AB - Background: Finite mixture models posit the existence of a latent categorical variable and can be used for probabilistic classification. The authors illustrate the use of mixture models for dietary pattern analysis. An advantage of this approach is taking classification uncertainty into account. Methods: Participants were a random sample of women from the European Prospective Investigation into Cancer. Food consumption was measured using dietary questionnaires. Mixture models identified latent classes in food consumption data, which were interpreted as dietary patterns. Results: Among various assumptions examined, models allowing the variance of foods to vary within and between classes fit better than alternatives assuming constant variance (the K-means method of cluster analysis also makes the latter assumption). An eight-class model was best fitting and five patterns validated well in a second random sample. Patterns with lower classification uncertainty tended to be better validated. One pattern showed low consumption of foods despite being associated with moderate body mass index. Conclusion: Mixture modelling for dietary pattern analysis has advantages over both factor and cluster analysis. In contrast to these other methods, it is easy to estimate pattern prevalence, to describe patterns and to use patterns to predict disease taking classification uncertainty into account. Owing to substantial error in food consumptions, any analysis will usually find some patterns that cannot be well validated. While knowledge of classification uncertainty may aid pattern evaluation, any method will better identify patterns from food consumptions measured with less error. Mixture models may be useful to identify individuals who under-report food consumption.
UR - http://www.scopus.com/inward/record.url?scp=84855990250&partnerID=8YFLogxK
U2 - 10.1136/jech.2009.103408
DO - 10.1136/jech.2009.103408
M3 - Article
AN - SCOPUS:84855990250
SN - 0143-005X
VL - 66
SP - 89
EP - 94
JO - Journal of Epidemiology and Community Health
JF - Journal of Epidemiology and Community Health
IS - 1
ER -