A predictive framework for modeling healthcare data with evolving clinical interventions

Santu Rana, Sunil Gupta, Dinh Phung, Svetha Venkatesh

Research output: Contribution to journalArticleResearchpeer-review

6 Citations (Scopus)


Medical interventions critically determine clinical outcomes. But prediction models either ignore interventions or dilute impact by building a single prediction rule by amalgamating interventions with other features. One rule across all interventions may not capture differential effects. Also, interventions change with time as innovations are made, requiring prediction models to evolve over time. To address these gaps, we propose a prediction framework that explicitly models interventions by extracting a set of latent intervention groups through a Hierarchical Dirichlet Process (HDP) mixture. Data are split in temporal windows and for each window, a separate distribution over the intervention groups is learnt. This ensures that the model evolves with changing interventions. The outcome is modeled as conditional, on both the latent grouping and the patients' condition, through a Bayesian logistic regression. Learning distributions for each time-window result in an over-complex model when interventions do not change in every time-window. We show that by replacing HDP with a dynamic HDP prior, a more compact set of distributions can be learnt. Experiments performed on two hospital datasets demonstrate the superiority of our framework over many existing clinical and traditional prediction frameworks.

Original languageEnglish
Pages (from-to)162-182
Number of pages21
JournalStatistical Analysis and Data Mining
Issue number3
Publication statusPublished - 2015
Externally publishedYes


  • Data mining
  • Healthcare data modeling
  • Machine learning

Cite this