Bayesian logistic regression approaches to predict incorrect DRG assignment

Mani Suleiman, Haydar Demirhan, Leanne Boyd, Federico Girosi, Vural Aksakalli

Research output: Contribution to journalArticleResearchpeer-review

5 Citations (Scopus)


Episodes of care involving similar diagnoses and treatments and requiring similar levels of resource utilisation are grouped to the same Diagnosis-Related Group (DRG). In jurisdictions which implement DRG based payment systems, DRGs are a major determinant of funding for inpatient care. Hence, service providers often dedicate auditing staff to the task of checking that episodes have been coded to the correct DRG. The use of statistical models to estimate an episode’s probability of DRG error can significantly improve the efficiency of clinical coding audits. This study implements Bayesian logistic regression models with weakly informative prior distributions to estimate the likelihood that episodes require a DRG revision, comparing these models with each other and to classical maximum likelihood estimates. All Bayesian approaches had more stable model parameters than maximum likelihood. The best performing Bayesian model improved overall classification per- formance by 6% compared to maximum likelihood, with a 34% gain compared to random classification, respectively. We found that the original DRG, coder and the day of coding all have a significant effect on the likelihood of DRG error. Use of Bayesian approaches has improved model parameter stability and classification accuracy. This method has already lead to improved audit efficiency in an operational capacity.

Original languageEnglish
Pages (from-to)364-375
Number of pages12
JournalHealth Care Management Science
Issue number2
Publication statusPublished - Jun 2019
Externally publishedYes


  • Bayesian Analysis
  • Clinical Coding
  • DRGs
  • Health Informatics
  • Statistical Modelling

Cite this