Abstract
Stability in clinical prediction models is crucial for transferability between studies, yet has received little attention. The problem is paramount in high dimensional data, which invites sparse models with feature selection capability. We introduce an effective method to stabilize sparse Cox model of time-to-events using statistical and semantic structures inherent in Electronic Medical Records (EMR). Model estimation is stabilized using three feature graphs built from (i) Jaccard similarity among features (ii) aggregation of Jaccard similarity graph and a recently introduced semantic EMR graph (iii) Jaccard similarity among features transferred from a related cohort. Our experiments are conducted on two real world hospital datasets: a heart failure cohort and a diabetes cohort. On two stability measures - the Consistency index and signal-to-noise ratio (SNR) - the use of our proposed methods significantly increased feature stability when compared with the baselines.
Original language | English |
---|---|
Title of host publication | Advances in Knowledge Discovery and Data Mining |
Subtitle of host publication | 19th Pacific-Asia Conference, PAKDD 2015 Ho Chi Minh City, Vietnam, May 19–22, 2015 Proceedings, Part II |
Editors | Tru Cao, Ee-Peng Lim, Zhi-Hua Zhou, Tu-Bao Ho, David Cheung, Hiroshi Motoda |
Place of Publication | Cham Switzerland |
Publisher | Springer |
Pages | 331-343 |
Number of pages | 13 |
ISBN (Electronic) | 9783319180328 |
ISBN (Print) | 9783319180311 |
DOIs | |
Publication status | Published - 2015 |
Externally published | Yes |
Event | Pacific-Asia Conference on Knowledge Discovery and Data Mining 2015 - Ho Chi Minh City, Vietnam Duration: 19 May 2015 → 22 May 2015 Conference number: 19th https://web.archive.org/web/20150429212339/http://www.pakdd2015.jvn.edu.vn/ https://link.springer.com/book/10.1007/978-3-319-18038-0 (Proceedings) |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 9078 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | Pacific-Asia Conference on Knowledge Discovery and Data Mining 2015 |
---|---|
Abbreviated title | PAKDD 2015 |
Country/Territory | Vietnam |
City | Ho Chi Minh City |
Period | 19/05/15 → 22/05/15 |
Internet address |