Text mining electronic hospital records to automatically classify admissions against disease: Measuring the impact of linking data sources

Simon Kocbek, Lawrence Cavedon, David Martinez, Christopher Bain, Chris Mac Manus, Gholamreza Haffari, Ingrid Zukerman, Karin Verspoor

    Research output: Contribution to journalArticleResearchpeer-review

    17 Citations (Scopus)

    Abstract

    Text and data mining play an important role in obtaining insights from Health and Hospital Information Systems. This paper presents a text mining system for detecting admissions marked as positive for several diseases: Lung Cancer, Breast Cancer, Colon Cancer, Secondary Malignant Neoplasm of Respiratory and Digestive Organs, Multiple Myeloma and Malignant Plasma Cell Neoplasms, Pneumonia, and Pulmonary Embolism. We specifically examine the effect of linking multiple data sources on text classification performance.
    Original languageEnglish
    Pages (from-to)158-167
    Number of pages10
    JournalJournal of Biomedical Informatics
    Volume64
    DOIs
    Publication statusPublished - 11 Oct 2016

    Keywords

    • Cancer record retrieval
    • Text mining
    • Natural Language Processing
    • Electronic Health Records
    • Radiology
    • Pathology

    Cite this