Probability model type sufficiency

Leigh James Fitzgibbon, Lloyd Allison, Joshua William Comley

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    Abstract

    We investigate the role of sufficient statistics in generalized probabilistic data mining and machine learning software frameworks. Some issues involved in the specification of a statistical model type are discussed and we show that it is beneficial to explicitly include a sufficient statistic and functions for its manipulation in the model type’s specification. Instances of such types can then be used by generalized learning algorithms while maintaining optimal learning time complexity. Examples are given for problems such as incremental learning and data partitioning problems (e.g. change-point problems, decision trees and mixture models).
    Original languageEnglish
    Title of host publicationIntelligent Data Engineering and Automated Learning
    Subtitle of host publication4th International Conference, IDEAL 2003 Hong Kong, China, March 21–23, 2003 Revised Papers
    EditorsJiming Liu, Yiuming Cheung, Hujun Yin
    Place of PublicationNew York NY USA
    PublisherSpringer
    Pages530-534
    Number of pages5
    ISBN (Electronic)9783540450801
    ISBN (Print)9783540405504
    DOIs
    Publication statusPublished - 2003
    EventInternational Conference on Intelligent Data Engineering and Automated Learning 2003 - Hong Kong China, New York USA
    Duration: 1 Jan 2003 → …

    Publication series

    NameLecture Notes in Computer Science
    PublisherSpringer
    Volume2690
    ISSN (Print)0302-9743

    Conference

    ConferenceInternational Conference on Intelligent Data Engineering and Automated Learning 2003
    CityNew York USA
    Period1/01/03 → …

    Cite this