Description length and dimensionality reduction in functional data analysis

Don Stephen Poskitt, Sengarapillai Arivalzahan

Research output: Contribution to journalArticleResearchpeer-review

12 Citations (Scopus)

Abstract

The use of description length principles to select an appropriate number of basis functions for functional data is investigated. A flexible definition of the dimension of a random function that is constructed directly from the Karhunen-Loeve expansion of the observed process or data generating mechanism is provided. The results obtained show that although the classical, principle component variance decomposition technique will behave in a coherent manner, in general, the dimension chosen by this technique will not be consistent in the conventional sense. Two description length criteria are described. Both of these criteria are proved to be consistent and it is shown that in low noise settings they will identify the true finite dimension of a signal that is embedded in noise. Two examples, one from mass spectroscopy and the other from climatology, are used to illustrate the basic ideas. The application of different forms of the bootstrap for functional data is also explored and used to demonstrate the workings of the theoretical results.
Original languageEnglish
Pages (from-to)98 - 113
Number of pages16
JournalComputational Statistics and Data Analysis
Volume58
DOIs
Publication statusPublished - 2013

Cite this