Skip to main navigation Skip to search Skip to main content

Burning sage: reversing the curse of dimensionality in the visualization of high-dimensional data

Research output: Contribution to journalArticleResearchpeer-review

Abstract

In high-dimensional data analysis, the curse of dimensionality reasons that points tend to be far away from the center of the distribution and on the edge of high-dimensional space. Contrary to this, is that projected data tends to clump at the center. This gives a sense that any structure near the center of the projection is obscured, whether this is true or not. A geometric transformation to reverse the curse, is defined in this article, which uses radial transformations on the projected data. It is integrated seamlessly into the grand tour algorithm, and we have called it a burning sage tour, to indicate that it reverses the curse. The work is implemented into the tourr package in R. Several case studies are included that show how the sage visualizations enhance exploratory clustering and classification problems. Supplementary files for this article are available online.

Original languageEnglish
Pages (from-to)40-49
Number of pages10
JournalJournal of Computational and Graphical Statistics
Volume31
Issue number1
DOIs
Publication statusPublished - Jan 2022

Keywords

  • Data science
  • Data visualization
  • Dynamic graphics
  • Grand tour
  • Machine learning
  • Multivariate data
  • Statistical computing
  • Statistical graphics

Cite this