Abstract
In high-dimensional data analysis, the curse of dimensionality reasons that points tend to be far away from the center of the distribution and on the edge of high-dimensional space. Contrary to this, is that projected data tends to clump at the center. This gives a sense that any structure near the center of the projection is obscured, whether this is true or not. A geometric transformation to reverse the curse, is defined in this article, which uses radial transformations on the projected data. It is integrated seamlessly into the grand tour algorithm, and we have called it a burning sage tour, to indicate that it reverses the curse. The work is implemented into the tourr package in R. Several case studies are included that show how the sage visualizations enhance exploratory clustering and classification problems. Supplementary files for this article are available online.
| Original language | English |
|---|---|
| Pages (from-to) | 40-49 |
| Number of pages | 10 |
| Journal | Journal of Computational and Graphical Statistics |
| Volume | 31 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2022 |
Keywords
- Data science
- Data visualization
- Dynamic graphics
- Grand tour
- Machine learning
- Multivariate data
- Statistical computing
- Statistical graphics
Projects
- 1 Finished
-
Visualisation of Multidimensional Physics Data
Valencia, G. (Primary Chief Investigator (PCI)), Balazs, C. (Chief Investigator (CI)), Cook, D. (Chief Investigator (CI)), Buja, A. (Partner Investigator (PI)) & Rosati, M. (Partner Investigator (PI))
1/05/17 → 31/12/22
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver