TY - JOUR
T1 - Measuring lineup difficulty by matching distance metrics with subject choices in crowds-sourced data
AU - Chowdhury, Niladri Roy
AU - Cook, Dianne
AU - Hofmann, Heike
AU - Majumder, Mahbubul
PY - 2018/1/2
Y1 - 2018/1/2
N2 - Graphics play a crucial role in statistical analysis and data mining. Being able to quantify structure in data that is visible in plots, and how people read the structure from plots is an ongoing challenge. The lineup protocol provides a formal framework for data plots, making inference possible. The data plot is treated like a test statistic, and lineup protocol acts like a comparison with the sampling distribution of the nulls. This article describes metrics for describing structure in data plots and evaluates them in relation to the choices that human readers made during several large Amazon Turk studies using lineups. The metrics that were more specific to the plot types tended to better match subject choices, than generic metrics. The process that we followed to evaluate metrics will be useful for general development of numerically measuring structure in plots, and also in future experiments on lineups for choosing blocks of pictures. Supplementary materials for this article are available online.
AB - Graphics play a crucial role in statistical analysis and data mining. Being able to quantify structure in data that is visible in plots, and how people read the structure from plots is an ongoing challenge. The lineup protocol provides a formal framework for data plots, making inference possible. The data plot is treated like a test statistic, and lineup protocol acts like a comparison with the sampling distribution of the nulls. This article describes metrics for describing structure in data plots and evaluates them in relation to the choices that human readers made during several large Amazon Turk studies using lineups. The metrics that were more specific to the plot types tended to better match subject choices, than generic metrics. The process that we followed to evaluate metrics will be useful for general development of numerically measuring structure in plots, and also in future experiments on lineups for choosing blocks of pictures. Supplementary materials for this article are available online.
KW - Cognitive perception
KW - Data mining
KW - Data science
KW - Data visualization
KW - Distance metrics
KW - Exploratory data analysis
KW - Information visualization
KW - Statistical graphics
KW - Visual inference
UR - http://www.scopus.com/inward/record.url?scp=85041561007&partnerID=8YFLogxK
U2 - 10.1080/10618600.2017.1356323
DO - 10.1080/10618600.2017.1356323
M3 - Article
AN - SCOPUS:85041561007
SN - 1061-8600
VL - 27
SP - 132
EP - 145
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
IS - 1
ER -