Collection and analysis of self-reported information on an ordered Likert scale is ubiquitous across the social sciences. Inference from such analyses is valid where the response scale employed means the same thing to all individuals. That is, if there is no differential item functioning (DIF) present in the data. A priori this is unlikely to hold across all individuals and cohorts in any sample of data. For this reason, anchoring vignettes have been proposed as a way to correct for DIF when individuals self-assess their health (or well-being, or satisfaction levels, or disability levels, etc.) on an ordered categorical scale. Using an example of self-assessed pain, we illustrate the use of vignettes to adjust for DIF using the compound hierarchical ordered probit model (CHOPIT). The validity of this approach relies on the two underlying assumptions of response consistency (RC) and vignette equivalence (VE). Using a minor amendment to the specification of the standard CHOPIT model, we develop easy-to-implement score tests of the null hypothesis of RC and VE both separately and jointly. Monte Carlo simulations show that the tests have good size and power properties in finite samples. We illustrate the use of the tests by applying them to our empirical example. The tests should aid more robust analyses of self-reported survey outcomes collected alongside anchoring vignettes.
|Number of pages||34|
|Journal||Journal of the Royal Statistical Society Series A-Statistics in Society|
|Publication status||Published - Jan 2021|
- anchoring vignettes
- differential item functioning
- ordered response models
- score test