The Unjournal · Pivotal Questions Initiative

Research Presentation: Benjamin et al.

Scale-use heterogeneity findings and calibration methods

💬 Annotate this page — select any text to comment via Hypothes.is (free account to post; anyone can read)
SEGMENT 5 30 minutes (1:35–2:05 PM ET)

Speakers: Dan Benjamin (UCLA/NBER), Miles Kimball (CU Boulder)

Authors will share an outline of their presentation and invitation for feedback in advance. Participants are encouraged to pre-read and pre-ask questions.

Unjournal Evaluation: Scale-Use Heterogeneity

View the evaluation summary and full paper on PubPub →

Overview

This segment presents the core research findings from Benjamin et al.[1]Benjamin, Cooper, Heffetz, Kimball & Zhou (2023). "Adjusting for Scale-Use Heterogeneity in Self-Reported Well-Being." Full paper available via the Unjournal evaluation. on scale-use heterogeneity in wellbeing surveys. The authors will discuss how different individuals use the 0-10 life satisfaction scale differently,[2]Scale-use heterogeneity can manifest as: (1) different "anchoring" points, (2) different ranges used, (3) different interpretations of scale labels. All create bias in cross-person comparisons. and propose calibration methods to address this measurement challenge.

Key Topics

Collaborative Notes

Open in new tab →

Questions & Comments

Add questions and comments directly to the collaborative notes above, or submit them via the beliefs elicitation form.

Notes

  1. Benjamin, Cooper, Heffetz, Kimball & Zhou (2023). "Adjusting for Scale-Use Heterogeneity in Self-Reported Well-Being." Full paper available via the Unjournal evaluation.
  2. Scale-use heterogeneity can manifest as: (1) different "anchoring" points, (2) different ranges used, (3) different interpretations of scale labels. All create bias in cross-person comparisons.
  3. The paper finds substantial heterogeneity in how people use scales—but also that changes in wellbeing may be more comparable than levels, which is encouraging for intervention evaluation.
  4. Vignette anchoring: asking respondents to rate hypothetical people's wellbeing. This reveals individual scale-use patterns and enables adjustment. Trade-off: increased survey length.
  5. Key open questions include: (1) Do calibration methods work in LMIC contexts? (2) Does scale-use differ systematically between treatment and control groups (causing bias)? (3) How much precision do simple methods sacrifice?