论文标题
看到您相信或相信您所看到的?信念偏见相关估计
Seeing What You Believe or Believing What You See? Belief Biases Correlation Estimation
论文作者
论文摘要
当分析师或科学家对世界的运作方式有信念时,他们的思想可能会偏向于这种信念。因此,一个基岩原理是通过测试对客观数据的信念的预测来最大程度地减少这种偏见。但是解释可视化数据是一个复杂的感知和认知过程。通过两个众包实验,我们证明了对相关关系强度的据称客观评估可以受到观众对这种关系的存在的强烈信念的影响。参与者查看了散点图,描绘了有意义的变量对(例如,环境法规和空气质量的数量)之间的关系,并估计了它们的相关性。他们还估计了与通用的“ x”和“ y”轴相同的相同散点图的相关性。在另一个部分中,他们还报告了他们认为有意义的变量对之间存在相关性的强烈强烈。与用有意义的可变对标记的散点图相比,参与者查看标记为通用轴的散点图时,他们更准确地估计了相关性。此外,当观众认为两个变量应该具有牢固的关系时,它们高估了这些变量之间的相关性,其R值约为0.1。当他们认为变量应与之无关时,他们低估了R值约为0.1的相关性。尽管通常认为数据可视化是向观众提出客观真理,但这些结果表明,现有的个人信念甚至可以使人们从数据中提取的客观统计值偏向。
When an analyst or scientist has a belief about how the world works, their thinking can be biased in favor of that belief. Therefore, one bedrock principle of science is to minimize that bias by testing the predictions of one's belief against objective data. But interpreting visualized data is a complex perceptual and cognitive process. Through two crowdsourced experiments, we demonstrate that supposedly objective assessments of the strength of a correlational relationship can be influenced by how strongly a viewer believes in the existence of that relationship. Participants viewed scatterplots depicting a relationship between meaningful variable pairs (e.g., number of environmental regulations and air quality) and estimated their correlations. They also estimated the correlation of the same scatterplots labeled instead with generic 'X' and 'Y' axes. In a separate section, they also reported how strongly they believed there to be a correlation between the meaningful variable pairs. Participants estimated correlations more accurately when they viewed scatterplots labeled with generic axes compared to scatterplots labeled with meaningful variable pairs. Furthermore, when viewers believed that two variables should have a strong relationship, they overestimated correlations between those variables by an r-value of about 0.1. When they believed that the variables should be unrelated, they underestimated the correlations by an r-value of about 0.1. While data visualizations are typically thought to present objective truths to the viewer, these results suggest that existing personal beliefs can bias even objective statistical values people extract from data.