论文标题
2020年人口普查披露系统中的地理刺
Geographic Spines in the 2020 Census Disclosure Avoidance System
论文作者
论文摘要
2020年人口普查披露避免系统(DAS)是一种正式的私人机制,首先为一组预先指定的分层地理单元(称为地理脊柱)添加独立的噪声。在处理这些嘈杂的测量值之后,DAS输出了一个正式的私有数据库,该数据库具有指示标准人口普查地理脊柱中的位置,该数据库由美国整体定义,该州,州,县,人口普查区,块组和人口普查。本文介绍了地理脊柱在DAS内部如何定义初始噪声测量结果如何影响输出数据库的准确性。具体而言,对于地理区域而言,地理区域的表往往是最准确的,而这两者都可以通过将内部脊柱的地理位置上的地理位置汇总在一起来得出1),而内部脊柱的地理单位和2)更接近内部脊柱的地理单位。在描述了与内部DAS地理脊柱选择相关的准确性权衡之后,我们提供了用于定义2020年人口普查生产DAS运行的设置。
The 2020 Census Disclosure Avoidance System (DAS) is a formally private mechanism that first adds independent noise to cross tabulations for a set of pre-specified hierarchical geographic units, which is known as the geographic spine. After post-processing these noisy measurements, DAS outputs a formally private database with fields indicating location in the standard census geographic spine, which is defined by the United States as a whole, states, counties, census tracts, block groups, and census blocks. This paper describes how the geographic spine used internally within DAS to define the initial noisy measurements impacts accuracy of the output database. Specifically, tabulations for geographic areas tend to be most accurate for geographic areas that both 1) can be derived by aggregating together geographic units above the block geographic level of the internal spine, and 2) are closer to the geographic units of the internal spine. After describing the accuracy tradeoffs relevant to the choice of internal DAS geographic spine, we provide the settings used to define the 2020 Census production DAS runs.