论文标题
在本地隐私下进行稀疏分配估算的压缩感应方法
Compressive Sensing Approaches for Sparse Distribution Estimation Under Local Privacy
论文作者
论文摘要
近年来,许多Web服务提供商都采用了当地的差异隐私(LDP),例如Google \ cite {erlingsson2014rappor},Apple \ cite {Apple2017privacy}和Microsoft \ cite {Bolin2017telemetry},以收集和分析数据pricental'pricental'privational'privational'privational'pricental。在本文中,我们考虑了在当地差异隐私约束下的离散分配估计问题。分配估计是最基本的估计问题之一,在非私有设置和私人环境中都进行了广泛的研究。在本地模型中,已知具有最佳样品复杂性的私人机制。但是,它们仅在最糟糕的意义上才是最佳选择。它们的样本复杂性与整个宇宙的大小成正比,这在实践中可能是巨大的。在本文中,我们认为稀疏或大约稀疏(例如\ \偏斜)的分布,并表明所需的样品数量可以大大减少。最近研究了这个问题\ cite {acharya2021估算},但它们仅考虑严格的稀疏分布和高隐私制度。我们提出了基于压缩感应的新私有化机制。我们的方法可用于大约稀疏的分布和中等隐私,并具有最佳的样本和沟通复杂性。
Recent years, local differential privacy (LDP) has been adopted by many web service providers like Google \cite{erlingsson2014rappor}, Apple \cite{apple2017privacy} and Microsoft \cite{bolin2017telemetry} to collect and analyse users' data privately. In this paper, we consider the problem of discrete distribution estimation under local differential privacy constraints. Distribution estimation is one of the most fundamental estimation problems, which is widely studied in both non-private and private settings. In the local model, private mechanisms with provably optimal sample complexity are known. However, they are optimal only in the worst-case sense; their sample complexity is proportional to the size of the entire universe, which could be huge in practice. In this paper, we consider sparse or approximately sparse (e.g.\ highly skewed) distribution, and show that the number of samples needed could be significantly reduced. This problem has been studied recently \cite{acharya2021estimating}, but they only consider strict sparse distributions and the high privacy regime. We propose new privatization mechanisms based on compressive sensing. Our methods work for approximately sparse distributions and medium privacy, and have optimal sample and communication complexity.