私有分区的变分推断

论文标题

私有分区的变分推断

Differentially private partitioned variational inference

论文作者

Heikkilä, Mikko A., Ashman, Matthew, Swaroop, Siddharth, Turner, Richard E., Honkela, Antti

论文摘要

从敏感数据中学习隐私的模型，这些模型分布在多个设备上是一个越来越重要的问题。这个问题通常是在联邦学习环境中提出的，目的是在保持数据分布的同时学习单个全球模型。此外，贝叶斯学习是建模的一种流行方法，因为它自然支持可靠的不确定性估计。但是，即使使用集中的非私有数据，贝叶斯学习通常也很棘手，因此近似技术（例如变异推断）是必要的。最近通过分区的变异推理算法扩展了变异推理到非私有联盟学习设置。为了保护隐私，当前的黄金标准称为差异隐私。差异隐私在强大的数学上明确定义的意义上保证了隐私。在本文中，我们介绍了差异化私有分区的变异推断，这是学习与联合学习环境中贝叶斯后验分布的差异近似的第一个一般框架，同时最大程度地减少了通信巡回赛的数量并为数据主体提供差异隐私保证。我们在通用框架中提出了三个替代实现，一个基于单个方面的本地优化运行，而两个基于对全球模型的扰动更新（一种使用联合平均版本，第二个将虚拟方添加到协议中添加虚拟方面的版本），并在理论和经验上比较其属性。

Learning a privacy-preserving model from sensitive data which are distributed across multiple devices is an increasingly important problem. The problem is often formulated in the federated learning context, with the aim of learning a single global model while keeping the data distributed. Moreover, Bayesian learning is a popular approach for modelling, since it naturally supports reliable uncertainty estimates. However, Bayesian learning is generally intractable even with centralised non-private data and so approximation techniques such as variational inference are a necessity. Variational inference has recently been extended to the non-private federated learning setting via the partitioned variational inference algorithm. For privacy protection, the current gold standard is called differential privacy. Differential privacy guarantees privacy in a strong, mathematically clearly defined sense. In this paper, we present differentially private partitioned variational inference, the first general framework for learning a variational approximation to a Bayesian posterior distribution in the federated learning setting while minimising the number of communication rounds and providing differential privacy guarantees for data subjects. We propose three alternative implementations in the general framework, one based on perturbing local optimisation runs done by individual parties, and two based on perturbing updates to the global model (one using a version of federated averaging, the second one adding virtual parties to the protocol), and compare their properties both theoretically and empirically.

下载PDF全文

下载文献需遵守相关版权规定

论文标题