论文标题
重新访问模型不足的私人学习:更快的速度和积极学习
Revisiting Model-Agnostic Private Learning: Faster Rates and Active Learning
论文作者
论文摘要
教师合奏(PATE)框架的私人汇总是差异私人学习中最有前途的方法之一。现有的理论分析表明,PATE在可实现的环境中始终学习任何VC级,但在更一般的情况下,最佳分类器的错误率远离零有限。我们通过引入Tsybakov噪声条件(TNC)来填补这一空白,并建立更强大,更容易解释的学习范围。这些界限为Pate何时工作并改善了现有结果,即使在狭窄的可实现环境中,这些界限也提供了新的见解。我们还调查了使用积极学习来节省隐私预算的令人信服的想法,并且经验研究表明了这一新想法的有效性。证明中的新组件包括对大多数投票分类器的更精致分析(可能具有独立的兴趣),并且观察到合成“学生”学习问题几乎可以通过Tsybakov噪声条件下的构造来实现。
The Private Aggregation of Teacher Ensembles (PATE) framework is one of the most promising recent approaches in differentially private learning. Existing theoretical analysis shows that PATE consistently learns any VC-classes in the realizable setting, but falls short in explaining its success in more general cases where the error rate of the optimal classifier is bounded away from zero. We fill in this gap by introducing the Tsybakov Noise Condition (TNC) and establish stronger and more interpretable learning bounds. These bounds provide new insights into when PATE works and improve over existing results even in the narrower realizable setting. We also investigate the compelling idea of using active learning for saving privacy budget, and empirical studies show the effectiveness of this new idea. The novel components in the proofs include a more refined analysis of the majority voting classifier - which could be of independent interest - and an observation that the synthetic "student" learning problem is nearly realizable by construction under the Tsybakov noise condition.