使用NDCG和相互等级指标评估神经网络的防御和攻击

论文标题

使用NDCG和相互等级指标评估神经网络的防御和攻击

Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank Metrics

论文作者

Brama, Haya, Dery, Lihi, Grinshpoun, Tal

论文摘要

最近，通过输入修改（即对抗性示例）对神经网络的攻击问题最近引起了很多关注。这些攻击相对容易生成和难以检测，因此构成了许多建议的防御试图减轻的安全漏洞。但是，对攻击和防御效果的评估通常取决于传统的分类指标，而没有适应对抗场景的适应性。这些指标大多数都是基于精度的，因此可能具有有限的范围和低独特的功率。其他指标不考虑神经网络功能的独特特征，也不考虑间接测量攻击的效果（例如，通过其一代的复杂性）。在本文中，我们提出了两个指标，这些指标旨在衡量攻击的影响或防御效应对多类分类任务中神经网络输出的效果。受到信息检索文献中使用的归一化累积累积收益和相互等级指标的启发，我们将神经网络预测视为排名的结果列表。使用有关等级概率的其他信息，使我们能够定义适合手头任务的新型指标。我们使用验证的VGG19模型和ImageNet数据集在使用各种攻击和防御措施中评估指标。与共同的分类指标相比，我们提出的指标表现出较高的信息性和独特性。

The problem of attacks on neural networks through input modification (i.e., adversarial examples) has attracted much attention recently. Being relatively easy to generate and hard to detect, these attacks pose a security breach that many suggested defenses try to mitigate. However, the evaluation of the effect of attacks and defenses commonly relies on traditional classification metrics, without adequate adaptation to adversarial scenarios. Most of these metrics are accuracy-based, and therefore may have a limited scope and low distinctive power. Other metrics do not consider the unique characteristics of neural networks functionality, or measure the effect of the attacks indirectly (e.g., through the complexity of their generation). In this paper, we present two metrics which are specifically designed to measure the effect of attacks, or the recovery effect of defenses, on the output of neural networks in multiclass classification tasks. Inspired by the normalized discounted cumulative gain and the reciprocal rank metrics used in information retrieval literature, we treat the neural network predictions as ranked lists of results. Using additional information about the probability of the rank enabled us to define novel metrics that are suited to the task at hand. We evaluate our metrics using various attacks and defenses on a pretrained VGG19 model and the ImageNet dataset. Compared to the common classification metrics, our proposed metrics demonstrate superior informativeness and distinctiveness.

下载PDF全文

下载文献需遵守相关版权规定

论文标题