兴奋性过失转向错误：基于激发神经元的深度学习测试框架

论文标题

兴奋性过失转向错误：基于激发神经元的深度学习测试框架

Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable Neurons

论文作者

Jin, Haibo, Chen, Ruoxi, Zheng, Haibin, Chen, Jinyin, Cheng, Yao, Yu, Yue, Liu, Xianglong

论文摘要

尽管能力令人印象深刻和表现出色，但深层神经网络（DNN）由于经常发生的错误行为而引起了公众对其安全问题的日益关注。因此，有必要对DNN进行系统测试，然后再将其部署到现实世界应用程序中。现有的测试方法基于神经元覆盖范围提供了细粒度的指标，并提出了改善此类指标的各种方法。但是，已经逐渐意识到，较高的神经元覆盖范围\ textit {not}必然代表了在识别导致错误的缺陷方面更好的能力。此外，由于训练程序错误，覆盖范围引导的方法无法捕获错误。因此，通过这些测试示例将DNN的鲁棒性改善不令人满意。为了应对这一挑战，我们介绍了基于沙普利价值的可激发神经元的概念，并为DNN（即DeepSensor）设计了一种新颖的白盒测试框架。我们的观察结果是，由于潜在的缺陷，由于微弱扰动而造成更大责任造成模型损失变化的神经元更可能与不正确的角病例有关。通过最大化有关模型的各种错误行为的可激发神经元的数量，DeepSensor可以生成测试示例，从而有效触发由于对抗性输入，污染数据和不完整训练而导致更多错误。在两个图像分类模型和说话者识别模型上实施的广泛实验已经证明了DeepSensor的优势。

Despite impressive capabilities and outstanding performance, deep neural networks (DNNs) have captured increasing public concern about their security problems, due to their frequently occurred erroneous behaviors. Therefore, it is necessary to conduct a systematical testing for DNNs before they are deployed to real-world applications. Existing testing methods have provided fine-grained metrics based on neuron coverage and proposed various approaches to improve such metrics. However, it has been gradually realized that a higher neuron coverage does \textit{not} necessarily represent better capabilities in identifying defects that lead to errors. Besides, coverage-guided methods cannot hunt errors due to faulty training procedure. So the robustness improvement of DNNs via retraining by these testing examples are unsatisfactory. To address this challenge, we introduce the concept of excitable neurons based on Shapley value and design a novel white-box testing framework for DNNs, namely DeepSensor. It is motivated by our observation that neurons with larger responsibility towards model loss changes due to small perturbations are more likely related to incorrect corner cases due to potential defects. By maximizing the number of excitable neurons concerning various wrong behaviors of models, DeepSensor can generate testing examples that effectively trigger more errors due to adversarial inputs, polluted data and incomplete training. Extensive experiments implemented on both image classification models and speaker recognition models have demonstrated the superiority of DeepSensor.

下载PDF全文

下载文献需遵守相关版权规定

论文标题