论文标题
从具有模型验证的卷积规则的二进制神经网络中提取规则
Rule Extraction from Binary Neural Networks with Convolutional Rules for Model Validation
论文作者
论文摘要
大多数深度神经网络被认为是黑匣子,这意味着它们的输出很难解释。相比之下,逻辑表达式被认为更容易理解,因为它们使用了在语义上接近自然语言而不是分布式表示的符号。但是,对于高维输入数据(例如图像),单个符号(即像素)不容易解释。我们介绍了一阶卷积规则的概念,这些规则是可以使用卷积神经网络(CNN)提取的逻辑规则,其复杂性取决于卷积过滤器的大小,而不是输入的维度。我们的方法是基于从二进制神经网络中提取的,具有随机局部搜索。我们展示了如何提取不一定短的规则,而是输入的特征,并且易于可视化。我们的实验表明,所提出的方法能够对神经网络的功能进行建模,同时生成可解释的逻辑规则。
Most deep neural networks are considered to be black boxes, meaning their output is hard to interpret. In contrast, logical expressions are considered to be more comprehensible since they use symbols that are semantically close to natural language instead of distributed representations. However, for high-dimensional input data such as images, the individual symbols, i.e. pixels, are not easily interpretable. We introduce the concept of first-order convolutional rules, which are logical rules that can be extracted using a convolutional neural network (CNN), and whose complexity depends on the size of the convolutional filter and not on the dimensionality of the input. Our approach is based on rule extraction from binary neural networks with stochastic local search. We show how to extract rules that are not necessarily short, but characteristic of the input, and easy to visualize. Our experiments show that the proposed approach is able to model the functionality of the neural network while at the same time producing interpretable logical rules.