论文标题
深层单纯形分类器,以最大程度地提高欧几里得和角空间的边缘
Deep Simplex Classifier for Maximizing the Margin in Both Euclidean and Angular Spaces
论文作者
论文摘要
深度神经网络分类器中使用的分类损失函数可以基于最大化欧几里得或角空间的边缘分为两类。在分类过程中使用样品载体之间的欧几里得距离,用于最大化欧几里得空间缘的方法,而余弦相似性距离在测试阶段使用的余弦相似性距离用于最大程度地提高角空间中边缘的方法。本文引入了一种新的分类损失,同时使欧几里得和角空间的边缘最大化。这样,欧几里得和余弦距离将产生相似和一致的结果并相互补充,这又可以提高精度。提出的损失函数强制执行类的样本,以将代表它们的中心聚集。近似类近似类的中心是从超球的边界中选择的,并且班级中心之间的成对距离始终是等效的。该限制对应于从常规单纯形的顶点选择中心。用户在建议的损失功能中必须设置任何超参数,因此,对于经典的分类问题而言,建议的方法非常容易。此外,由于类样品围绕其相应的均值聚集,因此所提出的分类器也非常适合开放式识别问题,其中测试样本可以来自训练阶段未见的未知类别。实验研究表明,尽管它简单,但提出的方法仍达到了开放式识别的最新精度。
The classification loss functions used in deep neural network classifiers can be grouped into two categories based on maximizing the margin in either Euclidean or angular spaces. Euclidean distances between sample vectors are used during classification for the methods maximizing the margin in Euclidean spaces whereas the Cosine similarity distance is used during the testing stage for the methods maximizing margin in the angular spaces. This paper introduces a novel classification loss that maximizes the margin in both the Euclidean and angular spaces at the same time. This way, the Euclidean and Cosine distances will produce similar and consistent results and complement each other, which will in turn improve the accuracies. The proposed loss function enforces the samples of classes to cluster around the centers that represent them. The centers approximating classes are chosen from the boundary of a hypersphere, and the pairwise distances between class centers are always equivalent. This restriction corresponds to choosing centers from the vertices of a regular simplex. There is not any hyperparameter that must be set by the user in the proposed loss function, therefore the use of the proposed method is extremely easy for classical classification problems. Moreover, since the class samples are compactly clustered around their corresponding means, the proposed classifier is also very suitable for open set recognition problems where test samples can come from the unknown classes that are not seen in the training phase. Experimental studies show that the proposed method achieves the state-of-the-art accuracies on open set recognition despite its simplicity.