论文标题
用一块石头杀死两只鸟:通过部分足球俱乐部对面部识别CNN的高效训练
Killing Two Birds with One Stone:Efficient and Robust Training of Face Recognition CNNs by Partial FC
论文作者
论文摘要
通过使用数百万尺度的野外数据集和基于保证金的软效果损失,学习判别性深度嵌入是面部识别的最新方法。但是,完全连接的(FC)层的内存和计算成本线性扩展到训练集中的身份数量。此外,大规模的培训数据不可避免地会遭受阶层间冲突和长尾分配的困扰。在本文中,我们提出了FC层的稀疏更新变体,称为部分FC(PFC)。在每次迭代中,选择正类中心和负面类中心的随机子集以计算基于边缘的软磁损失。在整个培训过程中,所有班级中心仍然保持维护,但是在每次迭代中只选择并更新一个子集。因此,计算要求,阶层间冲突的概率以及尾巴中心中被动更新的频率大大降低。跨不同训练数据和骨架(例如CNN和VIT)进行的广泛实验证实了所提出的PFC的有效性,鲁棒性和效率。源代码可从\ https://github.com/deepinsight/insightface/tree/master/recognition获得。
Learning discriminative deep feature embeddings by using million-scale in-the-wild datasets and margin-based softmax loss is the current state-of-the-art approach for face recognition. However, the memory and computing cost of the Fully Connected (FC) layer linearly scales up to the number of identities in the training set. Besides, the large-scale training data inevitably suffers from inter-class conflict and long-tailed distribution. In this paper, we propose a sparsely updating variant of the FC layer, named Partial FC (PFC). In each iteration, positive class centers and a random subset of negative class centers are selected to compute the margin-based softmax loss. All class centers are still maintained throughout the whole training process, but only a subset is selected and updated in each iteration. Therefore, the computing requirement, the probability of inter-class conflict, and the frequency of passive update on tail class centers, are dramatically reduced. Extensive experiments across different training data and backbones (e.g. CNN and ViT) confirm the effectiveness, robustness and efficiency of the proposed PFC. The source code is available at \https://github.com/deepinsight/insightface/tree/master/recognition.