多标签分类：锤击损失和子集准确性是否真的相互冲突？

论文标题

多标签分类：锤击损失和子集准确性是否真的相互冲突？

Multi-label classification: do Hamming loss and subset accuracy really conflict with each other?

论文作者

Wu, Guoqiang, Zhu, Jun

论文摘要

已经为多标签分类制定了各种评估措施，包括锤损失（HL），子集准确性（SA）和排名损失（RL）。但是，经验结果与现有理论之间存在差距：1）算法在某种程度上通常在某种程度上表现良好，而在其他方面则很差，而缺乏正式的理论分析； 2）在小标签空间案例中，优化HL的算法在SA度量上通常具有可比性甚至更好的性能，而现有的理论结果表明SA和HL是相互矛盾的措施。本文通过分析SA和HL措施的相应学习算法的学习保证来填补这一空白。我们表明，当学习算法通过其替代损失优化HL时，它在HL量度限制的错误中独立于$ c $（标签的数量），而SA量的限制最多取决于$ O（c）$。另一方面，当直接通过其替代损失优化SA时，它的学习确保了HL和SA措施依赖于$ O（\ sqrt {C}）$。这解释了这样的观察，即当标签空间不大时，用替代损失优化HL可以为SA具有有希望的性能。我们进一步表明，我们的技术适用于分析其他措施（例如RL）算法的学习保证。最后，实验结果支持理论分析。

Various evaluation measures have been developed for multi-label classification, including Hamming Loss (HL), Subset Accuracy (SA) and Ranking Loss (RL). However, there is a gap between empirical results and the existing theories: 1) an algorithm often empirically performs well on some measure(s) while poorly on others, while a formal theoretical analysis is lacking; and 2) in small label space cases, the algorithms optimizing HL often have comparable or even better performance on the SA measure than those optimizing SA directly, while existing theoretical results show that SA and HL are conflicting measures. This paper provides an attempt to fill up this gap by analyzing the learning guarantees of the corresponding learning algorithms on both SA and HL measures. We show that when a learning algorithm optimizes HL with its surrogate loss, it enjoys an error bound for the HL measure independent of $c$ (the number of labels), while the bound for the SA measure depends on at most $O(c)$. On the other hand, when directly optimizing SA with its surrogate loss, it has learning guarantees that depend on $O(\sqrt{c})$ for both HL and SA measures. This explains the observation that when the label space is not large, optimizing HL with its surrogate loss can have promising performance for SA. We further show that our techniques are applicable to analyze the learning guarantees of algorithms on other measures, such as RL. Finally, the theoretical analyses are supported by experimental results.

下载PDF全文

下载文献需遵守相关版权规定

论文标题