象牙：任务不可监督的关键点

论文标题

象牙：任务不可监督的关键点

TUSK: Task-Agnostic Unsupervised Keypoints

论文作者

Jin, Yuhe, Sun, Weiwei, Hosang, Jan, Trulls, Eduard, Yi, Kwang Moo

论文摘要

关键点学习的现有无监督方法在很大程度上取决于以下假设：特定的关键点类型（例如肘部，数字，抽象几何形状）仅在图像中出现一次。这极大地限制了它们的适用性，因为在应用未从未讨论或评估的方法之前必须隔离每个实例。因此，我们提出了一种新的方法来学习任务无关的，无监督的关键点（Tusk），可以处理多个实例。为了实现这一目标，我们使用单个热图检测到了多个热图的常用策略，而是专门针对特定的关键点类型，并通过群集实现了对关键点类型的无监督学习。具体来说，我们通过教导它们从一组稀疏的关键点及其描述符中重建图像来编码语义，并在其中被迫在学识渊博的原型围绕特征空间中形成不同的簇。这使我们的方法适合于更广泛的任务范围，而不是以前的任何无监督的关键点方法：我们在多个现代检测和分类，对象发现和地标检测方面展示了实验 - 与艺术状态相同，同时也可以处理多个实例。

Existing unsupervised methods for keypoint learning rely heavily on the assumption that a specific keypoint type (e.g. elbow, digit, abstract geometric shape) appears only once in an image. This greatly limits their applicability, as each instance must be isolated before applying the method-an issue that is never discussed or evaluated. We thus propose a novel method to learn Task-agnostic, UnSupervised Keypoints (TUSK) which can deal with multiple instances. To achieve this, instead of the commonly-used strategy of detecting multiple heatmaps, each dedicated to a specific keypoint type, we use a single heatmap for detection, and enable unsupervised learning of keypoint types through clustering. Specifically, we encode semantics into the keypoints by teaching them to reconstruct images from a sparse set of keypoints and their descriptors, where the descriptors are forced to form distinct clusters in feature space around learned prototypes. This makes our approach amenable to a wider range of tasks than any previous unsupervised keypoint method: we show experiments on multiple-instance detection and classification, object discovery, and landmark detection-all unsupervised-with performance on par with the state of the art, while also being able to deal with multiple instances.

下载PDF全文

下载文献需遵守相关版权规定

论文标题