伴侣：蒙面的自动编码器在线3D测试时间学习者

论文标题

伴侣：蒙面的自动编码器在线3D测试时间学习者

MATE: Masked Autoencoders are Online 3D Test-Time Learners

论文作者

Mirza, M. Jehanzeb, Shin, Inkyu, Lin, Wei, Schriebl, Andreas, Sun, Kunyang, Choe, Jaesung, Possegger, Horst, Kozinski, Mateusz, Kweon, In So, Yoon, Kun-Jin, Bischof, Horst

论文摘要

我们的伴侣是专为3D数据设计的第一个测试时间训练（TTT）方法，它使得对Point Cloud分类的深网进行了鲁棒的训练，可以在测试数据中发生分配变化。与2D图像域中的现有TTT方法一样，MATE还利用测试数据进行适应。它的测试时间目标是一个蒙版的自动编码器：每个测试点云的很大一部分在将其馈送到网络之前被删除，任务是重建全点云。网络更新后，它将用于对点云进行分类。我们在几个3D对象分类数据集上测试伴侣，并表明它可以显着提高深网的鲁棒性，以在3D点云中通常发生的几种类型的损坏。我们表明，伴侣在适应所需的点的比例方面非常有效。它可以有效地适应每个测试样品的几乎5％的令牌，从而使其非常轻巧。我们的实验表明，Mate还通过稀疏地调整测试数据来实现竞争性能，从而进一步降低了其计算开销，使其非常适合实时应用。

Our MATE is the first Test-Time-Training (TTT) method designed for 3D data, which makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data. Like existing TTT methods from the 2D image domain, MATE also leverages test data for adaptation. Its test-time objective is that of a Masked Autoencoder: a large portion of each test point cloud is removed before it is fed to the network, tasked with reconstructing the full point cloud. Once the network is updated, it is used to classify the point cloud. We test MATE on several 3D object classification datasets and show that it significantly improves robustness of deep networks to several types of corruptions commonly occurring in 3D point clouds. We show that MATE is very efficient in terms of the fraction of points it needs for the adaptation. It can effectively adapt given as few as 5% of tokens of each test sample, making it extremely lightweight. Our experiments show that MATE also achieves competitive performance by adapting sparsely on the test data, which further reduces its computational overhead, making it ideal for real-time applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题