H2RBox：水平盒注释是您所需的面向对象检测的全部

论文标题

H2RBox：水平盒注释是您所需的面向对象检测的全部

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection

论文作者

Yang, Xue, Zhang, Gefan, Li, Wentong, Wang, Xuehui, Zhou, Yue, Yan, Junchi

论文摘要

从航空图像到自动驾驶的许多应用中出现了定向的对象检测，而许多现有的检测基准是用水平边界盒注释的，而水平边界盒仅与细粒度旋转的盒子相比，这也比细粒度的旋转盒子较低，从而导致易于使用的训练语料库和对定向对象检测的上升需求之间的差距。本文提出了一种简单但有效的对象检测方法，称为H2RBOX仅使用水平盒子注释来进行弱监督训练，该训练缩小了上述差距，即使在接受过旋转盒子训练的训练的培训的培训中，也表现出竞争性能。我们方法的核心是弱和自制的学习，它通过学习两个不同观点的一致性来预测对象的角度。据我们所知，H2RBOX是第一个基于水平盒子注释的面向对象检测器。与替代方案（即对定向对象检测的适应后的水平盒子监督实例分割）相比，我们的方法不容易受到掩码的预测质量，并且可以在包含大量密集对象和异差群的复杂场景中表现出更强的性能。实验结果表明，H2RBOX在水平盒子监督实例分割方法以及较低的内存需求方面具有显着的性能和速度优势。与旋转的盒子监督对象探测器相比，我们的方法显示出非常接近的性能和速度。 The source code is available at PyTorch-based \href{https://github.com/yangxue0827/h2rbox-mmrotate}{MMRotate} and Jittor-based \href{https://github.com/yangxue0827/h2rbox-jittor}{JDet}.

Oriented object detection emerges in many applications from aerial images to autonomous driving, while many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box, leading to a gap between the readily available training corpus and the rising demand for oriented object detection. This paper proposes a simple yet effective oriented object detection approach called H2RBox merely using horizontal box annotation for weakly-supervised training, which closes the above gap and shows competitive performance even against those trained with rotated boxes. The cores of our method are weakly- and self-supervised learning, which predicts the angle of the object by learning the consistency of two different views. To our best knowledge, H2RBox is the first horizontal box annotation-based oriented object detector. Compared to an alternative i.e. horizontal box-supervised instance segmentation with our post adaption to oriented object detection, our approach is not susceptible to the prediction quality of mask and can perform more robustly in complex scenes containing a large number of dense objects and outliers. Experimental results show that H2RBox has significant performance and speed advantages over horizontal box-supervised instance segmentation methods, as well as lower memory requirements. While compared to rotated box-supervised oriented object detectors, our method shows very close performance and speed. The source code is available at PyTorch-based \href{https://github.com/yangxue0827/h2rbox-mmrotate}{MMRotate} and Jittor-based \href{https://github.com/yangxue0827/h2rbox-jittor}{JDet}.

下载PDF全文

下载文献需遵守相关版权规定

论文标题