论文标题

Monopair:使用成对空间关系的单眼3D对象检测

MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships

论文作者

Chen, Yongjian, Tai, Lei, Sun, Kai, Li, Mingyang

论文摘要

单眼3D对象检测是自主驾驶的重要组成部分,同时又有挑战性解决方案,尤其是对于那些仅部分可见的遮挡样品。大多数检测器将每个3D对象视为一个独立的训练目标,不可避免地会导致缺乏封闭样品的有用信息。为此,我们提出了一种新颖的方法来通过考虑配对样品的关系来改善单眼3D对象检测。这使我们可以针对其相邻邻居的部分封闭对象编码空间约束。具体而言,提出的检测器计算对象位置的不确定性预测和相邻对象对的3D距离的预测,随后通过非线性最小二乘正式共同优化。最后,将一阶段的不确定性预测结构和优化模块专用于确保运行时效率。实验表明,我们的方法通过优于最新的竞争对手,尤其是对于硬样品而言,我们的方法在KITTI 3D检测基准上产生了最佳性能。

Monocular 3D object detection is an essential component in autonomous driving while challenging to solve, especially for those occluded samples which are only partially visible. Most detectors consider each 3D object as an independent training target, inevitably resulting in a lack of useful information for occluded samples. To this end, we propose a novel method to improve the monocular 3D object detection by considering the relationship of paired samples. This allows us to encode spatial constraints for partially-occluded objects from their adjacent neighbors. Specifically, the proposed detector computes uncertainty-aware predictions for object locations and 3D distances for the adjacent object pairs, which are subsequently jointly optimized by nonlinear least squares. Finally, the one-stage uncertainty-aware prediction structure and the post-optimization module are dedicatedly integrated for ensuring the run-time efficiency. Experiments demonstrate that our method yields the best performance on KITTI 3D detection benchmark, by outperforming state-of-the-art competitors by wide margins, especially for the hard samples.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源