论文标题

通过加权交叉熵损失解决对象检测中的类不平衡

Resolving Class Imbalance in Object Detection with Weighted Cross Entropy Losses

论文作者

Phan, Trong Huy, Yamamoto, Kazuma

论文摘要

对象检测是计算机视觉中的一项重要任务,它为许多现实世界应用,例如自动驾驶,监视和机器人技术提供服务。在过去的十年中,开发了大量最先进的广义对象探测器(例如,R-CNN,Yolo,SSD速度更快)。尽管在模型修改和提高培训策略方面的改进方面的持续努力以提高检测准确性,但在探测器的性能方面仍然存在局限性,而对于具有不均匀对象类别的专门数据集。这起源于对象分类子任务的跨熵损失函数的常见用法,该功能只是忽略了训练过程中对象类的外观频率,因此导致对象类的准确性较低,而对象类别具有较少的样品。一般机器学习中的类不平衡已经进行了广泛的研究,但是,几乎没有关注对象检测的主题。在本文中,我们建议通过应用几种跨熵损失的加权变体来探索和克服此问题,例如,基于对对象检测器的有效数量样品数量,平衡的交叉熵,焦点损失和平衡损失。使用BDD100K的实验(从车载摄像机中获得的一项高度级别不平衡的驾驶数据库,这些摄像机主要捕获主要是汽车级对象和其他少数族裔对象类,例如总线,人和运动),已经证明了通过上述损失功能训练的探测器的更好的探测器表演。

Object detection is an important task in computer vision which serves a lot of real-world applications such as autonomous driving, surveillance and robotics. Along with the rapid thrive of large-scale data, numerous state-of-the-art generalized object detectors (e.g. Faster R-CNN, YOLO, SSD) were developed in the past decade. Despite continual efforts in model modification and improvement in training strategies to boost detection accuracy, there are still limitations in performance of detectors when it comes to specialized datasets with uneven object class distributions. This originates from the common usage of Cross Entropy loss function for object classification sub-task that simply ignores the frequency of appearance of object class during training, and thus results in lower accuracies for object classes with fewer number of samples. Class-imbalance in general machine learning has been widely studied, however, little attention has been paid on the subject of object detection. In this paper, we propose to explore and overcome such problem by application of several weighted variants of Cross Entropy loss, for examples Balanced Cross Entropy, Focal Loss and Class-Balanced Loss Based on Effective Number of Samples to our object detector. Experiments with BDD100K (a highly class-imbalanced driving database acquired from on-vehicle cameras capturing mostly Car-class objects and other minority object classes such as Bus, Person and Motor) have proven better class-wise performances of detector trained with the afore-mentioned loss functions.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源