Yolov4在空中图像中的对象检测的分析和适应

论文标题

Yolov4在空中图像中的对象检测的分析和适应

Analysis and Adaptation of YOLOv4 for Object Detection in Aerial Images

论文作者

Samyal, Aryaman Singh, R, Akshatha K, Hans, Soham, K, Karunakar A, B, Satish Shenoy

论文摘要

各种计算机视觉任务的无人机部署（UAV）部署的最新和快速增长为使它们更有效和有价值的许多机会铺平了道路。由于外观，姿势和尺度的变化，空中图像中的对象检测具有挑战性。具有继承的有限内存和计算能力需求的自主空中飞行系统准确且计算有效的检测算法，用于实时应用。我们的工作显示了流行的Yolov4框架的改编，以高精度和推理速度以空中图像预测对象及其位置。我们利用传输学习来更快地收敛于Vistrone DET空中对象检测数据集。训练有素的模型导致平均平均精度（MAP）为45.64％，推理速度在Tesla K80 GPU上达到8.7 fps，并且在检测截短和遮挡的物体方面非常准确。我们通过实验评估了不同网络分辨率大小和培训时期对性能的影响。一项与几个当代空中对象探测器的比较研究证明，Yolov4的性能更好，这意味着更合适的检测算法可以在空中平台上融合。

The recent and rapid growth in Unmanned Aerial Vehicles (UAVs) deployment for various computer vision tasks has paved the path for numerous opportunities to make them more effective and valuable. Object detection in aerial images is challenging due to variations in appearance, pose, and scale. Autonomous aerial flight systems with their inherited limited memory and computational power demand accurate and computationally efficient detection algorithms for real-time applications. Our work shows the adaptation of the popular YOLOv4 framework for predicting the objects and their locations in aerial images with high accuracy and inference speed. We utilized transfer learning for faster convergence of the model on the VisDrone DET aerial object detection dataset. The trained model resulted in a mean average precision (mAP) of 45.64% with an inference speed reaching 8.7 FPS on the Tesla K80 GPU and was highly accurate in detecting truncated and occluded objects. We experimentally evaluated the impact of varying network resolution sizes and training epochs on the performance. A comparative study with several contemporary aerial object detectors proved that YOLOv4 performed better, implying a more suitable detection algorithm to incorporate on aerial platforms.

下载PDF全文

下载文献需遵守相关版权规定

论文标题