fastx：基于无人机应用程序的边缘GPU的实时对象检测

论文标题

fastx：基于无人机应用程序的边缘GPU的实时对象检测

FasterX: Real-Time Object Detection Based on Edge GPUs for UAV Applications

论文作者

Zhou, Wei, Min, Xuanlin, Hu, Rui, Long, Yiwen, Luo, Huan, JunYi

论文摘要

无人驾驶飞机（UAV）的实时对象检测是一个具有挑战性的问题，因为Edge GPU设备作为物联网（IoT）节点的计算资源有限。为了解决这个问题，在本文中，我们提出了一种基于yoLox模型的新型轻型深度学习体系结构，用于Edge GPU上的实时对象检测。首先，我们设计了一个有效且轻巧的Pixsf头，以更换Yolox的原始头部以更好地检测小物体，可以将其进一步嵌入深度可分离的卷积（DS Conv）中，以达到更轻的头。然后，开发为减少网络参数的颈层中的较小结构，这是精度和速度之间的权衡。此外，我们将注意模块嵌入了头层，以改善预测头的特征提取效果。同时，我们还改进了标签分配策略和损失功能，以减轻无人机数据集的类别不平衡和盒子优化问题。最后，提出了辅助头进行在线蒸馏，以提高pixsf头部嵌入位置嵌入和特征提取的能力。我们的轻质模型的性能在NVIDIA Jetson NX和Jetson Nano GPU嵌入平台上进行了实验验证。扩展的实验表明，与目前的模型相比，FASTELX模型在Visdrone2021数据集中实现了在Visdrone2021数据集中的准确性和延迟之间的折衷。

Real-time object detection on Unmanned Aerial Vehicles (UAVs) is a challenging issue due to the limited computing resources of edge GPU devices as Internet of Things (IoT) nodes. To solve this problem, in this paper, we propose a novel lightweight deep learning architectures named FasterX based on YOLOX model for real-time object detection on edge GPU. First, we design an effective and lightweight PixSF head to replace the original head of YOLOX to better detect small objects, which can be further embedded in the depthwise separable convolution (DS Conv) to achieve a lighter head. Then, a slimmer structure in the Neck layer termed as SlimFPN is developed to reduce parameters of the network, which is a trade-off between accuracy and speed. Furthermore, we embed attention module in the Head layer to improve the feature extraction effect of the prediction head. Meanwhile, we also improve the label assignment strategy and loss function to alleviate category imbalance and box optimization problems of the UAV dataset. Finally, auxiliary heads are presented for online distillation to improve the ability of position embedding and feature extraction in PixSF head. The performance of our lightweight models are validated experimentally on the NVIDIA Jetson NX and Jetson Nano GPU embedded platforms.Extensive experiments show that FasterX models achieve better trade-off between accuracy and latency on VisDrone2021 dataset compared to state-of-the-art models.

下载PDF全文

下载文献需遵守相关版权规定

论文标题