无监督的域自适应对象检测使用前后循环自适应

论文标题

无监督的域自适应对象检测使用前后循环自适应

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

论文作者

Yang, Siqi, Wu, Lin, Wiliem, Arnold, Lovell, Brian C.

论文摘要

我们提出了一种新的方法，可以通过前向循环（FBC）训练进行无监督的域适应以进行对象检测。最近基于对抗性训练的域适应方法显示了通过边际特征分布对齐对域差异最小化域差异的有效性。但是，对齐边缘特征分布并不能保证班级条件分布的对齐。与图像分类任务（例如一个图像中存在各种数量的对象，图像中的大多数内容是背景。这促使我们通过梯度对齐来学习类别级别语义的域不变性。直观地，如果两个域的梯度指向相似的方向，那么一个域的学习可以改善另一个领域的域。为了实现梯度对齐，我们提出了前向循环适应性，它通过向后跳，从源到目标进行适应性，并通过向前传递从目标到源。此外，我们将低水平的特征与通过对抗训练适应整体颜色/纹理的功能保持一致。但是，检测器在两个域上表现良好并不是目标域的理想选择。因此，在每个循环中，域的多样性是通过对源域上的最大熵正则化来实现的，以惩罚对目标域的自信源特定学习和最小熵正则化，以使目标特定于目标特异性学习。提供了训练过程的理论分析，并且有关挑战性跨域对象检测数据集的广泛实验表明，我们的方法优于最新方法。

We present a novel approach to perform the unsupervised domain adaptation for object detection through forward-backward cyclic (FBC) training. Recent adversarial training based domain adaptation methods have shown their effectiveness on minimizing domain discrepancy via marginal feature distributions alignment. However, aligning the marginal feature distributions does not guarantee the alignment of class conditional distributions. This limitation is more evident when adapting object detectors as the domain discrepancy is larger compared to the image classification task, e.g. various number of objects exist in one image and the majority of content in an image is the background. This motivates us to learn domain invariance for category level semantics via gradient alignment. Intuitively, if the gradients of two domains point in similar directions, then the learning of one domain can improve that of another domain. To achieve gradient alignment, we propose Forward-Backward Cyclic Adaptation, which iteratively computes adaptation from source to target via backward hopping and from target to source via forward passing. In addition, we align low-level features for adapting holistic color/texture via adversarial training. However, the detector performs well on both domains is not ideal for target domain. As such, in each cycle, domain diversity is enforced by maximum entropy regularization on the source domain to penalize confident source-specific learning and minimum entropy regularization on target domain to intrigue target-specific learning. Theoretical analysis of the training process is provided, and extensive experiments on challenging cross-domain object detection datasets have shown the superiority of our approach over the state-of-the-art.

下载PDF全文

下载文献需遵守相关版权规定

论文标题