论文标题
矛盾的人:无监督域适应的Vapnik必须
Contradistinguisher: A Vapnik's Imperative to Unsupervised Domain Adaptation
论文作者
论文摘要
据信,同时监督的不可监视的学习是人类在多个领域或任务中无缝执行任务的关键。这种跨域学习现象在领域适应文献中进行了很好的研究。最近的域适应性工作依赖于首先对齐源和目标域分布,然后在标记的源域上训练分类器以对目标域进行分类的间接方式。但是,这种方法的主要缺点是,在本身上获得近乎完美的域对齐可能是困难/不可能的(例如,语言域)。为了解决这个问题,我们遵循Vapnik对统计学习的要求,即应以最直接的方式解决任何所需的问题,而不是解决更一般的中间任务,并提出一种直接的域适应方法,而不需要域名。我们提出了一个引用矛盾的模型,该模型学习对比特征,其目标是以无监督的方式共同学会与未标记的目标域进行矛盾,并在源域上以有监督的方式进行分类。我们在单一源和多源设置上都在Office-31和Visda-2017数据集上实现了最新的数据集。我们还注意到,矛盾的损失通过增加形状偏差来改善模型性能。
A complex combination of simultaneous supervised-unsupervised learning is believed to be the key to humans performing tasks seamlessly across multiple domains or tasks. This phenomenon of cross-domain learning has been very well studied in domain adaptation literature. Recent domain adaptation works rely on an indirect way of first aligning the source and target domain distributions and then train a classifier on the labeled source domain to classify the target domain. However, this approach has the main drawback that obtaining a near-perfect alignment of the domains in itself might be difficult/impossible (e.g., language domains). To address this, we follow Vapnik's imperative of statistical learning that states any desired problem should be solved in the most direct way rather than solving a more general intermediate task and propose a direct approach to domain adaptation that does not require domain alignment. We propose a model referred Contradistinguisher that learns contrastive features and whose objective is to jointly learn to contradistinguish the unlabeled target domain in an unsupervised way and classify in a supervised way on the source domain. We achieve the state-of-the-art on Office-31 and VisDA-2017 datasets in both single-source and multi-source settings. We also notice that the contradistinguish loss improves the model performance by increasing the shape bias.