分布不匹配的半监督深度学习图像分类：调查

论文标题

分布不匹配的半监督深度学习图像分类：调查

Semi-supervised Deep Learning for Image Classification with Distribution Mismatch: A Survey

论文作者

Calderon-Ramirez, Saul, Yang, Shengxiang, Elizondo, David

论文摘要

深度学习方法已在几个不同的领域中采用，在图像识别应用方面取得了杰出的成功，例如材料质量控制，医学成像，自动驾驶等。深度学习模型依赖于众多标记的观测值来培训预期模型。这些模型由数百万参数组成，以估计，增加了对更多训练观察结果的需求。通常，收集标记的数据观察结果是昂贵的，使深度学习模型的使用并不理想，因为该模型可能会超越数据。在半监督的设置中，未标记的数据用于提高具有小标记数据集的模型的准确性和概括。然而，在许多情况下，可能有不同的未标记数据源。这增加了标记和未标记数据集之间分布不匹配的严重分布不匹配的风险。这种现象会引起典型的半监督深度学习框架的巨大性能，这通常认为标记和未标记的数据集是从类似分布中汲取的。因此，在本文中，我们研究了半监督深度学习的最新方法，以进行图像识别。重点是在半监督的深度学习模型中进行的，旨在处理标记和未标记数据集之间的分布不匹配。我们应对开放挑战的目的是鼓励社区解决这些挑战，并克服在现实世界中使用设置下传统深度学习管道的高数据需求。

Deep learning methodologies have been employed in several different fields, with an outstanding success in image recognition applications, such as material quality control, medical imaging, autonomous driving, etc. Deep learning models rely on the abundance of labelled observations to train a prospective model. These models are composed of millions of parameters to estimate, increasing the need of more training observations. Frequently it is expensive to gather labelled observations of data, making the usage of deep learning models not ideal, as the model might over-fit data. In a semi-supervised setting, unlabelled data is used to improve the levels of accuracy and generalization of a model with small labelled datasets. Nevertheless, in many situations different unlabelled data sources might be available. This raises the risk of a significant distribution mismatch between the labelled and unlabelled datasets. Such phenomena can cause a considerable performance hit to typical semi-supervised deep learning frameworks, which often assume that both labelled and unlabelled datasets are drawn from similar distributions. Therefore, in this paper we study the latest approaches for semi-supervised deep learning for image recognition. Emphasis is made in semi-supervised deep learning models designed to deal with a distribution mismatch between the labelled and unlabelled datasets. We address open challenges with the aim to encourage the community to tackle them, and overcome the high data demand of traditional deep learning pipelines under real-world usage settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题