论文标题
混合摄像头:通过不确定性正则化的弱监督语义分割
Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization
论文作者
论文摘要
获得对象响应图是使用图像级标签实现弱监督语义分割的重要步骤。但是,现有方法依赖于分类任务,这可能会导致响应映射仅在歧视对象区域上参与,因为网络不需要看到整个对象来优化分类损失。为了解决此问题,我们提出了一个有原则的端到端火车框架,以允许网络注意对象的其他部分,同时生成更完整,更统一的响应图。具体而言,我们将混合数据增强方案介绍到分类网络中,并设计两个不确定性正则化术语,以更好地与混合策略进行互动。在实验中,我们进行了广泛的分析以证明所提出的方法并显示出对最新方法的有利性能。
Obtaining object response maps is one important step to achieve weakly-supervised semantic segmentation using image-level labels. However, existing methods rely on the classification task, which could result in a response map only attending on discriminative object regions as the network does not need to see the entire object for optimizing the classification loss. To tackle this issue, we propose a principled and end-to-end train-able framework to allow the network to pay attention to other parts of the object, while producing a more complete and uniform response map. Specifically, we introduce the mixup data augmentation scheme into the classification network and design two uncertainty regularization terms to better interact with the mixup strategy. In experiments, we conduct extensive analysis to demonstrate the proposed method and show favorable performance against state-of-the-art approaches.