【AIGC One Sentence Reading】:本文研究基于掩码一致性机制的弱监督图像语义分割,通过引入掩码一致性机制提供额外监督信号,缩小全监督与弱监督差距,在相同监督水平下优于现有方法,实验效果良好。
【AIGC Short Abstract】:本文研究了基于掩码一致性机制的弱监督图像语义分割,旨在利用易获取的图像级标签降低标注成本。通过提出掩码一致性机制,为网络训练提供额外自监督信号,缩小全监督与弱监督之间的差距。实验表明,该方法在相同监督水平下优于最先进方法,具有广泛应用前景。
Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.
Abstract
Semantic segmentation is a computer vision technology widely used in scenarios such as unmanned driving and defect detection, but the fine-grained annotation at the pixel level requires a huge annotation cost. Therefore, how to use the easily obtained image-level labels for weakly supervised semantic segmentation is the focus of long-standing research. Compared with pixel-level segmentation based on a class activation maps (CAM), a masked consistency mechanism (MCM) was proposed to provide additional supervision signals to narrow the gap between full supervision and weakly supervision. In the fully supervised semantic segmentation, the network had consistent pixel-level segmentation supervision for mask prediction of each patch of the image, so some patches were masked out in vision transformer (ViT) and it was required that the CAMs generated by the retained patches should be consistent with the CAMs generated by the complete images to provide additional self-supervision signals for network training. Experiments on PASCAL VOC 2012 and MS COCO show that the proposed method is superior to the most advanced method using the same level of supervision.