论文标题
用于多类底眼病变细分的渐进多尺度一致网络
Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion Segmentation
论文作者
论文摘要
有效地整合多尺度信息对于挑战性多级分割的基础病变具有很大的意义,因为不同的病变在尺度和形状上有显着差异。已经提出了几种成功处理多尺度对象分割的方法。但是,在先前的研究中未考虑两个问题。首先是相邻特征水平之间缺乏相互作用,这将导致高级特征偏离低级特征和详细提示的丢失。第二个是低级和高级特征之间的冲突,这是因为它们学习了不同的功能范围,从而使模型混淆并降低了最终预测的准确性。在本文中,我们提出了一个渐进的多尺度一致网络(PMCNET),该网络(PMCNET)整合了提议的渐进式特征融合(PFF)块和动态注意块(DAB),以解决上述问题。具体而言,PFF块逐渐整合了相邻编码层中的多尺度功能,从而通过汇总细粒细节和高级语义来促进每一层的特征学习。由于不同尺度的功能应保持一致,因此DAB旨在通过不同尺度的融合功能动态学习细心的线索,从而旨在平滑多尺度功能中存在的基本冲突。可以将两个提出的PFF和DAB块与现成的骨干网络集成在一起,以解决多尺度的两个问题,并在底面病变的多级分段中进行特征不一致,这将在功能空间中产生更好的特征表示。三个公共数据集的实验结果表明,所提出的方法比最近的最新方法更有效。
Effectively integrating multi-scale information is of considerable significance for the challenging multi-class segmentation of fundus lesions because different lesions vary significantly in scales and shapes. Several methods have been proposed to successfully handle the multi-scale object segmentation. However, two issues are not considered in previous studies. The first is the lack of interaction between adjacent feature levels, and this will lead to the deviation of high-level features from low-level features and the loss of detailed cues. The second is the conflict between the low-level and high-level features, this occurs because they learn different scales of features, thereby confusing the model and decreasing the accuracy of the final prediction. In this paper, we propose a progressive multi-scale consistent network (PMCNet) that integrates the proposed progressive feature fusion (PFF) block and dynamic attention block (DAB) to address the aforementioned issues. Specifically, PFF block progressively integrates multi-scale features from adjacent encoding layers, facilitating feature learning of each layer by aggregating fine-grained details and high-level semantics. As features at different scales should be consistent, DAB is designed to dynamically learn the attentive cues from the fused features at different scales, thus aiming to smooth the essential conflicts existing in multi-scale features. The two proposed PFF and DAB blocks can be integrated with the off-the-shelf backbone networks to address the two issues of multi-scale and feature inconsistency in the multi-class segmentation of fundus lesions, which will produce better feature representation in the feature space. Experimental results on three public datasets indicate that the proposed method is more effective than recent state-of-the-art methods.