论文标题
频率表示中卷积解码器网络的缺陷
Defects of Convolutional Decoder Networks in Frequency Representation
论文作者
论文摘要
在本文中,我们证明了级联卷积解码器网络的表示缺陷,考虑到表示输入样本的不同频率分量的能力。我们在解码器网络的中间层中对特征图的每个通道进行离散的傅立叶变换。然后,我们扩展了2D圆形卷积定理,以通过频域中的卷积层表示向前和向后传播。基于此,我们证明了表示特征频谱的三个缺陷。首先,我们证明卷积操作,零盖操作以及其他一组设置都使卷积解码器网络更有可能削弱高频组件。其次,我们证明UPSAPLING操作会生成特征频谱,其中强信号在某些频率上重复出现。第三,我们证明,如果输入样本中的频率分量和回归目标输出中的频率分量的变化很小,则通常无法有效地学习解码器。
In this paper, we prove the representation defects of a cascaded convolutional decoder network, considering the capacity of representing different frequency components of an input sample. We conduct the discrete Fourier transform on each channel of the feature map in an intermediate layer of the decoder network. Then, we extend the 2D circular convolution theorem to represent the forward and backward propagations through convolutional layers in the frequency domain. Based on this, we prove three defects in representing feature spectrums. First, we prove that the convolution operation, the zero-padding operation, and a set of other settings all make a convolutional decoder network more likely to weaken high-frequency components. Second, we prove that the upsampling operation generates a feature spectrum, in which strong signals repetitively appear at certain frequencies. Third, we prove that if the frequency components in the input sample and frequency components in the target output for regression have a small shift, then the decoder usually cannot be effectively learned.