扩展的自适应缩放标度归一化以端到端图像压缩

论文标题

扩展的自适应缩放标度归一化以端到端图像压缩

Expanded Adaptive Scaling Normalization for End to End Image Compression

论文作者

Shin, Chajin, Lee, Hyeongmin, Son, Hanbin, Lee, Sangjin, Lee, Dogyoon, Lee, Sangyoun

论文摘要

最近，采用了利用卷积神经层的基于学习的图像压缩方法。重新缩放模块（例如批准归一化）通常用于卷积神经网络中的模块并不能适应各种输入。因此，广义可分割的归一化（GDN）已被广泛用于图像压缩中，以在空间轴和通道轴上自适应地重新定制输入特征。但是，GDN的代表权或自由度受到严重限制。另外，GDN不能考虑图像的空间相关性。为了处理GDN的局限性，我们构建了一种自适应缩放模块的扩展形式，该模块命名为扩展的自适应缩放标度归一化（EASN）。首先，我们利用Swish函数来提高表示能力。然后，我们增加了接受场，以使自适应重新恢复模块考虑空间相关。此外，我们引入了一个输入映射函数，以使模块具有更高的自由度。我们使用特征映射的可视化结果演示了我们的EASN在图像压缩网络中的工作方式，并且我们进行了广泛的实验，以表明我们的EASN显着提高了速率 - 距离性能，甚至以高比特率以优于VVC内部。

Recently, learning-based image compression methods that utilize convolutional neural layers have been developed rapidly. Rescaling modules such as batch normalization which are often used in convolutional neural networks do not operate adaptively for the various inputs. Therefore, Generalized Divisible Normalization(GDN) has been widely used in image compression to rescale the input features adaptively across both spatial and channel axes. However, the representation power or degree of freedom of GDN is severely limited. Additionally, GDN cannot consider the spatial correlation of an image. To handle the limitations of GDN, we construct an expanded form of the adaptive scaling module, named Expanded Adaptive Scaling Normalization(EASN). First, we exploit the swish function to increase the representation ability. Then, we increase the receptive field to make the adaptive rescaling module consider the spatial correlation. Furthermore, we introduce an input mapping function to give the module a higher degree of freedom. We demonstrate how our EASN works in an image compression network using the visualization results of the feature map, and we conduct extensive experiments to show that our EASN increases the rate-distortion performance remarkably, and even outperforms the VVC intra at a high bit rate.

下载PDF全文

下载文献需遵守相关版权规定

论文标题