论文标题
作为艺术材料的单词:用顺序甘斯生成绘画
Words as Art Materials: Generating Paintings with Sequential GANs
论文作者
论文摘要
使用生成对抗网络将文本描述转换为图像已成为流行的研究领域。近年来,已成功产生了视觉吸引力的图像。受这些研究的启发,我们研究了大方差数据集上的艺术图像的产生。该数据集包括具有变化的图像,例如形状,颜色和内容。图像中的这些变化提供了原创性,这是艺术本质的重要因素。我们作品的一个主要特征是,我们将关键字用作图像描述,而不是句子。作为网络体系结构,我们提出了一个顺序的生成对抗网络模型。该顺序模型的第一阶段处理矢量单词并创建基本图像,而下一个阶段的重点是创建高分辨率的艺术风格的图像,而无需使用单词矢量。为了应对甘恩斯的不稳定性质,我们提出了诸如Wasserstein损失,光谱归一化和Minibatch区分等技术的混合物。最终,我们能够生成具有多种样式的绘画图像。我们通过使用Fréchet成立距离评分评估了结果,并与186名参与者进行了用户研究。
Converting text descriptions into images using Generative Adversarial Networks has become a popular research area. Visually appealing images have been generated successfully in recent years. Inspired by these studies, we investigated the generation of artistic images on a large variance dataset. This dataset includes images with variations, for example, in shape, color, and content. These variations in images provide originality which is an important factor for artistic essence. One major characteristic of our work is that we used keywords as image descriptions, instead of sentences. As the network architecture, we proposed a sequential Generative Adversarial Network model. The first stage of this sequential model processes the word vectors and creates a base image whereas the next stages focus on creating high-resolution artistic-style images without working on word vectors. To deal with the unstable nature of GANs, we proposed a mixture of techniques like Wasserstein loss, spectral normalization, and minibatch discrimination. Ultimately, we were able to generate painting images, which have a variety of styles. We evaluated our results by using the Fréchet Inception Distance score and conducted a user study with 186 participants.