使用gans合成最低训练数据以生成深泡

论文标题

使用gans合成最低训练数据以生成深泡

Using GANs to Synthesise Minimum Training Data for Deepfake Generation

论文作者

Singh, Simranjeet, Sharma, Rajneesh, Smeaton, Alan F.

论文摘要

在计算机视觉，自然语言处理，语音综合等领域中，生成性对抗网络（GAN）有许多应用。毫无疑问，最值得注意的结果是在图像综合领域，尤其是在深层视频的产生中。尽管Deepfakes获得了许多负面的媒体报道，但它们在娱乐，客户关系甚至辅助护理等应用中可能是有用的技术。生成深击的一个问题是对主题的大量图像培训数据的要求，如果主题是已经存在许多图像的名人，那不是问题。如果只有少量的训练图像，那么深色的质量将很差。一些媒体报道表明，可以用只有500张图像制作出良好的深层味道，但是在实践中，优质的深击需要数千张图像，这是名人和政客们变得如此受欢迎的原因之一。在这项研究中，我们利用gan的特性来产生带有可变面部表情的个体图像，然后用来产生深层味。我们观察到，由于合成GAN生成的训练图像的面部表情如此差异，并且数量减少了，我们可以制作一个近乎现实的深层捕获视频。

There are many applications of Generative Adversarial Networks (GANs) in fields like computer vision, natural language processing, speech synthesis, and more. Undoubtedly the most notable results have been in the area of image synthesis and in particular in the generation of deepfake videos. While deepfakes have received much negative media coverage, they can be a useful technology in applications like entertainment, customer relations, or even assistive care. One problem with generating deepfakes is the requirement for a lot of image training data of the subject which is not an issue if the subject is a celebrity for whom many images already exist. If there are only a small number of training images then the quality of the deepfake will be poor. Some media reports have indicated that a good deepfake can be produced with as few as 500 images but in practice, quality deepfakes require many thousands of images, one of the reasons why deepfakes of celebrities and politicians have become so popular. In this study, we exploit the property of a GAN to produce images of an individual with variable facial expressions which we then use to generate a deepfake. We observe that with such variability in facial expressions of synthetic GAN-generated training images and a reduced quantity of them, we can produce a near-realistic deepfake videos.

下载PDF全文

下载文献需遵守相关版权规定

论文标题