真实图像编辑的内域gan倒置

论文标题

真实图像编辑的内域gan倒置

In-Domain GAN Inversion for Real Image Editing

论文作者

Zhu, Jiapeng, Shen, Yujun, Zhao, Deli, Zhou, Bolei

论文摘要

最近的工作表明，经过训练以合成图像的训练时，在生成对抗网络（GAN）的潜在空间中出现了各种语义。但是，很难将这些学习的语义用于真实图像编辑。将真实图像馈送到经过训练的GAN发生器的常见做法是将其倒回潜在代码。但是，现有的反转方法通常着重于通过像素值重建目标图像，但未能将倒置代码降落在原始潜在空间的语义域中。结果，重建的图像不能通过改变倒置代码来很好地支持语义编辑。为了解决这个问题，我们提出了一种内域倒置方法，该方法不仅忠实地重建输入图像，而且还确保了倒置代码对编辑具有语义意义。我们首先学习了一个新颖的域引导编码器，将给定的图像投射到甘恩的本地潜在空间。然后，我们通过涉及编码器作为正规器来微调编码代码并更好地恢复目标图像来提出域调控优化。广泛的实验表明，我们的倒置方法实现了满足真实图像重建的满足，更重要的是促进了各种图像编辑任务，从而表现出明显优于启动。

Recent work has shown that a variety of semantics emerge in the latent space of Generative Adversarial Networks (GANs) when being trained to synthesize images. However, it is difficult to use these learned semantics for real image editing. A common practice of feeding a real image to a trained GAN generator is to invert it back to a latent code. However, existing inversion methods typically focus on reconstructing the target image by pixel values yet fail to land the inverted code in the semantic domain of the original latent space. As a result, the reconstructed image cannot well support semantic editing through varying the inverted code. To solve this problem, we propose an in-domain GAN inversion approach, which not only faithfully reconstructs the input image but also ensures the inverted code to be semantically meaningful for editing. We first learn a novel domain-guided encoder to project a given image to the native latent space of GANs. We then propose domain-regularized optimization by involving the encoder as a regularizer to fine-tune the code produced by the encoder and better recover the target image. Extensive experiments suggest that our inversion method achieves satisfying real image reconstruction and more importantly facilitates various image editing tasks, significantly outperforming start-of-the-arts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题