Multi streams with dynamic balancing-based Conditional Generative Adversarial Network for paired image generation

Authors:

Abstract

The computer vision community has witnessed the emergence of a new CNN-based architecture, the Generative Adversarial Network (GAN), which can generate fake images that resemble real ones. The widespread use of GANs has extended the image-to-image translation strategy to more diverse tasks that were previously handled by traditional CNNs, such as medical analysis and semantic segmentation. In this paper, we propose a generic GAN referred to as Multi Streams with Dynamic Balancing-based Conditional Generative Adversarial Network (MSDB-CGAN). Through its dedicated input streams and automatic skip connections, the MSDB-CGAN efficiently serves more challenging applications that require multiple input images, such as binocular depth estimation. Moreover, the proposed GAN analyzes the inputs with respect to the target image and then assigns dynamic weights to the input streams. To validate the proposed MSDB-CGAN, we targeted four challenging tasks: binocular depth estimation, human-pose translation, middle frame interpolation, and future frame prediction. These applications present different input requirements and configurations. The reported quantitative and qualitative comparisons show that the MSDB-CGAN significantly outperforms existing GANs as well as traditional CNN-based architectures.
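The abstract does not say how the dynamic weights are computed, so the following is only an illustrative sketch of the general idea of weighting several input streams relative to a target, not the MSDB-CGAN's actual mechanism. The scoring rule (negative mean absolute difference to the target), the softmax normalization, the weighted-sum fusion, and all function names are assumptions introduced here for illustration.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def stream_weights(streams, target):
    # Hypothetical scoring rule (an assumption, not from the paper):
    # a stream that is closer to the target gets a higher score.
    scores = []
    for stream in streams:
        mad = sum(abs(a - b) for a, b in zip(stream, target)) / len(target)
        scores.append(-mad)
    return softmax(scores)

def fuse(streams, weights):
    # Weighted sum of the streams, element by element.
    n = len(streams[0])
    return [sum(w * s[i] for w, s in zip(weights, streams)) for i in range(n)]

# Toy one-dimensional "images" standing in for two input streams.
left = [0.0, 0.5, 1.0]
right = [1.0, 1.0, 1.0]
target = [0.0, 0.5, 1.0]

weights = stream_weights([left, right], target)
fused = fuse([left, right], weights)
```

In this toy setup the weights sum to one and the stream that matches the target receives the larger weight. A real model would learn such weights from feature maps rather than compare raw pixel values.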

Keywords: Generative adversarial learning, Conditional image generation, Dynamic balancing, Multi-streaming inputs, Depth estimation, Frame synthesis, Human pose translation

Article history: Received 21 October 2021, Revised 9 June 2022, Accepted 9 June 2022, Available online 18 June 2022, Version of Record 27 June 2022.

Paper URL: https://doi.org/10.1016/j.knosys.2022.109252