Evaluation Metrics for Conditional Image Generation

作者:Yaniv Benny, Tomer Galanti, Sagie Benaim, Lior Wolf

摘要

We present two new metrics for evaluating generative models in the class-conditional image generation setting. These metrics are obtained by generalizing the two most popular unconditional metrics: the Inception Score (IS) and the Fréchet Inception Distance (FID). A theoretical analysis shows the motivation behind each proposed metric and links the novel metrics to their unconditional counterparts. The link takes the form of a product in the case of IS or an upper bound in the FID case. We provide an extensive empirical evaluation, comparing the metrics to their unconditional variants and to other metrics, and utilize them to analyze existing generative models, thus providing additional insights about their performance, from unlearned classes to mode collapse.

论文关键词:Image generation, Conditional generation, Evaluation metrics, Inception Score, Fréchet Inception Distance

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11263-020-01424-w