Conditional density estimation and simulation through optimal transport

Authors: Esteban G. Tabak, Giulio Trigila, Wenjun Zhao

Abstract

A methodology to estimate from samples the probability density of a random variable x conditional on the values of a set of covariates \(\{z_{l}\}\) is proposed. The methodology relies on a data-driven formulation of the Wasserstein barycenter, posed as a minimax problem in terms of the conditional map carrying each sample point to the barycenter and a potential characterizing the inverse of this map. This minimax problem is solved through the alternation of a flow developing the map in time and the maximization of the potential through an alternate projection procedure. The dependence on the covariates \(\{z_{l}\}\) is formulated in terms of convex combinations, so that it can be applied to variables of nearly any type, including real, categorical and distributional. The methodology is illustrated through numerical examples on synthetic and real data. The real-world example is meteorological: forecasting the temperature distribution at a given location as a function of time, and estimating the joint distribution of the highest and lowest daily temperatures at a location as a function of the date.
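The role of the barycenter in conditional simulation can be illustrated with a minimal sketch. It is not the minimax flow algorithm of the paper. It uses a simpler, well-known special case as a stand-in: for a one-dimensional x and a categorical covariate z, the quantile function of the Wasserstein-2 barycenter is the weighted average of the conditional quantile functions. The function names, the grid size, and the synthetic Gaussian data are all assumptions made for illustration.

```python
import numpy as np

def quantile_barycenter(groups, weights, n_grid=200):
    """Empirical quantiles of each group and of their 1D W2 barycenter."""
    levels = (np.arange(n_grid) + 0.5) / n_grid
    # One row of quantiles per covariate value z = k.
    q = np.array([np.quantile(g, levels) for g in groups])
    # In 1D the barycenter quantile function is the weighted mean of the rows.
    q_bar = weights @ q
    return q, q_bar

def to_barycenter(x, q_group, q_bar):
    """Monotone map carrying samples of one group to the barycenter."""
    # np.interp clamps values outside the quantile grid to its end points.
    return np.interp(x, q_group, q_bar)

def from_barycenter(y, q_group, q_bar):
    """Inverse map: barycenter samples back to a chosen group."""
    return np.interp(y, q_bar, q_group)

# Synthetic data (an assumption): x | z=0 ~ N(0, 1), x | z=1 ~ N(5, 2^2).
rng = np.random.default_rng(0)
groups = [rng.normal(0.0, 1.0, 2000), rng.normal(5.0, 2.0, 2000)]
weights = np.array([0.5, 0.5])

q, q_bar = quantile_barycenter(groups, weights)

# Remove the dependence on z by pushing every sample to the barycenter.
y = np.concatenate([to_barycenter(g, q[k], q_bar) for k, g in enumerate(groups)])

# Simulate x | z=1 by pulling all barycenter samples back through the inverse map.
x_sim = from_barycenter(y, q[1], q_bar)
```

In this sketch every sample, whatever its original covariate value, contributes to the simulation of x given z = 1, so the number of simulated points is the total sample size rather than the size of one group.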

Keywords: Conditional density estimation, Optimal transport, Wasserstein barycenter, Explanation of variability, Confounding factors, Sampling, Uncertainty quantification

Paper URL: https://doi.org/10.1007/s10994-019-05866-3