Leveraging Nested MLMC for Sequential Neural Posterior Estimation with Intractable Likelihoods
Yang, Xiong, He
There is a growing interest in studying sequential neural posterior estimation (SNPE) techniques due to their advantages for simulation-based models with intractable likelihoods. The methods aim to learn the posterior from adaptively proposed simulations using neural network-based conditional density estimators. As an SNPE technique, the automatic posterior transformation (APT) method proposed by Greenberg et al. (2019) performs well and scales to high-dimensional data. However, the APT method requires computing the expectation of the logarithm of an intractable normalizing constant, i.e., a nested expectation. Although atomic proposals were used to render an analytical normalizing constant, it remains challenging to analyze the convergence of learning. In this paper, we reformulate APT as a nested estimation problem. Building on this, we construct several multilevel Monte Carlo (MLMC) estimators for the loss function and its gradients to accommodate different scenarios, including two unbiased estimators, and a biased estimator that trades a small bias for reduced variance and controlled runtime and memory usage. We also provide convergence results of stochastic gradient descent to quantify the interaction of the bias and variance of the gradient estimator. Numerical experiments for approximating complex posteriors with multimodality in moderate dimensions are provided to examine the effectiveness of the proposed methods.
academic
Leveraging Nested MLMC for Sequential Neural Posterior Estimation with Intractable Likelihoods
Title: Leveraging Nested MLMC for Sequential Neural Posterior Estimation with Intractable Likelihoods
Authors: Xiliang Yang (South China University of Technology), Yifei Xiong (Purdue University), Zhijian He (South China University of Technology, Corresponding Author)
This paper investigates the application of Sequential Neural Posterior Estimation (SNPE) techniques for simulation models with intractable likelihood functions. Addressing the nested expectation problem in the Automatic Posterior Transformation (APT) method, which requires computing the logarithmic expectation of an intractable normalization constant, the paper reformulates APT as a nested estimation problem and constructs several Multilevel Monte Carlo (MLMC) estimators, including two unbiased estimators and one biased estimator. The biased estimator trades a small bias for reduced variance and controlled runtime and memory usage. The paper also provides convergence results for stochastic gradient descent, quantifying the interaction between gradient estimator bias and variance.
Challenges of Simulation Models: Simulation models are widely used in neuroscience, physics, and biology, but traditional Bayesian inference faces challenges with intractable likelihood functions and expensive simulators.
Demand for SNPE Methods: Sequential neural posterior estimation methods avoid direct likelihood computation by learning posterior distributions from simulations with adaptive proposals using neural network conditional density estimators.
Limitations of APT Methods: Although the Automatic Posterior Transformation (APT) method proposed by Greenberg et al. performs well and scales to high-dimensional data, it requires computing the logarithmic expectation of an intractable normalization constant, forming a nested expectation problem.
Theoretical Analysis: Provides theoretical upper bounds on bias, variance, and average cost, proving that MLMC methods achieve optimal complexity O(ε^-2)
Convergence Guarantees: Establishes convergence theorems for stochastic gradient descent, quantifying the impact of bias and variance on optimization
Experimental Validation: Verifies method effectiveness on multiple benchmark tasks
Given prior distribution p(θ) and observed data x_o, the goal is to approximate the posterior distribution p(θ|x_o) ∝ p(θ)p(x_o|θ), where the likelihood function p(x|θ) is intractable but can be sampled via a simulator.
Greenberg et al. (2019): Automatic posterior transformation for likelihood-free inference
Giles (2015): Multilevel Monte Carlo methods
Rhee & Glynn (2015): Unbiased estimation with square root convergence for SDE models
Papamakarios & Murray (2016): Fast ε-free inference of simulation models
Summary: This is an important paper with significant theoretical and practical value in the field of likelihood-free Bayesian inference. By cleverly reformulating APT as a nested estimation problem and introducing MLMC techniques, it addresses theoretical analysis difficulties and computational efficiency issues of the original method. While there remains room for improvement in computation time, its memory efficiency and theoretical guarantees make it an important contribution to the field.