The Bayesian elastic net regression model is characterized by the regression coefficient prior distribution, the negative log density of which corresponds to the elastic net penalty function. While Markov chain Monte Carlo (MCMC) methods exist for sampling from the posterior of the regression coefficients given the penalty parameters, full Bayesian inference that incorporates uncertainty about the penalty parameters remains a challenge due to an intractable integrable in the posterior density function. Though sampling methods have been proposed that avoid computing this integral, all correctly-specified methods for full Bayesian inference that have appeared in the literature involve at least one "Metropolis-within-Gibbs" update, requiring tuning of proposal distributions. The computational landscape is complicated by the fact that two forms of the Bayesian elastic net prior have been introduced, and two representations (with and without data augmentation) of the prior suggest different MCMC algorithms. We review the forms and representations of the prior, discuss all combinations of these different treatments for the first time, and introduce one combination of form and representation that has yet to appear in the literature. We introduce MCMC algorithms for full Bayesian inference for all treatments of the prior. The algorithms allow for direct sampling of all parameters without any "Metropolis-within-Gibbs" steps. The key to the new approach is a careful transformation of the parameter space and an analysis of the resulting full conditional density functions that allows for efficient rejection sampling. We make empirical comparisons between our approaches and existing MCMC samplers for different data structures.
The Bayesian elastic net regression model is characterized through a prior distribution on regression coefficients, whose negative log-density corresponds to the elastic net penalty function. While MCMC methods exist for sampling from the posterior distribution of regression coefficients given penalty parameters, complete Bayesian inference that incorporates uncertainty in the penalty parameters remains challenging due to intractable integrals in the posterior density function. Although sampling methods have been proposed to avoid computing this integral, all correctly specified complete Bayesian inference methods in the literature involve at least one "Metropolis-within-Gibbs" update requiring adjustment of the proposal distribution. Computational complexity is further exacerbated by the existence of two forms of Bayesian elastic net priors in the literature, and two representations of the prior (with and without data augmentation) that suggest different MCMC algorithms. This paper reviews the prior forms and representations, discusses for the first time all combinations of these different treatments, and introduces a combination of form and representation not previously appearing in the literature. We introduce MCMC algorithms for complete Bayesian inference for all prior treatments, allowing direct sampling of all parameters without any "Metropolis-within-Gibbs" steps.
The Bayesian elastic net regression model has become a popular regression method across many research fields. The model is characterized by a prior distribution on regression coefficients whose negative log-density corresponds to the elastic net penalty function:
Intractable Integrals: The normalizing constant of the prior distribution contains the term Φ(−λ1/(2σλ2))−p, where Φ(⋅) is the standard normal cumulative distribution function, which is an integral expression without closed-form solution.
Parameterization Complexity: Two different prior parameterization forms exist in the literature:
Commonly-scaled: Both λ2βTβ and λ1∣β∣1 are scaled by 2σ2
Differentially-scaled: Different terms use different scaling factors
Representation Diversity: Each parameterization form has two representations:
Direct representation: Without data augmentation
Data augmentation representation: Introducing latent variables in a hierarchical model
Comprehensive Review: First comprehensive review of all combinations of Bayesian elastic net prior forms and representations, introducing a new combination (differentially-scaled direct representation)
Parameter Space Transformation: Proposes clever parameter space transformations that confine the complex Φ(⋅) term to a single complete conditional distribution
Tuning-Free MCMC Algorithm: Develops MCMC algorithms requiring no "Metropolis-within-Gibbs" steps, avoiding proposal distribution adjustment issues
Efficient Rejection Sampling: Designs efficient rejection sampling algorithms with automatically-tuned piecewise exponential proposal distributions based on log-concavity analysis
Theoretical Guarantees: Provides theoretical results on log-concavity of key distributions and mode bounds
Under the normal linear regression model y=Xβ+ε (where ε∼N(0,σ2In)), conduct complete Bayesian elastic net inference, including modeling uncertainty in penalty parameters λ1,λ2 and error variance σ2.
Method Effectiveness: The proposed rejection sampling method successfully eliminates tuning requirements, providing competitive or superior performance in most cases
Theoretical Contribution: Parameter transformation and log-concavity analysis provide new theoretical foundations for Bayesian elastic net computation
Practical Value: The automatic nature of the algorithm makes it more suitable for practical applications
High-Dimensional Performance: The relative advantages of the method are less pronounced in some high-dimensional settings compared to low-dimensional cases
Prior Restrictions: Log-concavity requirement of L≥1 limits use of certain priors
Parameterization Dependence: Performance is sensitive to parameterization choices