Many scientific analyses require simultaneous comparison of multiple functionals of an unknown signal, calling for multidimensional confidence regions with guaranteed simultaneous frequentist coverage under structural constraints (e.g., non-negativity, shape, or physics-based constraints). This paper unifies and extends many previous optimization-based approaches to constrained confidence region construction in linear inverse problems through the lens of statistical test inversion. We begin by reviewing the historical development of optimization-based confidence intervals for the single-functional setting, from "strict bounds" to the Burrus conjecture and its recent refutation via the aforementioned test inversion framework. We then extend this framework to the multiple-functional setting. This framework can be used to: (i) improve the calibration constants of previous methods, yielding smaller confidence regions that still preserve frequentist coverage, (ii) obtain tractable multidimensional confidence regions that need not be hyper-rectangles to better capture functional dependence structure, and (iii) generalize beyond Gaussian error distributions to generic log-concave error distributions. We provide theory establishing nominal simultaneous coverage of our methods and show quantitative volume improvements relative to prior approaches using numerical experiments.
- Paper ID: 2510.11708
- Title: Simultaneous Frequentist Calibration of Confidence Regions for Multiple Functionals in Constrained Inverse Problems
- Authors: Pau Batlle, Pratik Patil, Michael Stanley, Javier Ruiz Lupon, Houman Owhadi, Mikael Kuusela
- Classification: math.ST stat.TH
- Publication Date: October 13, 2025
- Paper Link: https://arxiv.org/abs/2510.11708
Many scientific analyses require simultaneous comparison of multiple functionals of an unknown signal, necessitating the construction of multidimensional confidence regions with guaranteed simultaneous frequentist coverage under structural constraints (such as non-negativity, shape, or physics-based constraints). This paper unifies and extends optimization-based confidence region construction methods for constrained linear inverse problems through the perspective of statistical hypothesis testing inversion. The paper first reviews the historical development of optimization-based confidence intervals in the single-functional setting, from "strict bounds" to the Burrus conjecture and its recent refutation through the hypothesis testing inversion framework. It then extends this framework to the multi-functional setting. The framework enables: (i) improved calibration constants over previous methods, yielding smaller confidence regions while maintaining frequentist coverage; (ii) tractable multidimensional confidence regions that need not be hyperrectangular, better capturing functional dependence structure; (iii) generalization from Gaussian error distributions to general log-concave error distributions.
This paper studies the construction of simultaneous confidence regions for multiple functionals in linear inverse problems. Consider the linear inverse problem:
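$$y = Kx^* + \varepsilon$$
where $y \in \mathbb{R}^n$ is the observation, $x^* \in \mathbb{R}^p$ is the unknown parameter, $K \in \mathbb{R}^{n \times p}$ is the known forward operator, and $\varepsilon \in \mathbb{R}^n$ is random noise.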
- Scientific Need: Many scientific analyses require simultaneous inference of multiple linear functionals Hx∗ of the unknown signal, rather than estimating the entire high-dimensional parameter x∗
- Constraint Information: The true parameter x∗ typically satisfies constraints based on prior physical knowledge (such as non-negativity x∗≥0)
- Simultaneous Coverage: Guaranteeing simultaneous frequentist coverage for all functionals, not merely marginal coverage
- Conservatism: Traditional simultaneous strict bounds (SSB) methods are overly conservative because they first construct a confidence set for x∗ and only then map it to functional space
- Rectangular Restriction: Existing methods typically produce hyperrectangular confidence regions, failing to capture dependence structure between functionals
- Calibration Issues: Historical calibration rules, such as the one posited by the Burrus conjecture (since refuted), lack rigorous theoretical guarantees
- Unified Framework: Unifies single-functional and multi-functional constrained confidence region construction methods through the hypothesis testing inversion perspective
- Theoretical Breakthroughs:
  - Proves convexity of the quantile functions for the $\lambda_{u2}$ and $\lambda_1$ test statistics
  - Determines optimal solution locations for quantile optimization problems
  - Establishes stochastic dominance relationships between test statistics
- Practical Algorithms:
  - Provides optimal calibration constants for non-negativity constraints
  - Develops TFM reduction methods for high-dimensional problems
  - Proposes row space/null space separation techniques
- Performance Improvements: Significantly reduces confidence region volume compared to classical methods while maintaining nominal coverage rates
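Given a matrix $H \in \mathbb{R}^{k \times p}$, the goal is to construct a finite-sample $1-\alpha$ frequentist confidence set $R_\alpha(y) \subseteq \mathbb{R}^k$ for the unknown vector $Hx^* \in \mathbb{R}^k$ such that
$$\mathbb{P}_{y \sim P_x}\left(Hx \in R_\alpha(y)\right) \ge 1 - \alpha$$
holds for all $x \in \mathcal{X}$ (the constraint set).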
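In the single-functional setting, where the target is $h^\top x^*$ for a vector $h \in \mathbb{R}^p$, consider for each $\mu \in \mathbb{R}$ the hypothesis test:
$$H_0: x^* \in \Phi_\mu \cap \mathcal{X} \quad \text{vs} \quad H_1: x^* \in \mathcal{X} \setminus \Phi_\mu$$
where $\Phi_\mu = \{x \in \mathbb{R}^p : h^\top x = \mu\}$.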
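For $\mu \in \mathbb{R}^k$, define $\Phi_\mu = \{x \in \mathbb{R}^p : Hx = \mu\}$, and the hypothesis test becomes:
$$H_0: x^* \in \Phi_\mu \cap \mathcal{X} \quad \text{vs} \quad H_1: x^* \in \mathcal{X} \setminus \Phi_\mu$$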
The paper analyzes three test statistics:
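- Constrained Second Term $\lambda_{c2}(\mu, y)$:
$$\lambda_{c2}(\mu, y) = \min_{Hx = \mu,\; Ax \le b} \|Kx - y\|_2^2 - \min_{Ax \le b} \|Kx - y\|_2^2$$
- Unconstrained Second Term $\lambda_{u2}(\mu, y)$:
$$\lambda_{u2}(\mu, y) = \min_{Hx = \mu,\; Ax \le b} \|Kx - y\|_2^2 - \min_{x \in \mathbb{R}^p} \|Kx - y\|_2^2$$
- Single Term $\lambda_1(\mu, y)$:
$$\lambda_1(\mu, y) = \min_{Hx = \mu,\; Ax \le b} \|Kx - y\|_2^2$$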
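The three statistics differ only in what is subtracted from the shared first term, which a small numerical sketch makes concrete. The example below is an illustration under stated assumptions, not the paper's implementation: the constraint $Ax \le b$ is specialised to non-negativity ($x \ge 0$), the matrices `K` and `H`, the data `y`, and the value `mu` are made-up placeholders, and SciPy's general-purpose SLSQP solver stands in for whatever quadratic programming routine one would use in practice.

```python
import numpy as np
from scipy.optimize import minimize, nnls


def min_on_slice(K, y, H, mu):
    """min ||Kx - y||_2^2 subject to Hx = mu and x >= 0 (solved with SLSQP)."""
    p = K.shape[1]
    res = minimize(
        fun=lambda x: float(np.sum((K @ x - y) ** 2)),
        x0=np.ones(p),
        jac=lambda x: 2.0 * K.T @ (K @ x - y),
        bounds=[(0.0, None)] * p,  # non-negativity, i.e. A = -I and b = 0
        constraints=[{"type": "eq", "fun": lambda x: H @ x - mu, "jac": lambda x: H}],
        method="SLSQP",
        options={"ftol": 1e-12, "maxiter": 500},
    )
    return float(res.fun)


def compute_statistics(K, y, H, mu):
    """Return lambda_1, lambda_u2 and lambda_c2 for the constraint x >= 0."""
    first = min_on_slice(K, y, H, mu)
    # Unconstrained least-squares residual (second term of lambda_u2).
    x_ls = np.linalg.lstsq(K, y, rcond=None)[0]
    unconstrained = float(np.sum((K @ x_ls - y) ** 2))
    # Non-negative least-squares residual (second term of lambda_c2).
    constrained = float(nnls(K, y)[1] ** 2)
    return {
        "lambda_1": first,
        "lambda_u2": first - unconstrained,
        "lambda_c2": first - constrained,
    }


# Hypothetical toy problem; these matrices are placeholders, not the paper's.
K = np.array([[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]])
H = np.array([[1.0, -1.0, 0.0], [0.0, 1.0, -1.0]])
y = np.array([1.0, -0.5])
mu = np.array([0.0, 0.0])

stats = compute_statistics(K, y, H, mu)
print(stats)
```

Because the unconstrained minimum can never exceed the constrained one, the output should always satisfy $\lambda_1 \ge \lambda_{u2} \ge \lambda_{c2} \ge 0$ up to solver tolerance.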
For each test statistic, thresholds must be determined to guarantee 1−α coverage:
- Pointwise Threshold: $d^*(\mu) = \sup_{Hx = \mu,\; Ax \le b} Q_{x,1-\alpha}$
- Global Threshold: $D^* = \sup_{Ax \le b} Q_{x,1-\alpha}$
where $Q_{x,1-\alpha}$ is the $(1-\alpha)$ quantile of $Z_x = \lambda(Hx, Kx + \varepsilon)$.
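To see why these thresholds deliver coverage, the following short derivation may help. It assumes the confidence set is formed by the usual test-inversion rule (accept every $\mu$ whose statistic does not exceed its threshold), which this summary does not spell out explicitly.

```latex
% Assumed test-inversion form of the confidence set:
R_\alpha(y) = \{\mu \in \mathbb{R}^k : \lambda(\mu, y) \le d^*(\mu)\}.

% For any feasible x (Ax \le b), take \mu = Hx and y = Kx + \varepsilon.
% Since x is itself feasible in the supremum defining d^*(Hx),
% we have d^*(Hx) \ge Q_{x,1-\alpha}, hence
\mathbb{P}\bigl(Hx \in R_\alpha(y)\bigr)
  = \mathbb{P}\bigl(Z_x \le d^*(Hx)\bigr)
  \ge \mathbb{P}\bigl(Z_x \le Q_{x,1-\alpha}\bigr)
  \ge 1 - \alpha.

% The global threshold satisfies D^* \ge d^*(\mu) for every \mu,
% so replacing d^*(\mu) by D^* can only enlarge the set and keeps coverage.
```

The trade-off is visible in the last line: the global constant is simpler to compute but can never give a smaller region than the pointwise threshold.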
Theorem 5.4: For any fixed $0 < \alpha < 1$, the quantile function $Q_{u2}(x)$ is convex in $x$.
Theorem 5.6 (Linear Constraints): Under linear constraints $Ax^* \le b$,
$$\sup_{x \in P} Q_{u2}(x) = \max_{i = 1, \dots, m} Q_{u2}(p_i)$$
where $\{p_i\}_{i=1}^m$ is the set of extreme points of the polyhedron $P$.
Theorem 5.7 (Cone Constraints): Under cone constraints $x^* \in C$,
$$\sup_{x \in C} Q_{u2}(x) = Q_{u2}(0)$$
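The link between convexity (Theorem 5.4) and the extreme-point formula can be sketched in a few lines. The argument below is the standard one for convex functions and assumes, as an added simplification, that $P$ is bounded, so that every point of $P$ is a convex combination of its extreme points; it is not taken from the paper's proof.

```latex
% Any x in a bounded polyhedron P can be written as
x = \sum_{i=1}^m \theta_i p_i, \qquad \theta_i \ge 0, \quad \sum_{i=1}^m \theta_i = 1.

% Convexity of Q_{u2} then gives
Q_{u2}(x) \le \sum_{i=1}^m \theta_i \, Q_{u2}(p_i) \le \max_{i=1,\dots,m} Q_{u2}(p_i).

% Each p_i lies in P, so the bound is attained and
\sup_{x \in P} Q_{u2}(x) = \max_{i=1,\dots,m} Q_{u2}(p_i).
```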
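Theorem 5.7 suggests a simple calibration recipe for cone constraints such as non-negativity: estimate the quantile at the single point $x = 0$. The Monte Carlo sketch below illustrates that idea under assumptions of my own choosing: Gaussian noise with identity covariance, placeholder matrices `K` and `H` (not the paper's), and SLSQP as the inner solver. It also reports the same quantile of $\|\varepsilon\|_2^2$, which bounds the statistic from above because the true $x$ is always feasible for the first minimisation.

```python
import numpy as np
from scipy.optimize import minimize


def lambda_u2(mu, y, K, H, x_start):
    """min_{Hx=mu, x>=0} ||Kx-y||^2 minus min_x ||Kx-y||^2."""
    p = K.shape[1]
    res = minimize(
        fun=lambda x: float(np.sum((K @ x - y) ** 2)),
        x0=x_start,
        jac=lambda x: 2.0 * K.T @ (K @ x - y),
        bounds=[(0.0, None)] * p,
        constraints=[{"type": "eq", "fun": lambda x: H @ x - mu, "jac": lambda x: H}],
        method="SLSQP",
        options={"ftol": 1e-12, "maxiter": 500},
    )
    x_ls = np.linalg.lstsq(K, y, rcond=None)[0]
    return float(res.fun) - float(np.sum((K @ x_ls - y) ** 2))


def mc_quantile(x, K, H, alpha, n_draws, seed=0):
    """Monte Carlo estimate of the (1 - alpha) quantile of Z_x for lambda_u2.

    Also returns the matching empirical quantile of ||eps||^2 for comparison.
    """
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal((n_draws, K.shape[0]))
    mu = H @ x
    # Start each solve at the true x, which is feasible by construction.
    z = np.array([lambda_u2(mu, K @ x + e, K, H, x_start=x) for e in eps])
    q_z = float(np.quantile(z, 1.0 - alpha))
    q_eps = float(np.quantile(np.sum(eps ** 2, axis=1), 1.0 - alpha))
    return q_z, q_eps


# Hypothetical forward operator and functionals, chosen only for illustration.
K = np.array([[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]])
H = np.array([[1.0, -1.0, 0.0], [0.0, 1.0, -1.0]])

q_zero, q_noise = mc_quantile(np.zeros(3), K, H, alpha=0.05, n_draws=1000)
print(f"estimated Q_u2(0): {q_zero:.3f}, quantile of ||eps||^2: {q_noise:.3f}")
```

The estimate carries Monte Carlo error, so in practice one would increase `n_draws` or attach an uncertainty estimate before using it as a calibration constant.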
Consider a non-negativity constrained problem:
$$y = Kx^* + \varepsilon, \quad \varepsilon \sim \mathcal{N}(0, I), \quad x^* \ge 0$$
where:
K=(201111),H=(10−110−1)
- SSB_x: Simultaneous strict bounds x-description bounding box
- SSB_μ: Simultaneous strict bounds μ-description
- QuantileZero_x/μ: Improved version using optimal constants
- Bonferroni: Bonferroni-corrected product intervals
- Split Method: Row space/null space separation technique
- Empirical Coverage Rate: Verified through $N = 10^5$ resamples
- Region Area: Computed using polar coordinate integration
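Polar-coordinate integration of a planar region can be sketched as follows, using the identity $\text{area} = \tfrac{1}{2}\int_0^{2\pi} r(\theta)^2\, d\theta$. This is a minimal illustration rather than the paper's code: it assumes the region is star-shaped about a chosen `center` (true for any convex region containing that point) and that membership can be queried through a hypothetical `contains` function, for example a check that a test statistic is below its threshold. The boundary radius along each ray is found by bisection.

```python
import numpy as np


def boundary_radius(contains, center, theta, r_max, n_bisect=60):
    """Distance from center to the region boundary along direction theta."""
    direction = np.array([np.cos(theta), np.sin(theta)])
    lo, hi = 0.0, r_max  # center assumed inside, center + r_max * direction outside
    for _ in range(n_bisect):
        mid = 0.5 * (lo + hi)
        if contains(center + mid * direction):
            lo = mid
        else:
            hi = mid
    return lo


def polar_area(contains, center, r_max, n_angles=720):
    """Approximate 0.5 * integral of r(theta)^2 over [0, 2*pi) on a uniform grid."""
    thetas = np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False)
    radii = np.array([boundary_radius(contains, center, t, r_max) for t in thetas])
    return 0.5 * float(np.mean(radii ** 2)) * 2.0 * np.pi


# Sanity check on an ellipse with semi-axes 2 and 1, whose area is 2 * pi.
def in_ellipse(m):
    return (m[0] / 2.0) ** 2 + m[1] ** 2 <= 1.0


area = polar_area(in_ellipse, center=np.zeros(2), r_max=5.0)
print(area, 2.0 * np.pi)
```

For a confidence region, `contains` would wrap an optimisation solve per query, so the number of angles and bisection steps becomes the main cost driver.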
For y=(0,0) and y=(20,10), the μ-description method produces convex sets strictly contained within x-description bounding boxes, significantly reducing region area.
- x∗=(0,0,0): The QuantileZero_μ method attains coverage close to the nominal 68% level with the smallest average area
- x∗=(5,5,5): All methods achieve coverage, but μ-description methods maintain significant area advantages
- Calibration Constant Improvement: For 68% and 95% confidence levels, optimal constants are 1.644 and 5.139 respectively, showing significant improvement over the $\chi^2_2$ distribution values of 2.279 and 5.991
- Area Reduction: μ-description achieves approximately 30-50% average area reduction compared to x-description bounding boxes
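The chi-squared reference values quoted above are easy to reproduce, which gives a quick check on the baseline that the improved constants are compared against. The snippet below uses `scipy.stats.chi2.ppf` and, as a cross-check, the closed form $-2\ln(\alpha)$ that holds for two degrees of freedom. The improved constants 1.644 and 5.139 are simply copied from the text, not recomputed.

```python
import math
from scipy.stats import chi2

# Baseline thresholds: quantiles of a chi-squared distribution with 2 degrees of freedom.
q68 = float(chi2.ppf(0.68, df=2))
q95 = float(chi2.ppf(0.95, df=2))

# For df = 2 the CDF is 1 - exp(-t / 2), so the quantile is -2 * ln(alpha).
q68_closed = -2.0 * math.log(1.0 - 0.68)
q95_closed = -2.0 * math.log(1.0 - 0.95)

# Improved constants as reported in the summary (copied, not derived here).
improved = {0.68: 1.644, 0.95: 5.139}

for level, baseline in ((0.68, q68), (0.95, q95)):
    ratio = improved[level] / baseline
    print(f"level {level:.2f}: chi2 quantile {baseline:.3f}, improved/baseline ratio {ratio:.3f}")
```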
- Burrus (1964): First proposed optimization-based methods for constrained confidence intervals
- Rust & O'Leary (1986): Developed practical algorithms
- Stark (1992): Introduced strict bounds methods
- Tenorio et al. (2007): Developed TFM reduction techniques
- Batlle et al. (2023): Refuted the Burrus conjecture through hypothesis testing inversion framework
- Constrained Inference Literature: Connections with χ2-bar distribution theory
- Conformal Prediction: Distinctions in objectives and assumptions
- Theoretical Contribution: Establishes a unified hypothesis testing inversion framework for multi-functional constrained confidence regions
- Computational Advantages: Provides scalable algorithms for high-dimensional problems
- Performance Improvements: Significantly reduces confidence region volume compared to classical methods
- $\lambda_{c2}$ Statistic: Quantile functions lack convexity; maximization problems remain open
- Computational Complexity: Extreme point search may be difficult in high dimensions
- Pointwise Thresholds: Computing the entire function $d^*(\mu)$ is typically challenging
- Non-Gaussian Extensions: Extend to general log-concave distributions
- $\lambda_{c2}$ Calibration: Develop calibration algorithms for constrained second-term statistics
- Asymptotic Theory: Study large-sample properties
- Application Domains: Extend to shape constraints and other statistical problems
- Theoretical Rigor: Provides a complete mathematical framework including convexity proofs and optimality results
- Practical Value: Develops scalable algorithms addressing high-dimensional practical problems
- Unified Perspective: Unifies historically dispersed methods under the hypothesis testing inversion framework
- Significant Improvements: Substantially reduces confidence regions while maintaining theoretical guarantees
- Theoretical Gaps: Complete theory for the $\lambda_{c2}$ statistic remains undeveloped
- Computational Limitations: Computational complexity in certain high-dimensional cases
- Limited Experiments: Numerical experiments are relatively simple, lacking complex real-world applications
- Academic Contribution: Provides new theoretical foundations for uncertainty quantification in constrained inverse problems
- Practical Applications: Broad application prospects in physical sciences, engineering, and other fields requiring constrained inference
- Methodological Significance: The hypothesis testing inversion framework may inspire solutions to other statistical problems
- Simultaneous multi-functional inference in linear inverse problems
- Parameter estimation with physical constraints
- Scientific computing requiring strict frequentist guarantees
- Uncertainty quantification in high-dimensional constrained optimization problems
The paper cites 47 relevant references spanning constrained inference, inverse problems, optimization theory, and statistics, providing a solid theoretical foundation for the research.