This paper introduces Conformal Thresholded Intervals (CTI), a novel conformal regression method that aims to produce the smallest possible prediction set with guaranteed coverage. Unlike existing methods that rely on nested conformal frameworks and full conditional distribution estimation, CTI estimates the conditional probability density for a new response to fall into each interquantile interval using off-the-shelf multi-output quantile regression. By leveraging the inverse relationship between interval length and probability density, CTI constructs prediction sets by thresholding the estimated conditional interquantile intervals based on their length. The optimal threshold is determined using a calibration set to ensure marginal coverage, effectively balancing the trade-off between prediction set size and coverage. CTI's approach is computationally efficient and avoids the complexity of estimating the full conditional distribution. The method is theoretically grounded, with provable guarantees for marginal coverage and achieving the smallest prediction size given by Neyman-Pearson . Extensive experimental results demonstrate that CTI achieves superior performance compared to state-of-the-art conformal regression methods across various datasets, consistently producing smaller prediction sets while maintaining the desired coverage level. The proposed method offers a simple yet effective solution for reliable uncertainty quantification in regression tasks, making it an attractive choice for practitioners seeking accurate and efficient conformal prediction.
- Paper ID: 2407.14495
- Title: Conformal Thresholded Intervals for Efficient Regression
- Authors: Rui Luo (City University of Hong Kong), Zhixin Zhou (Alpha Benito Research)
- Classification: cs.LG, stat.ML
- Publication Time/Conference: AAAI 2025
- Paper Link: https://arxiv.org/abs/2407.14495
- Code Link: https://github.com/luo-lorry/CTI
This paper proposes Conformal Thresholded Intervals (CTI), a novel conformal regression method designed to produce prediction sets as small as possible while guaranteeing coverage. Unlike existing methods that rely on nested conformal frameworks and complete conditional distribution estimation, CTI employs off-the-shelf multi-output quantile regression to estimate the conditional probability density that a new response falls into each inter-quantile interval. By leveraging the inverse relationship between interval length and probability density, CTI constructs prediction sets by thresholding inter-quantile intervals based on their estimated lengths. An optimal threshold is determined using a calibration set to ensure marginal coverage, effectively balancing the trade-off between prediction set size and coverage rate.
Conformal prediction is a powerful framework for constructing prediction sets with finite-sample coverage guarantees. Existing conformal regression methods fall into two main categories:
- Directly using quantile regression models to predict the lower and upper endpoints of intervals
- First estimating the complete conditional distribution, then inverting to obtain prediction sets
- Quantile Regression Approaches: Typically produce equal-tailed intervals, but for skewed conditional distributions, the shortest valid intervals may be imbalanced
- Density Estimation Methods: Can adapt to skewness but usually involve many tuning parameters, are difficult to interpret, and are complex for practitioners
- Existing methods may produce suboptimal prediction sets when conditional distributions are skewed
- Need for a method that adapts to local data density while being computationally efficient
- Desire to avoid the complexity of estimating complete conditional distributions
- Proposes CTI Method: A novel conformal prediction regression method that leverages multi-output quantile regression to estimate inter-quantile intervals and constructs prediction sets by thresholding these intervals
- Theoretical Analysis: Proves that CTI guarantees marginal coverage and can achieve desired conditional coverage and minimal expected prediction interval length under specific conditions
- Experimental Validation: Extensive numerical experiments on simulated and real datasets demonstrate that CTI produces smaller prediction sets while maintaining effective coverage
- Practical Utility: The method is simple to implement and interpret, making it attractive for practitioners seeking reliable uncertainty quantification
Given a regression dataset {(xi,yi)}i=1n, where xi∈X⊆Rd, yi∈Y⊆R. The goal is to construct a conformal predictor that outputs a prediction set C(x)⊆Y for each test input x, satisfying:
P(Y∈C(X))≥1−α
while minimizing the expected size of the prediction set.
The key insight of CTI is to exploit the inverse relationship between interval length and probability density. For conditional probability density f(y∣x), if quantile regression is sufficiently accurate:
f(y∣x)≈K⋅μ(Ik(x))1
where μ(Ik(x)) is the length of interval Ik(x).
Step 1: Multi-output Quantile Regression
Apply quantile regression on the training set to predict the τ-th quantile of the conditional distribution Y∣X=x:
q^k(x) for k=0,1,…,K
where τ=k/K.
Step 2: Define Inter-quantile IntervalsIk(x)=(q^k−1(x),q^k(x)] for k=1,…,K
Step 3: Construct Prediction Sets
Based on interval length thresholding:
C(x)=⋃{Ik(x):μ(Ik(x))≤t,k=1,…,K}
Step 4: Determine Threshold
Use the calibration set to determine threshold t:
t=(1−α)-th quantile of 1+∣Ical∣1∑i∈Icalδμ(Ik(yi)(xi))+δ∞
- Direct Thresholding Strategy: Unlike CHR which requires explicit binning of the response space, CTI directly trains a multi-output quantile regression model
- Global Thresholding: CTI adopts a global perspective to threshold inter-quantile intervals across all x values, improving efficiency of marginal coverage
- Theoretical Optimality: Based on the Neyman-Pearson lemma, CTI is theoretically close to optimal prediction sets
Simulated Data:
- Generate n=10000 samples, Xi∼Uniform[0,1]
- Response variable: y∼Triangular(0,x,x)
- Conditional density: f(y∣x)=x22y1{y∈(0,x)}
Real Datasets (13 total):
bike, bio, blog, community, concrete, facebook1, facebook2, homes, meps19, meps20, meps21, star, etc.
- Coverage: Proportion of prediction sets containing true values
- Prediction Set Size: Average length/measure of prediction sets
- Split Conformal: Basic split conformal method
- CQR (Conformal Quantile Regression): Conformal quantile regression
- CHR (Conformal Histogram Regression): Conformal histogram regression
- Data split: 20% test, remaining split 70% training and 30% calibration
- Number of quantiles: K=100
- Base models: Random Forest (RF) and Neural Networks (NN)
- Repeated experiments: 10 independent runs
- Significance level: α=0.1 (90% coverage)
Simulated Data Results:
- CTI theoretical expected set size: 0.317
- CHR theoretical expected set size: 0.342
- CQR theoretical expected set size: 0.376
- CTI actual performance: CTI(RF) 0.345±0.005, CTI(NN) 0.369±0.015
- All methods achieved approximately 90% coverage
Real Dataset Results:
On 11 out of 13 datasets, CTI produced smaller prediction sets than other methods while maintaining desired coverage levels.
- Consistent Advantages: CTI outperforms baseline methods on most datasets
- Model Dependency: CTI's efficiency depends on the quality of the underlying quantile regression model
- Small Dataset Challenges: On smaller datasets (e.g., star, n=2161), CQR may perform better
Interval Length Analysis:
By comparing the distribution of response interval lengths with all interval lengths, CTI's performance depends on specific datasets and the underlying quantile regression model. Theoretically, the mean difference between the two distributions should be zero, but actual deviations exist, highlighting the method's dependence on data and models.
- Traditional Methods: Hunter & Lange (2000), Meinshausen (2006), etc.
- Multi-quantile Estimation: Cho et al. (2017) showed that simultaneous estimation of multiple quantiles is more effective than individual estimation
- Non-crossing Constraints: Moon et al. (2021), Brando et al. (2022), etc. address quantile crossing issues
- Nested Conformal Prediction: Romano et al. (2019), Sesia & Candès (2020), etc.
- Density Estimation Methods: Izbicki et al. (2020), Sesia & Romano (2021), etc.
- This Work's Innovation: Directly estimate conditional probability density rather than complete conditional distribution
- CTI provides a simple and effective conformal regression method
- By thresholding inter-quantile intervals, CTI produces smaller prediction sets than existing methods
- The method is theoretically supported by the Neyman-Pearson lemma, guaranteeing optimality
- Experiments validate the method's effectiveness across multiple datasets
- Interval Constraint: Current implementation does not guarantee prediction sets are always interval-shaped
- Model Dependency: Performance highly depends on the accuracy of the underlying quantile regression model
- Small Sample Challenges: May underperform traditional methods on small datasets
- Parameter Selection: Choice of quantile number K requires balancing expressiveness and computational efficiency
- Develop CTI variants that produce interval-shaped prediction sets
- Improve performance on small sample cases
- Study adaptive methods for selecting quantile number K
- Integration with other uncertainty quantification methods
- Solid Theoretical Foundation: Theoretical optimality guarantees based on the Neyman-Pearson lemma
- Simple Methodology: Avoids the complexity of complete conditional distribution estimation
- Comprehensive Experiments: Extensive validation on simulated and real data
- High Practical Value: Easy to implement and interpret, suitable for practitioners
- Strong Innovation: Successfully applies thresholding ideas from classification to regression problems
- Limited Applicability: May perform poorly on small sample datasets
- Prediction Set Shape: Cannot guarantee prediction sets are interval-shaped, may produce discontinuous sets
- Hyperparameter Sensitivity: Relatively sensitive to the choice of quantile number K
- Theoretical Analysis: Some theoretical results based on strong assumptions
- Academic Contribution: Provides new perspectives and methods for the conformal prediction field
- Practical Value: Simple and efficient characteristics offer good application prospects
- Reproducibility: Open-source code provided for easy reproduction and extension
- Medium to Large Datasets: Performs best when sample size is sufficient
- Uncertainty Quantification: Regression tasks requiring reliable confidence intervals
- Real-time Applications: Scenarios with high computational efficiency requirements
- Skewed Distributions: Regression problems with skewed conditional distributions
- Romano, Y., Patterson, E., and Candès, E. (2019). Conformalized quantile regression. NeurIPS.
- Sesia, M., and Romano, Y. (2021). Conformal prediction using conditional histograms. NeurIPS.
- Meinshausen, N. (2006). Quantile regression forests. JMLR.
- Sadinle, M., Lei, J., and Wasserman, L. (2019). Least ambiguous set-valued classifiers with bounded error levels. JASA.