When applying deep learning models in open-world scenarios, active learning (AL) strategies are crucial for identifying label candidates from a nearly infinite amount of unlabeled data. In this context, robust out-of-distribution (OOD) detection mechanisms are essential for handling data outside the target distribution of the application. However, current works investigate both problems separately. In this work, we introduce SISOM as the first unified solution for both AL and OOD detection. By leveraging feature space distance metrics SISOM combines the strengths of the currently independent tasks to solve both effectively. We conduct extensive experiments showing the problems arising when migrating between both tasks. In these evaluations SISOM underlined its effectiveness by achieving first place in two of the widely used OpenOOD benchmarks and second place in the remaining one. In AL, SISOM outperforms others and delivers top-1 performance in three benchmarks
A Unified Approach Towards Active Learning and Out-of-Distribution Detection
- Paper ID: 2405.11337
- Title: A Unified Approach Towards Active Learning and Out-of-Distribution Detection
- Authors: Sebastian Schmidt, Leonard Schenk, Leo Schwinn, Stephan Günnemann
- Category: cs.CV
- Publication Date: May 2024 (arXiv v3: October 12, 2025)
- Paper Link: https://arxiv.org/abs/2405.11337
When applying deep learning models in open-world scenarios, active learning (AL) strategies are crucial for identifying labeling candidates from large quantities of unlabeled data. In this context, robust out-of-distribution (OOD) detection mechanisms are necessary for handling data outside the application's target distribution. However, current work addresses these two problems separately. This paper introduces SISOM as the first unified solution for both AL and OOD detection. By leveraging feature space distance metrics, SISOM combines the advantages of currently independent tasks to effectively address both problems. On the OpenOOD benchmark, SISOM achieves one first-place and two top-three rankings on near-OOD data, while also achieving top-tier performance on AL tasks.
- Active Learning Challenge: Large-scale deep learning models require extensive annotated data, but annotation costs are high, necessitating intelligent selection of the most valuable samples for annotation
- Out-of-Distribution Detection Need: Models encounter data outside the training distribution during practical deployment, requiring reliable OOD detection mechanisms
- Limitations of Independent Processing: Existing methods treat AL and OOD detection as independent tasks, resulting in conflicting design objectives and system complexity
- Practical Application Requirements: Real-world applications such as mobile robotics and autonomous driving require simultaneous handling of label efficiency and out-of-distribution data issues
- Task Correlation Discovery: The authors observe that near-OOD data and unlabeled data exhibit overlap and ambiguity in the latent space
- Advantages of Unified Approaches: Unified processing eliminates overhead from independent design phases and avoids conflicting design objectives
Through UMAP visualization, the paper reveals that near-OOD data and unlabeled AL candidate data are positioned similarly in feature space, both tending to locate between existing clusters or near decision boundaries, providing theoretical foundation for a unified approach.
- First Unified Method: Proposes SISOM as the first unified method specifically designed to simultaneously address AL and OOD detection
- Feature Space Analysis Technique: Introduces latent space analysis enabling optimization loops for post-training feature space refinement
- Self-Balancing Fusion Mechanism: Proposes self-balancing fusion of uncertainty and diversity metrics
- Superior Experimental Performance: Achieves excellent results against highly specialized state-of-the-art methods on common image AL and OOD benchmarks
Active Learning: Starting from initial labeled pool L and unlabeled pool U, model f selects new samples A for annotation through query strategy Q(x,f), with the goal of achieving optimal performance with minimal annotation.
OOD Detection: Given a model f trained on distribution Ω, determine whether test sample x̃ comes from in-distribution (InD) or out-of-distribution (OOD):
G(x,f)={InDOODif S(x;f)≥λif S(x;f)<λ
SISOM comprises five core components:
Constructs informative feature space representations by concatenating multiple network layers:
z=h1(x)⊕⋯⊕hj(x)⊕⋯⊕hn(x)
Uses KL divergence gradient-weighted features:
g=∂z∂DKL(u∣∣f(x))z~=σ(z⊙g)
where u is a uniform distribution and ⊙ denotes Hadamard product.
Defines intra-class and inter-class distances:
din(z~)=minz′∈ZL(c′=c)∣∣z~−z~′∣∣2dout(z~)=minz′∈ZL(c′=c)∣∣z~−z~′∣∣2
SISOM score:
r(x)=doutdin
Computes average distance ratio as a proxy for feature space separability:
ravg=∣L∣1∑z∈Ldout(σ(z⊙g))din(σ(z⊙g))
Adaptively fuses uncertainty and diversity:
r^i=min(ravg,1)⋅Ei+max(1−ravg,0)⋅ri
Optimizes sigmoid function steepness parameter α for each layer to improve feature space separability:
αopt=argminαravg(α)
- Unified Metric Design: Distance ratio simultaneously captures AL and OOD detection requirements
- Adaptive Fusion: Automatically balances uncertainty and diversity based on feature space quality
- Multi-layer Feature Integration: Leverages information from multiple network layers to enhance representational capacity
- Gradient Weighting: Highlights important neurons through KL divergence gradient weighting
Active Learning:
- CIFAR-10/100: Query size q=1000/2000
- SVHN: Query size q=500
- Uses ResNet18/34 architecture
OOD Detection:
- Follows OpenOOD benchmark settings
- CIFAR-10: Near-OOD (CIFAR-100, Tiny ImageNet), Far-OOD (MNIST, SVHN, etc.)
- CIFAR-100: Near-OOD (CIFAR-10, Tiny ImageNet), Far-OOD (MNIST, SVHN, etc.)
- ImageNet-1k: Near-OOD (SSB-hard, NINCO), Far-OOD (iNaturalist, etc.)
- Active Learning: Classification accuracy variation with percentage of annotated data
- OOD Detection: AUROC (Area Under ROC Curve)
Active Learning Baselines:
- CoreSet, Badge, Learning Loss, CoreGCN
- Semi-supervised settings: TypiClust, ProbCover, PT4AL
OOD Detection Baselines:
- NAC, KNN, MSP, Energy, ReAct, SCALE and 17 other methods
- CIFAR-10: SISOM and SISOMe outperform other methods across all selection cycles
- CIFAR-100: Achieves highest performance in later selection steps
- Semi-supervised Settings: SISOM significantly benefits from pre-training, surpassing PT4AL
| Dataset | SISOMe | SISOM | Best Baseline |
|---|
| CIFAR-10 (Near-OOD) | 91.76 | 91.40 | 91.13 |
| CIFAR-100 (Near-OOD) | 81.10 | 79.42 | 81.31 |
| ImageNet (Near-OOD) | 78.59 | 77.33 | 95.22 |
Overall Ranking: SISOMe achieves top-three rankings across all three benchmarks, with first-place overall ranking.
- Optimal Sigmoid Steepness: Improves performance on CIFAR-100 and ImageNet
- Reduced Subset Selection: Significantly increases inference speed with slight performance improvement
- Feature Enhancement Effect: KL divergence gradient weighting significantly improves feature space separability
| Method | SISOM | SISOMe | Badge | CoreSet |
|---|
| Time (seconds) | 1477±896 | 954±126 | 33664±6682 | 2604±1572 |
With 5% subset selection, SISOMe runtime reduces to 266 seconds with slight performance improvement.
Using checkpoints from AL training for OOD detection, SISOMe achieves best performance on both near-OOD and far-OOD, validating the effectiveness of the unified approach.
- Uncertainty Methods: Monte Carlo Dropout, ensemble methods
- Diversity Methods: CoreSet, Badge and other gradient-based methods
- Hybrid Methods: Methods combining uncertainty and diversity
- Preprocessing Methods: Data augmentation, mixture training
- Post-processing Methods: Neuron filtering, weight operations
- Logits-based Methods: Temperature scaling, energy scores
- Feature Space Methods: Mahalanobis distance, k-nearest neighbors
Existing methods primarily use independent modules for AL and OOD; this paper is the first to explore the intrinsic connections between the two tasks.
- Unified Feasibility: First demonstration that AL and OOD detection can be effectively addressed through a single method
- Superior Performance: Achieves or approaches state-of-the-art performance on both tasks
- Practical Value: Simplifies deployment complexity for real-world applications
- Computational Complexity: Distance computation introduces additional overhead, though mitigated through subset selection
- Feature Space Dependency: Method performance depends on feature space separability
- Hyperparameter Sensitivity: Sigmoid steepness parameter requires tuning for different datasets
- Open-Set Active Learning: Extend unified method to open-set AL scenarios
- Complex Tasks: Extend to more complex tasks such as object detection and semantic segmentation
- Batch Diversification: Investigate batch diversification techniques' impact on dual-task methods
- Strong Innovation: First unified method for AL and OOD detection with significant theoretical and practical value
- Clever Method Design: Distance ratio design is both simple and effective; adaptive fusion mechanism reflects deep insights
- Comprehensive Experiments: Covers multiple datasets and baselines, including ablation studies and runtime analysis
- Solid Theoretical Foundation: Visualization analysis reveals intrinsic connections between the two tasks
- Insufficient Theoretical Analysis: Lacks theoretical guarantees for convergence and generalization ability
- Limited Applicability Scope: Primarily validated on image classification tasks; applicability to other domains remains unknown
- Complex Parameter Tuning: Multiple hyperparameters require dataset-specific tuning, potentially affecting practical usability
- Academic Contribution: Opens new research direction for unified study of AL and OOD detection
- Practical Value: Significant importance for real-world applications such as mobile robotics and autonomous driving
- Reproducibility: Provides detailed implementation details and code for easy reproduction and extension
- Resource-Constrained Environments: Applications requiring simultaneous consideration of annotation efficiency and OOD detection
- Real-time Systems: Through subset selection, maintains performance while improving efficiency
- Open-World Applications: Autonomous driving, robot navigation and other scenarios facing distribution shifts
The paper cites extensive related work, primarily including:
- Active Learning: Settles (2010), Sener & Savarese (2018), Ash et al. (2020)
- OOD Detection: Yang et al. (2022), Liu et al. (2020), Sun et al. (2022)
- OpenOOD Benchmark: Yang et al. (2022), Zhang et al. (2023)
Overall Assessment: This is a high-quality research paper proposing an innovative unified method addressing two important problems with sufficient experimental validation, significantly advancing the related field. Despite some theoretical and practical limitations, its pioneering contributions and superior performance make it an important work in the field.