2025-11-13T14:31:10.402139

Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs

Miandashti, Brenner

In addition to accurate scene understanding through precise semantic segmentation of LiDAR point clouds, detecting out-of-distribution (OOD) objects, instances not encountered during training, is essential to prevent the incorrect assignment of unknown objects to known classes. While supervised OOD detection methods depend on auxiliary OOD datasets, unsupervised methods avoid this requirement but typically rely on predictive entropy, the entropy of the predictive distribution obtained by averaging over an ensemble or multiple posterior weight samples. However, these methods often conflate epistemic (model) and aleatoric (data) uncertainties, misclassifying ambiguous in distribution regions as OOD. To address this issue, we present an unsupervised OOD detection approach that employs epistemic uncertainty derived from hierarchical Bayesian modeling of Gaussian Mixture Model (GMM) parameters in the feature space of a deep neural network. Without requiring auxiliary data or additional training stages, our approach outperforms existing uncertainty-based methods on the SemanticKITTI dataset, achieving an 18\% improvement in AUROC, 22\% increase in AUPRC, and 36\% reduction in FPR95 (from 76\% to 40\%), compared to the predictive entropy approach used in prior works.

academic

Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs

Basic Information

Paper ID: 2510.08631
Title: Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs
Authors: Hanieh Shojaei Miandashti, Claus Brenner (Leibniz University Hannover)
Classification: cs.CV, cs.LG
Publication Date: October 8, 2025 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2510.08631

Abstract

This paper proposes an epistemic uncertainty estimation method based on hierarchical Bayesian Gaussian Mixture Model (GMM) parameter modeling for out-of-distribution (OOD) detection in LiDAR semantic segmentation. The method requires neither auxiliary data nor additional training stages, achieving significant improvements over existing prediction entropy-based methods on the SemanticKITTI dataset: 18% improvement in AUROC, 22% improvement in AUPRC, and reduction of FPR95 from 76% to 40%.

Research Background and Motivation

Problem Definition

In safety-critical applications such as autonomous driving, LiDAR semantic segmentation must not only accurately understand known scenes but also identify out-of-distribution (OOD) objects unseen during training, avoiding misclassification of unknown objects as known categories.

Problem Significance

Safety Requirements: Autonomous driving systems need to identify anomalous objects on roads (e.g., trash bins, signs, animals)
Reliability Demands: Deep models tend to produce overconfident but incorrect predictions for OOD inputs
Practical Considerations: Real-world environments contain numerous object categories not covered in training datasets

Limitations of Existing Methods

Supervised Methods: Rely on auxiliary OOD datasets, which are difficult to obtain in the LiDAR domain
Unsupervised Methods: Primarily depend on prediction entropy, conflating epistemic and aleatoric uncertainty
Uncertainty Confusion: Prediction entropy mixes model uncertainty with data noise, leading to misclassification of ambiguous in-distribution regions as OOD

Core Contributions

Proposed OOD detection method based on epistemic uncertainty: Separates epistemic and aleatoric uncertainty through hierarchical Bayesian modeling of GMM parameters
Unsupervised Framework: Requires neither auxiliary OOD data nor additional training stages, enhancing practicality
Significant Performance Improvements: Substantially surpasses existing methods on the SemanticKITTI dataset
Theoretical Contribution: Demonstrates the advantages of epistemic uncertainty over prediction entropy in OOD detection

Method Details

Task Definition

Given range view representations of LiDAR point clouds, the objectives are:

Perform pixel-level semantic segmentation
Identify pixel regions belonging to OOD
Quantify prediction uncertainty

Model Architecture

1. Generative-Discriminative Hybrid Framework

Based on the GMMSeg architecture, employing deep neural networks to extract pixel-level feature representations z ∈ R^D, then modeling class-conditional Gaussian mixture distributions for each semantic class c in feature space:

p(z | c) = Σ(k=1 to K) π_k^(c) N(z | μ_k^(c), Σ_k^(c))

where π_k^(c), μ_k^(c), Σ_k^(c) represent the mixture weight, mean, and covariance of the k-th Gaussian component, respectively.

2. Hierarchical Bayesian Modeling

The key innovation lies in hierarchical Bayesian modeling of GMM parameters:

Prior Distribution: Gaussian-inverse-gamma conjugate priors for the mean and variance of each Gaussian component
Posterior Update: Updates parameter posterior distributions based on training data
Parameter Sampling: Samples multiple sets of GMM parameters from the posterior distribution during inference

Specifically:

μ_k^(c) ~ N(μ_0, σ_k^2(c)/κ_0)
σ_k^2(c) ~ Inv-Gamma(α_0, β_0)

3. Epistemic Uncertainty Estimation

By sampling n sets of GMM parameters, computes the frequency distribution of class predictions for each pixel, then calculates entropy:

H[y | z] = -Σ(c=1 to C) p̄_c log p̄_c

where p̄_c = (1/n)Σ(i=1 to n)Iy^(i) = c represents the empirical probability of class c.

Technical Innovations

Uncertainty Separation: Directly estimates epistemic uncertainty through parameter sampling, avoiding the conflation of two uncertainty types in prediction entropy
No Retraining Required: Leverages the feature space of existing segmentation networks, achieving uncertainty estimation through GMM modeling
Theoretical Foundation: Based on mutual information theory, epistemic uncertainty is defined as Iy; θ | x = H(p(y|x)) - E_θH(p(y|x,θ))

Experimental Setup

Datasets

SemanticKITTI: LiDAR point cloud dataset with 19 semantic classes
OOD Classes: Uses the outlier class in the dataset as OOD samples (trash bins, signs, animals, etc.)
Input Format: 64×1024×5 range view images (x, y, z coordinates, intensity, distance)
Feature Dimension: 32-dimensional features extracted using SalsaNext backbone network

Evaluation Metrics

AUROC: Area under the ROC curve, measuring overall classification performance
AUPRC: Area under the precision-recall curve, suitable for imbalanced class scenarios
FPR95: False positive rate when true positive rate is 95%, measuring safety
mIoU: Mean intersection over union for semantic segmentation

Comparison Methods

MSP: Maximum Softmax Probability
ODIN: Temperature scaling and input perturbation method
MC Dropout: Monte Carlo Dropout
Deep Ensembles (DE): Deep ensemble methods
GMMSeg: Original GMM segmentation method (without uncertainty modeling)

Implementation Details

Backbone Network: SalsaNext
GMM Components: 2 Gaussian components per class
Number of Samples: 20 GMM parameter samples
Threshold Strategy: Unsupervised percentile threshold (top 5%)

Experimental Results

Main Results

Method	AUROC (%)	AUPRC (%)	FPR95 (%)	mIoU (%)
MSP	70.41	10.90	76.00	56.37
ODIN	73.74	12.45	75.54	56.37
MC Dropout	73.64	13.65	75.92	57.15
Deep Ensembles	73.03	16.14	76.48	57.17
GMMSeg	87.62	26.14	48.84	57.60
Proposed Method	91.06	37.67	40.14	57.71

Key Findings

Significant Performance Improvement: Compared to the best baseline (GMMSeg), AUROC improves by 3.44%, AUPRC improves by 11.53%
Enhanced Safety: FPR95 reduces from 76.48% (Deep Ensembles) to 40.14%
Segmentation Performance: Semantic segmentation accuracy shows slight improvement through voting mechanism

Qualitative Analysis

Through visualization comparison:

Proposed Method: Accurately identifies genuine OOD objects (signs, trash bins, etc.), maintains moderate uncertainty for ambiguous ID regions
Deep Ensembles: Produces excessive false positives at semantic boundaries, misses genuine OOD objects

Uncertainty Distribution Analysis

Comparing prediction entropy and epistemic uncertainty distributions:

Prediction entropy exhibits substantial overlap between ID and OOD samples
Epistemic uncertainty achieves clearer separation, with ID samples concentrated in low uncertainty regions

OOD Detection Method Classification

Supervised Methods: Rely on auxiliary datasets for outlier exposure training
Unsupervised Methods: Based on uncertainty estimation, including deterministic methods (MSP, ODIN) and Bayesian methods (MC Dropout, Deep Ensembles)

Uncertainty Estimation

Prediction Entropy Methods: Mix epistemic and aleatoric uncertainty
Mutual Information Methods: Theoretically capable of separating two uncertainty types, but difficult to compute accurately in deep networks
Contribution of This Work: Achieves direct estimation of epistemic uncertainty through Bayesian modeling of GMM parameters

Conclusions and Discussion

Main Conclusions

Epistemic uncertainty is more effective than prediction entropy in OOD detection
Hierarchical Bayesian GMM modeling provides a practical method for estimating epistemic uncertainty
The unsupervised framework eliminates dependence on auxiliary data

Limitations

False Positive Issues: May still produce high uncertainty for semantically ambiguous or incorrectly segmented ID regions
Threshold Selection: Depends on percentile threshold strategy, which may not be applicable to all scenarios
Computational Complexity: Requires multiple parameter samplings, increasing inference time

Future Directions

Investigate strategies to reduce ambiguity in boundary regions
Improve uncertainty threshold selection methods
Explore more efficient uncertainty estimation methods

In-Depth Evaluation

Strengths

Theoretical Contribution: Clearly articulates the advantages of epistemic uncertainty in OOD detection with theoretical foundation
Method Innovation: Cleverly achieves epistemic uncertainty estimation through Bayesian modeling of GMM parameters
Comprehensive Experiments: Conducts thorough comparisons with multiple baselines on standard datasets
Strong Practicality: Requires neither auxiliary data nor retraining, facilitating deployment

Weaknesses

Generalization Verification Needed: Validated only on SemanticKITTI dataset; requires verification on additional datasets
Parameter Sensitivity: Lacks detailed analysis of hyperparameter selection such as GMM component numbers
Computational Overhead: Insufficient analysis of computational costs for multiple samplings
Theoretical Analysis: Lacks in-depth theoretical explanation of why GMM modeling better separates uncertainty types

Impact

Academic Value: Provides new research directions for OOD detection in LiDAR semantic segmentation
Practical Value: Significant importance for safety-critical applications such as autonomous driving
Reproducibility: Clear method description with relatively straightforward implementation

Applicable Scenarios

Autonomous Driving: Detection of anomalous objects on roads
Robot Navigation: Identification of obstacles in unknown environments
Security Monitoring: Detection of anomalous behaviors or objects
Industrial Inspection: Identification of defective products on production lines

References

The paper cites important works in related fields, including:

GMMSeg 18: Foundational work in generative semantic segmentation
Deep Ensembles 16: Classical method for uncertainty estimation
MC Dropout 7: Representative work in Bayesian deep learning
SemanticKITTI 1: Standard dataset for LiDAR semantic segmentation

This paper makes important contributions to the field of OOD detection in LiDAR semantic segmentation. Through clever technical design, it achieves effective estimation of epistemic uncertainty, providing valuable solutions for improving the safety and reliability of autonomous driving systems.