XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation
Sun, Wang, Peng et al.
Depth estimation remains central to autonomous driving, and radar-camera fusion offers robustness in adverse conditions by providing complementary geometric cues. In this paper, we present XD-RCDepth, a lightweight architecture that reduces the parameters by 29.7% relative to the state-of-the-art lightweight baseline while maintaining comparable accuracy. To preserve performance under compression and enhance interpretability, we introduce two knowledge-distillation strategies: an explainability-aligned distillation that transfers the teacher's saliency structure to the student, and a depth-distribution distillation that recasts depth regression as soft classification over discretized bins. Together, these components reduce the MAE compared with direct training with 7.97% and deliver competitive accuracy with real-time efficiency on nuScenes and ZJU-4DRadarCam datasets.
academic
XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation
This paper proposes XD-RCDepth, a lightweight radar-camera depth estimation architecture that reduces parameters by 29.7% compared to state-of-the-art lightweight baselines while maintaining comparable accuracy. To preserve performance under model compression and enhance interpretability, the authors introduce two knowledge distillation strategies: explainability-aligned distillation (transferring saliency structures from teacher to student models) and depth distribution distillation (reformulating depth regression as soft classification over discretized bins). These components reduce MAE by 7.97% compared to direct training and achieve competitive accuracy with real-time efficiency on the nuScenes and ZJU-4DRadarCam datasets.
The authors propose investigating the impact of Grad-CAM target selection and alternative attribution targets on distillation interpretability quality and downstream performance.
The paper cites important works in depth estimation, knowledge distillation, and explainable AI, including:
Hinton et al. (2015): Foundational work on knowledge distillation
Selvaraju et al. (2019): Grad-CAM visualization method
Caesar et al. (2020): nuScenes dataset
Multiple recent studies on radar-camera fusion
Overall Assessment: This is a high-quality technical paper making valuable contributions to lightweight multi-modal depth estimation. The methodology is novel, experiments are comprehensive, and practical value is prominent, providing beneficial references for related research and applications.