CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation
Han, Zhang, Zhang et al.
Collaborative perception has been proven to improve individual perception in autonomous driving through multi-agent interaction. Nevertheless, most methods often assume identical encoders for all agents, which does not hold true when these models are deployed in real-world applications. To realize collaborative perception in actual heterogeneous scenarios, existing methods usually align neighbor features to those of the ego vehicle, which is vulnerable to noise from domain gaps and thus fails to address feature discrepancies effectively. Moreover, they adopt transformer-based modules for domain adaptation, which causes the model inference inefficiency on mobile devices. To tackle these issues, we propose CoDS, a Collaborative perception method that leverages Domain Separation to address feature discrepancies in heterogeneous scenarios. The CoDS employs two feature alignment modules, i.e., Lightweight Spatial-Channel Resizer (LSCR) and Distribution Alignment via Domain Separation (DADS). Besides, it utilizes the Domain Alignment Mutual Information (DAMI) loss to ensure effective feature alignment. Specifically, the LSCR aligns the neighbor feature across spatial and channel dimensions using a lightweight convolutional layer. Subsequently, the DADS mitigates feature distribution discrepancy with encoder-specific and encoder-agnostic domain separation modules. The former removes domain-dependent information and the latter captures task-related information. During training, the DAMI loss maximizes the mutual information between aligned heterogeneous features to enhance the domain separation process. The CoDS employs a fully convolutional architecture, which ensures high inference efficiency. Extensive experiments demonstrate that the CoDS effectively mitigates feature discrepancies in heterogeneous scenarios and achieves a trade-off between detection accuracy and inference efficiency.
본 논문은 도메인 분리 기술을 통해 이질적 시나리오에서의 협력 인지 중 특징 차이 문제를 해결하는 CoDS 방법을 제안한다. CoDS는 경량 공간-채널 조정기(LSCR)와 도메인 분리 기반 분포 정렬 모듈(DADS)을 채택하고, 도메인 정렬 상호정보(DAMI) 손실함수를 결합하여 효율적인 이질적 특징 정렬을 구현한다. 본 방법은 완전 합성곱 아키텍처를 채택하여 검출 정확도를 보장하면서 추론 효율을 크게 향상시킨다.
이질적 협력 인지 작업은 다음과 같이 정의된다: N개의 에이전트가 주어질 때, 자차는 이웃 에이전트의 특징을 수신하고 융합한다. 이질적 시나리오에서는 서로 다른 에이전트가 다른 인코더 F^ego_enc 및 F^nei_enc를 사용하므로, 특징 fi와 fj는 차원 및 분포에서 차이가 난다. 목표는 특징 차이를 완화하는 플러그 앤 플레이 적응기를 설계하는 것이다.