CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation
Han, Zhang, Zhang et al.
Collaborative perception has been shown to improve individual perception in autonomous driving through multi-agent interaction. Nevertheless, most methods assume identical encoders for all agents, which does not hold when these models are deployed in real-world applications. To realize collaborative perception in actual heterogeneous scenarios, existing methods usually align neighbor features to those of the ego vehicle, an approach that is vulnerable to noise from domain gaps and thus fails to address feature discrepancies effectively. Moreover, they adopt transformer-based modules for domain adaptation, which makes model inference inefficient on mobile devices. To tackle these issues, we propose CoDS, a Collaborative perception method that leverages Domain Separation to address feature discrepancies in heterogeneous scenarios. CoDS employs two feature alignment modules, i.e., the Lightweight Spatial-Channel Resizer (LSCR) and Distribution Alignment via Domain Separation (DADS). In addition, it uses the Domain Alignment Mutual Information (DAMI) loss to ensure effective feature alignment. Specifically, the LSCR aligns the neighbor feature across spatial and channel dimensions using a lightweight convolutional layer. Subsequently, the DADS mitigates feature distribution discrepancy with encoder-specific and encoder-agnostic domain separation modules. The former removes domain-dependent information and the latter captures task-related information. During training, the DAMI loss maximizes the mutual information between aligned heterogeneous features to enhance the domain separation process. CoDS employs a fully convolutional architecture, which ensures high inference efficiency. Extensive experiments demonstrate that CoDS effectively mitigates feature discrepancies in heterogeneous scenarios and achieves a trade-off between detection accuracy and inference efficiency.
This paper proposes CoDS, a method that uses domain separation to address feature discrepancies in collaborative perception under heterogeneous scenarios. CoDS combines a Lightweight Spatial-Channel Resizer (LSCR) and a Distribution Alignment via Domain Separation (DADS) module with the Domain Alignment Mutual Information (DAMI) loss to align heterogeneous features efficiently. The method adopts a fully convolutional architecture, significantly improving inference efficiency while maintaining detection accuracy.
Existing collaborative perception methods generally assume that all agents use identical encoders. In practical deployment, however, different vehicles and roadside units are typically equipped with different hardware and software configurations, so their extracted features differ in both dimensionality and distribution.
The method builds on the shared-representation assumption from cognitive science and neuroscience: information shared across multiple viewpoints is the most valuable for collaborative perception, while encoder-specific information hinders effective fusion.
Proposes the CoDS method: the first domain separation-based collaborative perception adapter, which addresses heterogeneous feature discrepancies by separating domain-related information from domain-agnostic information.
The heterogeneous collaborative perception task is defined as follows: given N agents, the ego vehicle receives and fuses features from neighboring agents. In heterogeneous scenarios, different agents use different encoders, F^ego_enc and F^nei_enc, so the features f_i and f_j differ in both dimensionality and distribution. The objective is to design a plug-and-play adapter that mitigates these feature discrepancies.
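The dimensional part of this mismatch can be illustrated with a minimal NumPy sketch. It is not the paper's LSCR implementation: the summary only states that LSCR uses a lightweight convolutional layer, so the nearest-neighbour resize, the 1x1 convolution, the feature shapes, and the function names below are all assumptions chosen for illustration, and the weights are random rather than learned.

```python
import numpy as np

def resize_nearest(feat, out_h, out_w):
    """Nearest-neighbour spatial resize of a (C, H, W) feature map."""
    _, h, w = feat.shape
    rows = np.arange(out_h) * h // out_h  # source row for each output row
    cols = np.arange(out_w) * w // out_w  # source column for each output column
    return feat[:, rows[:, None], cols[None, :]]

def pointwise_conv(feat, weight, bias):
    """1x1 convolution: mixes channels independently at every location."""
    return np.einsum("oc,chw->ohw", weight, feat) + bias[:, None, None]

def adapt_neighbor(feat_nei, ego_shape, weight, bias):
    """Map a neighbor feature map onto the ego feature's (C, H, W) shape."""
    _, h_ego, w_ego = ego_shape
    resized = resize_nearest(feat_nei, h_ego, w_ego)
    return pointwise_conv(resized, weight, bias)

rng = np.random.default_rng(0)
f_ego = rng.normal(size=(64, 32, 32))    # hypothetical ego feature
f_nei = rng.normal(size=(128, 16, 16))   # hypothetical neighbor feature
weight = rng.normal(scale=0.05, size=(64, 128))  # would be learned in practice
bias = np.zeros(64)

f_nei_aligned = adapt_neighbor(f_nei, f_ego.shape, weight, bias)
print(f_nei_aligned.shape)  # (64, 32, 32)
```

Matching the shapes this way only removes the dimensional gap. The distributional gap between the two encoders remains, which is the part the summary attributes to DADS and the DAMI loss.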
A discriminator distinguishes positive sample pairs (aligned features from the same scenario) from negative sample pairs (aligned features from different scenarios).
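A discriminator trained on positive and negative pairs is a common way to estimate a lower bound on mutual information, and this presumably serves the DAMI loss. The sketch below uses a Jensen-Shannon style objective with a bilinear discriminator. Both choices are assumptions, as are the pooled (B, D) feature layout and all function names; the summary does not specify the estimator or the discriminator architecture.

```python
import numpy as np

def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return np.logaddexp(0.0, x)

def discriminator_scores(a, b, w):
    """Bilinear scores s[i, j] = a_i^T W b_j for every pair in the batch."""
    return a @ w @ b.T

def mi_discriminator_loss(a, b, w):
    """Binary cross-entropy over positive and negative pairs.

    a, b: (B, D) aligned features from two agents, where row i of each is
    assumed to come from the same scenario (positive pair). Every other
    (i, j) combination is treated as a negative pair. Minimising this loss
    maximises a Jensen-Shannon lower bound on mutual information.
    """
    scores = discriminator_scores(a, b, w)
    pos_mask = np.eye(scores.shape[0], dtype=bool)
    pos = scores[pos_mask]    # same-scenario pairs, target label 1
    neg = scores[~pos_mask]   # cross-scenario pairs, target label 0
    return softplus(-pos).mean() + softplus(neg).mean()

rng = np.random.default_rng(0)
a = rng.normal(size=(8, 64))
b = a + 0.1 * rng.normal(size=(8, 64))  # well-aligned counterpart of a
w = np.eye(64)                          # would be learned in practice

loss_matched = mi_discriminator_loss(a, b, w)
loss_mismatched = mi_discriminator_loss(a, np.roll(b, 1, axis=0), w)
print(loss_matched < loss_mismatched)
```

With correctly paired rows the loss is lower than with the pairing rolled by one position, which is the signal a training loop would use to pull same-scenario features together.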
Feature visualization shows that heterogeneous features processed by CoDS are semantically more similar, with both highlighting target regions, which supports the effectiveness of domain separation.
Detection result visualization demonstrates that CoDS significantly reduces missed detections compared to other methods, achieving superior detection performance.
The paper cites 65 references covering important work in collaborative perception, domain adaptation, mutual information estimation, and related fields, reflecting a comprehensive literature review.
Overall Assessment: This is a high-quality collaborative perception paper that proposes an innovative solution to the important and practical problem of heterogeneous scenarios. The method design is ingenious, experimental validation is comprehensive, and it possesses strong theoretical significance and practical value.