Class-aware Domain Knowledge Fusion and Fission for Continual Test-Time Adaptation
Zhou, Zhu, Cui et al.
Continual Test-Time Adaptation (CTTA) aims to quickly fine-tune the model during the test phase so that it can adapt to multiple unknown downstream domain distributions without pre-acquiring downstream domain data. To this end, existing advanced CTTA methods mainly reduce the catastrophic forgetting of historical knowledge caused by irregular switching of downstream domain data by restoring the initial model or reusing historical models. However, these methods are usually accompanied by serious insufficient learning of new knowledge and interference from potentially harmful historical knowledge, resulting in severe performance degradation. To this end, we propose a class-aware domain Knowledge Fusion and Fission method for continual test-time adaptation, called KFF, which adaptively expands and merges class-aware domain knowledge in old and new domains according to the test-time data from different domains, where discriminative historical knowledge can be dynamically accumulated. Specifically, considering the huge domain gap within streaming data, a domain Knowledge FIssion (KFI) module is designed to adaptively separate new domain knowledge from a paired class-aware domain prompt pool, alleviating the impact of negative knowledge brought by old domains that are distinct from the current domain. Besides, to avoid the cumulative computation and storage overheads from continuously fissioning new knowledge, a domain Knowledge FUsion (KFU) module is further designed to merge the fissioned new knowledge into the existing knowledge pool with minimal cost, where a greedy knowledge dynamic merging strategy is designed to improve the compatibility of new and old knowledge while keeping the computational efficiency. Extensive experiments on the ImageNet-C dataset verify the effectiveness of our proposed method against other methods.
academic
Class-aware Domain Knowledge Fusion and Fission for Continual Test-Time Adaptation
This paper proposes KFF, a class-aware domain knowledge fusion and fission method for addressing the Continual Test-Time Adaptation (CTTA) problem. The method adaptively separates new domain knowledge through a Knowledge Fission (KFI) module to avoid negative interference from historical domains, and merges the fissioned new knowledge into the existing knowledge pool at minimal cost through a Knowledge Fusion (KFU) module. Experiments on the ImageNet-C dataset demonstrate a 5.1% performance improvement over the state-of-the-art method DPCore.
Continual Test-Time Adaptation (CTTA) aims to enable pre-trained models to rapidly adapt to multiple unknown downstream domain distributions during the test phase without prior access to downstream domain data. This presents a more challenging problem compared to traditional Test-Time Adaptation (TTA).
Proposes KFF Framework: The first class-aware domain knowledge fusion and fission framework capable of dynamically accumulating discriminative historical knowledge
Designs KFI Module: A knowledge fission module that adaptively separates new domain knowledge, reducing negative knowledge interference across domains
Develops KFU Module: A knowledge fusion module that merges knowledge through greedy strategies, balancing effectiveness and efficiency
Achieves State-of-the-Art Performance: Achieves 34.8% error rate on ImageNet-C, a 5.1% improvement over DPCore
Provides Theoretical Analysis: Theoretical guarantees based on well-separated clustering assumptions
Given source domain training data DS={YS,XS} and test data streams from different domain distributions DT={XT}T=1N, the model fθ must process test batches BTj={xt}t=0b online, with the objective of adapting to target domains while maintaining performance on historical domains.
Lemma A.1: The KFI mechanism correctly assigns all batches to prompts of the same cluster
Lemma A.2: The KFU mechanism only fuses prompts within the same cluster
Proposition A.3: The KFF method correctly assigns all batches to prompts of the same cluster
Theoretical analysis guarantees method correctness, with t-SNE visualizations in experiments validating theoretical assumptions.
Source Domain Dependency: Requires access to source domain statistics, presenting challenges in privacy-constrained scenarios
Synthetic Corruptions: Primarily validated on artificially designed corruptions; robustness to real-world distribution shifts requires further verification
Computational Overhead: While relatively efficient, remains challenging on resource-constrained devices
Hyperparameter Sensitivity: Requires careful tuning of key hyperparameters for different datasets
Computer vision tasks requiring continuous adaptation to multiple domain shifts
Edge computing scenarios with parameter efficiency requirements
Applications with access to limited source domain statistics
Structured environments with relatively predictable domain changes
This paper makes important contributions to the CTTA field, effectively addressing domain conflict issues through innovative knowledge fission and fusion mechanisms, achieving significant performance improvements while maintaining computational efficiency. Despite certain limitations, its core ideas and technical innovations provide valuable references for related research.