Continual learning the ability of a neural network to learn multiple sequential tasks without catastrophic forgetting remains a central challenge in developing adaptive artificial intelligence systems. While deep learning models achieve state-of-the-art performance across domains, they remain limited by overfitting and forgetting. This paper introduces Cluster-Aware Replay (CAR), a hybrid continual learning framework that integrates a small, class-balanced replay buffer with a regularization term based on Inter-Cluster Fitness (ICF) in the feature space. The ICF loss penalizes overlapping feature representations between new and previously learned tasks, encouraging geometric separation in the latent space and reducing interference. Using the standard five-task Split CIFAR-10 benchmark with a ResNet-18 backbone, initial experiments demonstrate that CAR better preserves earlier task performance compared to fine-tuning alone. These findings are preliminary but highlight feature-space regularization as a promising direction for mitigating catastrophic forgetting.
Continual learning—the ability of neural networks to learn multiple sequential tasks without experiencing catastrophic forgetting—remains a core challenge in developing adaptive artificial intelligence systems. While deep learning models have achieved state-of-the-art performance across various domains, they remain limited by overfitting and forgetting. This paper introduces Clustering-Aware Replay (CAR), a hybrid continual learning framework that combines a small, class-balanced replay buffer with a regularization term based on Inter-Cluster Fitness (ICF) in feature space. The ICF loss penalizes overlapping feature representations between new and previously learned tasks, encouraging geometric separation in the latent space and reducing interference.
This research addresses the catastrophic forgetting problem in neural networks, wherein models rapidly lose previously acquired knowledge when learning new tasks. This contrasts sharply with biological intelligence, where the human brain can continuously learn without forgetting prior skills.
Practical Application Demands: Real-world AI systems must learn new tasks at different time points, such as recommendation systems adapting to changing user preferences
The authors argue that existing methods primarily focus on parameter or output-level constraints, with insufficient attention to the geometric structure of feature spaces within models. Explicitly controlling feature space separation between tasks may be an effective pathway to mitigating catastrophic forgetting.
Given a task sequence T=(T1,...,TN), the objective is to ensure the model maintains good performance on all previous tasks Ti (where i<N) after learning task TN.
Feature Space Geometric Constraints: Unlike traditional methods focusing on parameters or logits, CAR directly imposes geometric constraints in feature space
This work relates to centroid distance distillation by Liu et al. and linear separability preservation by Gu et al., but CAR provides a different perspective through explicit maximization of inter-cluster separation.
The paper cites important works in continual learning, including:
Kirkpatrick et al. (2017): EWC method
Rebuffi et al. (2017): iCaRL method
Lopez-Paz & Ranzato (2017): GEM method
Liu et al. (2023): Centroid distance distillation
Gu et al. (2023): Linear separability preservation
Overall Assessment: This is an exploratory research work proposing a novel approach to continual learning from the perspective of feature space geometry. While current experimental results are not sufficiently prominent, it provides valuable research directions for the field. The authors honestly acknowledge method limitations and propose clear improvement directions, demonstrating commendable academic integrity.