2025-11-13T09:49:10.938500

MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification

Park, Ji, Park et al.

Continual Learning (CL) for malware classification tackles the rapidly evolving nature of malware threats and the frequent emergence of new types. Generative Replay (GR)-based CL systems utilize a generative model to produce synthetic versions of past data, which are then combined with new data to retrain the primary model. Traditional machine learning techniques in this domain often struggle with catastrophic forgetting, where a model's performance on old data degrades over time. In this paper, we introduce a GR-based CL system that employs Generative Adversarial Networks (GANs) with feature matching loss to generate high-quality malware samples. Additionally, we implement innovative selection schemes for replay samples based on the model's hidden representations. Our comprehensive evaluation across Windows and Android malware datasets in a class-incremental learning scenario -- where new classes are introduced continuously over multiple tasks -- demonstrates substantial performance improvements over previous methods. For example, our system achieves an average accuracy of 55% on Windows malware samples, significantly outperforming other GR-based models by 28%. This study provides practical insights for advancing GR-based malware classification systems. The implementation is available at \url {https://github.com/MalwareReplayGAN/MalCL}\footnote{The code will be made public upon the presentation of the paper}.

academic

MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification

Basic Information

Paper ID: 2501.01110
Title: MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification
Authors: Jimin Park¹, AHyun Ji¹, Minji Park¹, Mohammad Saidur Rahman², Se Eun Oh¹*
Affiliations: ¹Ewha Womans University, ²University of Texas at El Paso
Categories: cs.CR (Cryptography and Security), cs.AI (Artificial Intelligence)
Publication Date: January 2, 2025 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2501.01110

Abstract

This paper proposes MalCL, a system addressing the continual learning problem in malware classification. The system employs a Generative Adversarial Network (GAN)-based generative replay approach that generates high-quality malware samples through feature matching loss and implements an innovative sample selection mechanism based on model hidden representations. In class-incremental learning scenarios on Windows and Android malware datasets, the system demonstrates significant performance improvements, achieving 55% mean accuracy on Windows malware samples, representing a 28% improvement over other generative replay-based models.

Research Background and Motivation

Core Problem

The primary challenge in malware classification is the phenomenon of catastrophic forgetting. When machine learning models undergo continuous training on new data, their performance on previously learned data significantly degrades. This is particularly severe in the malware domain because:

Rapid malware evolution: The AV-TEST Institute records 450,000 new malware and potentially unwanted applications (PUA) daily
VirusTotal processes over 1 million software submissions daily
Dilemma for antivirus companies: Either remove old samples (risking resurgence of legacy malware) or ignore new samples (missing emerging threats)

Threat Model

The paper defines a concrete threat scenario where attackers exploit legacy malware to evade machine learning systems updated only with new data. As the time gap between original training and attack increases, the likelihood of successful evasion increases.

Limitations of Existing Approaches

Traditional machine learning methods: Fail to effectively address catastrophic forgetting
Continual learning methods from computer vision: Perform poorly when directly applied to malware classification, sometimes underperforming the "None" baseline
Storage constraints: Historical data storage is limited due to privacy regulations

Core Contributions

Malware-domain-specific continual learning model: Proposes MalCL, achieving 55% mean accuracy across 11 continual learning tasks on 100 malware families, representing a 28% improvement over existing methods
Improved feature matching generative replay: Employs GAN generator combined with feature matching loss to reduce feature discrepancies between original and synthetic samples
Innovative replay sample selection mechanism: Develops multiple selection strategies based on intermediate layer features in the classifier to improve alignment between generated and original data
Strategic task set construction: Explores the strategy of assigning large categories to initial tasks, effectively mitigating catastrophic forgetting

Methodology Details

Task Definition

MalCL addresses the class-incremental learning problem:

Input: Sequence of malware feature vectors
Output: Malware family classification
Constraint: Each task introduces new malware categories while maintaining recognition ability for historical categories

Model Architecture

Overall Pipeline

MalCL comprises two core components:

GAN module: Generates synthetic malware samples from previous tasks
Classifier module: Performs malware family classification

GAN Architecture Design

Generator:

4 1D convolutional layers + 2 fully connected layers + 3 transposed convolutional layers
ReLU activation and batch normalization on all layers except the last
Sigmoid activation on output layer

Discriminator:

2 convolutional layers + 2 fully connected layers
Flattening logic from the second convolutional layer for feature matching
Sigmoid layer for final output

Classifier Architecture

3 convolutional layers + 1 fully connected layer
Max pooling and dropout following the first two convolutional layers
Flattening logic from the third convolutional layer for replay sample selection
Softmax on output layer

Technical Innovations

1. Feature Matching Loss

Traditional binary cross-entropy loss:

LG = -1/m ∑(i=1 to m) log(D(G(zi)))

Feature matching loss:

LG = 1/m ∑(i=1 to m) ||Ex~pdata[D(f)(x)] - Ez~pz[D(f)(G(z))]||

Where D(f)(·) denotes the intermediate layer output of the discriminator. This loss function focuses on richer intermediate features rather than final outputs.

2. Replay Sample Selection Strategies

Selection based on L2 distance to one-hot labels:

Sc,k = {s | argmin √((Ci(s) - yc)²), ∀s ∈ S, yc ∈ Y}

Selection based on L1 distance to logistic regression:

By batch mean: Selects samples close to the global mean
By class mean: Selects k samples per class closest to the class center

Sc,k = {s | argmin |Li(s) - L̄i,c(xj)|, ∀s ∈ S, ∀x ∈ Xc}

Experimental Setup

Datasets

EMBER Dataset:
- 337,035 malicious Windows PE files
- 100 malware families with >400 samples each
- Features include file size, PE and COFF header information, DLL features, etc.
AZ-Class Dataset:
- 285,582 Android malware samples
- 100 malware families with ≥200 samples each
- Uses Drebin features across 8 categories (hardware access, permissions, API calls, etc.)
- Final feature dimensionality: 2,439

Evaluation Metrics

Mean Accuracy: Average classification accuracy across all tasks
Minimum Accuracy: Lowest accuracy among all tasks

Comparison Methods

Baseline methods:
- None: Training only on new data (lower bound of catastrophic forgetting)
- Joint: Training with all historical data (ideal upper bound)
Prior work:
- GR (Generative Replay): GAN-based generative replay
- BI-R (Brain-Inspired Replay): Enhanced generative replay using VAE

Implementation Details

Task construction: First task with 50 classes, subsequent tasks adding 5 classes each, totaling 11 tasks
Optimizers: Adam for GAN, SGD for classifier
Batch size: 256
Learning rates: Classifier 1e-3, momentum 0.9, weight decay 1e-7

Experimental Results

Main Results

Method	EMBER Dataset
	Mean	Min
None (baseline)	27.5%	0.6%
Joint (upper bound)	88.7%	74.5%
GR	26.8%	9.5%
BI-R	27.0%	9.2%
MalCL (best)	54.5%	21.8%

Key Findings:

MalCL improves over None baseline by 27%
Improves over existing generative replay methods (GR, BI-R) by 28%
Best configuration: FML + L1 distance to class mean logistic regression

Ablation Studies

Loss Function Comparison

Feature Matching Loss vs Binary Cross-Entropy: FML outperforms BCE across all selection strategies
L1 distance to class mean logistic regression: Best performance at 55% accuracy vs 50% for L2 distance to labels

Task Set Construction Strategy

Strategic task construction: Assigning large categories to initial tasks

First task: 50 "giant" classes (average 5,397 samples)
Subsequent tasks: Randomly selected small classes (average 670 samples)
Result: Accuracy improves to 74%, approaching Joint baseline performance

Cross-Dataset Validation

AZ-Class Dataset Results:

Superior performance compared to EMBER dataset
Narrower accuracy range, indicating greater stability
Reason: EMBER dataset exhibits more severe class imbalance

Continual Learning Method Classification

Replay techniques:
- Exact replay: Experience Replay (ER), iCaRL
- Generative replay: GR, BI-R, DDGR
Regularization methods:
- Elastic Weight Consolidation (EWC)
- Synaptic Intelligence (SI)

Continual Learning in Malware Classification

Rahman et al. first explored continual learning in malware classification
Chen et al. combined contrastive learning and active learning
Existing methods primarily focus on concept drift rather than catastrophic forgetting

Conclusions and Discussion

Main Conclusions

MalCL effectively mitigates catastrophic forgetting in malware classification
Feature matching loss significantly improves synthetic sample quality
Sample selection strategies based on classifier hidden layers are highly effective
Strategic task construction is crucial for performance improvement

Limitations

Significant gap remains with Joint baseline: MalCL best 54.5% vs Joint 88.7%
Global selection strategies underperform: L1 distance to batch mean strategy produces insufficient class coverage
Sensitivity to dataset imbalance: EMBER dataset imbalance characteristics impact performance

Future Directions

Improve synthetic malware generation quality
Develop more advanced generative models
Explore hybrid training methods combining generative replay and joint training advantages
Extend to additional malware types
Integrate more complex features reflecting dynamic malware threat characteristics

In-Depth Evaluation

Strengths

Highly targeted problem: Specifically addresses catastrophic forgetting in the malware domain
Methodological innovation: Combines feature matching loss with diversified sample selection strategies
Comprehensive experiments: Cross-platform (Windows/Android) validation with multiple comparison methods
High practical value: Addresses important problems in real-world security protection
Sufficient technical detail: Provides complete architecture design and implementation details

Limitations

Substantial room for performance improvement: 33% gap remains with ideal upper bound
Missing computational overhead analysis: Lacks detailed analysis of GAN training and generation computational costs
Adversarial robustness: Does not consider robustness of generated samples against adversarial attacks
Feature engineering dependency: Relies on predefined features, potentially limiting generalization capability
Long-term performance evaluation: Lacks performance assessment over longer time sequences

Impact

Academic contribution: First systematic application of GAN-based generative replay to malware classification
Practical value: Provides practical continual learning solutions for antivirus companies
Method generalizability: Technical framework extensible to other security detection domains
Open-source commitment: Promises to release code, facilitating research reproduction and development

Applicable Scenarios

Enterprise security protection: Malware detection systems requiring continuous updates
Resource-constrained environments: Scenarios where large-scale historical data storage is infeasible
Privacy-sensitive applications: Environments where original malware samples cannot be retained
Real-time detection systems: Online detection systems requiring rapid adaptation to emerging threats

References

This paper cites important works from continual learning, malware detection, and generative adversarial networks, including:

Shin et al. (2017): Continual learning with deep generative replay
Rahman, Coull, and Wright (2022): First exploration of continual learning in malware classification
Anderson and Roth (2018): EMBER dataset
Arp et al. (2014): Drebin feature extraction methodology

Overall Assessment: This paper proposes an innovative solution to catastrophic forgetting in malware classification, with adequate technical methodology and experimental validation. While performance improvement potential remains, it makes important contributions to both research and practical applications in this domain.