Training data membership inference via Gaussian process meta-modeling: a post-hoc analysis approach
Huang, Zhang, Mumtaz
Membership inference attacks (MIAs) test whether a data point was part of a model's training set, posing serious privacy risks. Existing methods often depend on shadow models or heavy query access, which limits their practicality. We propose GP-MIA, an efficient and interpretable approach based on Gaussian process (GP) meta-modeling. Using post-hoc metrics such as accuracy, entropy, dataset statistics, and optional sensitivity features (e.g. gradients, NTK measures) from a single trained model, GP-MIA trains a GP classifier to distinguish members from non-members while providing calibrated uncertainty estimates. Experiments on synthetic data, real-world fraud detection data, CIFAR-10, and WikiText-2 show that GP-MIA achieves high accuracy and generalizability, offering a practical alternative to existing MIAs.
academic
Training data membership inference via Gaussian process meta-modeling: a post-hoc analysis approach
Membership inference attacks (MIAs) test whether data points belong to a model's training set, posing serious privacy risks. Existing methods typically rely on shadow models or extensive query access, limiting their practicality. This paper proposes GP-MIA, an efficient and interpretable method based on Gaussian process (GP) meta-modeling. Using post-hoc metrics from a single trained model (such as accuracy, entropy, dataset statistics, and optional sensitivity features like gradients and NTK measurements), GP-MIA trains a GP classifier to distinguish members from non-members while providing calibrated uncertainty estimates. Experiments on synthetic data, real-world fraud detection data, CIFAR-10, and WikiText-2 demonstrate that GP-MIA achieves high accuracy and generalization capability, offering a practical alternative to existing MIAs.
This research addresses membership inference attacks in machine learning models. Given a trained model f_θ* and a test sample pair (x,y), the objective is to design an inference rule M(f_θ*, x, y) ∈ {0,1} that determines whether the sample belongs to the training set.
Membership inference attacks pose serious privacy threats, particularly in sensitive domains such as healthcare, finance, or security, where merely disclosing whether a personal record was used for training could constitute privacy leakage. Deep neural networks are vulnerable to such attacks because they exhibit systematic behavioral differences between training and unseen data.
Shadow Model Approaches: Require training multiple auxiliary models to simulate target behavior, incurring high computational costs
Likelihood Ratio Attacks (LiRA): Require multiple model queries and substantial computational resources for calibration
Practical Limitations: Existing methods typically demand extensive computational resources, carefully curated auxiliary data, or multiple queries to the target model
This paper proposes an efficient method requiring only post-hoc access to a single trained model, avoiding retraining or internal access, while providing calibrated uncertainty estimates to enhance efficiency and interpretability.
Proposes GP-MIA Framework: A novel post-hoc membership inference attack method based on Gaussian process meta-modeling
Designs Multi-level Feature System: Unified representation including basic features (performance metrics, confidence), gradient features, and NTK features
Enables Efficient Inference: Requires only a single forward pass (optional backward pass), avoiding shadow model training
Provides Uncertainty Quantification: GP classifier naturally provides calibrated probabilistic predictions and uncertainty estimates
Validates Cross-domain Generalization: Verifies effectiveness across four distinct domains: synthetic data, fraud detection, image classification, and language modeling
Given a trained supervised model f_θ*: ℝ^d → ℝ^m, the membership inference task is to design a function M(f_θ*, x, y) that determines whether test sample (x,y) belongs to training set X = {(x_i, y_i)}^n_.
GP-MIA provides a flexible and data-efficient membership inference framework that avoids shadow model overhead in a post-hoc manner while capturing information-rich distributional signals.
Shokri et al. (2017) - Shadow model membership inference attacks
Carlini et al. (2022) - Likelihood ratio attacks (LiRA)
Rasmussen & Williams (2006) - Gaussian process machine learning
Ye et al. (2022) - Enhanced membership inference attacks
Hu et al. (2022) - Survey on membership inference attacks
This paper proposes an innovative membership inference attack method based on Gaussian processes that significantly improves efficiency and practicality while maintaining high accuracy. Despite some theoretical and experimental limitations, its core ideas and experimental results provide valuable contributions to privacy attack research.