Recent advances in generative artificial intelligence applications have raised new data security concerns. This paper focuses on defending diffusion models against membership inference attacks. This type of attack occurs when the attacker can determine if a certain data point was used to train the model. Although diffusion models are intrinsically more resistant to membership inference attacks than other generative models, they are still susceptible. The defense proposed here utilizes critically-damped higher-order Langevin dynamics, which introduces several auxiliary variables and a joint diffusion process along these variables. The idea is that the presence of auxiliary variables mixes external randomness that helps to corrupt sensitive input data earlier on in the diffusion process. This concept is theoretically investigated and validated on a toy dataset and a speech dataset using the Area Under the Receiver Operating Characteristic (AUROC) curves and the FID metric.
è«æID : 2509.14225ã¿ã€ãã« : Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamicsèè
: Benjamin Sterling (ã¹ããŒããŒãã«ãã¯å€§åŠ)ãYousef El-Laham (ã¹ããŒããŒãã«ãã¯å€§åŠ)ãMónica F. Bugallo (ã¹ããŒããŒãã«ãã¯å€§åŠ)åé¡ : cs.LGãstat.MLçºè¡šæ¥æ : 2025幎10æ16æ¥ (arXiv v2)è«æãªã³ã¯ : https://arxiv.org/abs/2509.14225 æ¬è«æã¯ãçæå人工ç¥èœã¢ããªã±ãŒã·ã§ã³ã«ãããŠåºçŸããæ°ããªããŒã¿ã»ãã¥ãªãã£åé¡ã«å¯ŸåŠããæ¡æ£ã¢ãã«ã®ã¡ã³ããŒã·ããæšè«æ»æ(MIA)ããã®é²åŸ¡ã«çŠç¹ãåœãŠãŠãããã¡ã³ããŒã·ããæšè«æ»æãšã¯ãæ»æè
ãç¹å®ã®ããŒã¿ãã€ã³ããã¢ãã«ã®èšç·Žã«äœ¿çšããããã©ãããå€å®ã§ããæ»æã§ãããæ¡æ£ã¢ãã«ã¯ä»ã®çæã¢ãã«ãšæ¯èŒããŠã¡ã³ããŒã·ããæšè«æ»æã«å¯Ÿããå
åšçãªèæ§ã匷ãããäŸç¶ãšããŠè匱æ§ãååšãããæ¬è«æã§ææ¡ãããé²åŸ¡æ¹æ³ã¯ãèšçæžè¡°é«éã©ã³ã°ãžã¥ãã³åååŠ(HOLD++)ãå©çšããè€æ°ã®è£å©å€æ°ãšãããã®å€æ°ã«æ²¿ã£ãçµåæ¡æ£éçšãå°å
¥ããŠãããæ žå¿çãªèãæ¹ã¯ãè£å©å€æ°ã®ååšãå€éšç¢ºçæ§ãæ··åããæ¡æ£éçšã®åææ®µéã§æ©å¯å
¥åããŒã¿ãç Žå£ããã®ã«åœ¹ç«ã€ãšããããšã§ããããã®æŠå¿µã¯çè«çã«ç ç©¶ãããç©å
·ããŒã¿ã»ãããšé³å£°ããŒã¿ã»ããã«ãããŠAUROCæ²ç·ãšFIDææšãçšããŠæ€èšŒãããŠããã
æ¬ç ç©¶ã解決ããæ žå¿çãªåé¡ã¯ã**ã¡ã³ããŒã·ããæšè«æ»æ(Membership Inference Attacks, MIA)**ã«ããæ¡æ£ã¢ãã«ãžã®è
åšã§ãããã¡ã³ããŒã·ããæšè«æ»æã¯ãæ»æè
ãç¹å®ã®ããŒã¿ãµã³ãã«ãç®æšã¢ãã«ã®èšç·Žã«äœ¿çšããããã©ãããå€å®ããããšãããã©ã€ãã·ãŒæ»æã§ããã
ããŒã¿ãã©ã€ãã·ãŒä¿è·ã®å¿
èŠæ§ ïŒçæåAIå¿çšã®æ¥éãªçºå±ãç¹ã«å»çããŒã¿ãæ©å¯ç¥ç財ç£ãªã©ã®é åã§ã®å¿çšã«äŒŽããèšç·ŽããŒã¿ã®ãã©ã€ãã·ãŒä¿è·ã極ããŠéèŠã«ãªã£ãŠããæ¡æ£ã¢ãã«ã®èåŒ±æ§ ïŒæ¡æ£ã¢ãã«ã¯GANãªã©ã®ä»ã®çæã¢ãã«ãšæ¯èŒããŠããåªããå
åšçæ»æèæ§ãæããŠããããããã¯ãã¢æ»æãã¡ã³ããŒã·ããæšè«æ»æãããã³å¯Ÿæçæ»æã«äŸç¶ãšããŠå®¹æã«åããæ¢åé²åŸ¡æ¹æ³ã®éç ïŒçŸåšã®äž»èŠãªé²åŸ¡ææ®µã§ããå·®åãã©ã€ãã·ãŒæ¡æ£ã¢ãã«(DPDM)ã¯ããã©ã€ãã·ãŒ-æçšæ§ãã¬ãŒããªãã®åé¡ãããªãã¡ãã©ã€ãã·ãŒä¿è·æ°Žæºãšçæãµã³ãã«å質ãçŽæ¥çžé¢ããŠããæ¢åã®ã¡ã³ããŒã·ããæšè«æ»æé²åŸ¡ã¯äž»ã«å·®åãã©ã€ãã·ãŒãL2æ£ååãããã³ç¥èèžçãå«ããæ¬è«æã®åæ©ã¯ãçŽæ¥çãªããŒã¿æ¡åŒµãå³å¯ãªå·®åãã©ã€ãã·ãŒå¶çŽãå¿
èŠãšãããæ¡æ£éçšèªäœã®æ§é æ¹åãéããŠãã©ã€ãã·ãŒä¿è·ã匷åããæ°ããé²åŸ¡æŠç¥ãæ¢çŽ¢ããããšã§ããã
èšçæžè¡°é«éã©ã³ã°ãžã¥ãã³åååŠ(HOLD++)ã«åºã¥ãæ°ããé²åŸ¡ãã¬ãŒã ã¯ãŒã¯ãææ¡ ããè£å©å€æ°ã®å°å
¥ãéããŠã¡ã³ããŒã·ããæšè«æ»æãžã®èæ§ã匷åããHOLD++ã®Rényiå·®åãã©ã€ãã·ãŒçè«çä¿èšŒãç¢ºç« ãããã©ã€ãã·ãŒæå€±ãæ¡æ£éçšã®éå§æã«æå€§å€ã«éããæéãšãšãã«åèª¿ã«æžå°ããããšã蚌æããè£å©å€æ°ãšãã©ã€ãã·ãŒä¿è·ã®é¢ä¿ãæããã« ããå¹³åäºä¹èª€å·®ãβãL^(-1)ãããã³nãªã©ã®ãã©ã¡ãŒã¿ã調æŽããããšã§ã調æŽãã§ããããšã蚌æããSwiss Rollããã¡ãããŒã¿ã»ããããã³LJ Speeché³å£°ããŒã¿ã»ããäžã§æ¹æ³ã®æå¹æ§ãæ€èšŒ ããAUROCããã³FIDææšãçšããŠé²åŸ¡å¹æãšçæå質ãè©äŸ¡ããå
¥å ïŒèšç·ŽããŒã¿ã»ããDãæ¡æ£ã¢ãã«ãã©ã¡ãŒã¿
åºå ïŒã¡ã³ããŒã·ããæšè«æ»æã«èæ§ãæã€æ¡æ£ã¢ãã«
å¶çŽ ïŒçæå質ãç¶æããªãããã©ã€ãã·ãŒä¿è·ãæå€§åãã
HOLD++ã®åé²ç¢ºçåŸ®åæ¹çšåŒã¯ä»¥äžã®ããã«å®çŸ©ãããïŒ
ããã§ïŒ
F = Σ(i=1 to n-1) γ_i(E_{i,i+1} - E_{i+1,i}) - ΟE_{n,n} G = â(2ΟL^(-1))E_{n,n} x_0 = (q_0^T, p_0^T, s_0^T, ...)^T åé²éçšã®å¹³åãšå
±åæ£ã¯ä»¥äžã®éãã§ããïŒ
Ό_t = exp(Ft)x_0
Σ_t = L^(-1)I + exp(Ft)(Σ_0 - L^(-1)I)exp(Ft)^T
ãµã³ããªã³ã°ã¯Choleskyåè§£ãéããŠå®è£
ãããïŒ
HOLD++ã«å¯ŸããPIAæ»æææšã¯ä»¥äžã®ããã«ãªãïŒ
R_{t,p} = ||Fx_t - (1/2)GG^T S_Ξ(x_t,t)||_p
è£å©å€æ°å°å
¥ã«ããç¢ºçæ§ã®æ··å ïŒé床ãå é床ãªã©ã®è£å©å€æ°ãå°å
¥ããããšã§ãæ¡æ£éçšã®åææ®µéã«è¿œå ã®ç¢ºçæ§ãå°å
¥ããæ»æè
ãå
ã®ããŒã¿ãæ£ç¢ºã«æšå®ããããšãå°é£ã«ããéæ±ºå®çã¹ã³ã¢é¢æ° ïŒHOLD++ã®ã¹ã³ã¢ãããã¯ãŒã¯ã¯æåŸã®è£å©å€æ°ã®ã¹ã³ã¢ã®ã¿ãã¢ãã«åãããããå®å
šã«æ±ºå®çãªæ»æãäžå¯èœã«ãªãçè«çãã©ã€ãã·ãŒä¿èšŒ ïŒå³å¯ãªRényiå·®åãã©ã€ãã·ãŒåæãæäŸãããã©ã€ãã·ãŒæå€±ã®äžçã蚌æããSwiss RollããŒã¿ã»ãã ïŒäºæ¬¡å
ããã¡ãããŒã¿ã»ãããçè«äºæž¬ã®æ€èšŒã«äœ¿çšLJ SpeechããŒã¿ã»ãã ïŒå®éã®é³å£°ããŒã¿ã»ãããGrad-TTSãçšããŠããã¹ãé³å£°å€æã«äœ¿çšAUROC (Area Under ROC Curve) ïŒã¡ã³ããŒã·ããæšè«æ»æã®æå¹æ§ãè©äŸ¡
1.0ã«è¿ãå€ã¯æ»æãèšç·Ž/éèšç·ŽããŒã¿ãå®ç§ã«åºå¥ã§ããããšã瀺ã 0.5ã«è¿ãå€ã¯æ»æå¹æãã©ã³ãã æšæž¬ãšåçã§ããããšã瀺ã FID (Fréchet Inception Distance) ïŒçæããŒã¿ã®å質ãè©äŸ¡åŸæ¥ã®æ¡æ£ã¢ãã« (n=1) ç°ãªã次æ°ã®HOLD++ (n=2,3,...) ç°ãªã忣å åÎ²ã®æ§æ Swiss Rollå®éšïŒ40,000èšç·Žãšããã¯ã15å±€å
šçµåãããã¯ãŒã¯ãReLU掻æ§åãå±€æ£èŠå LJ Speechå®éšïŒGrad-TTSã¢ãŒããã¯ãã£ã䜿çšãn=2ãŸã§æé«ãã¹ã(ããé«ã次æ°ã®èšç·Žã¯å°é£) 25åã®å®éšãç¹°ãè¿ã95%ä¿¡é ŒåºéãååŸ AUROCã¯ã¢ãã«æ¬¡æ°nãšåæ£å åβã®å¢å ã«äŒŽãèããäœäž β=2ããã³Î²=10ã®95%ä¿¡é Œåºéã¯éè€ããŠããããçµ±èšçæææ§ã瀺ããŠãã 髿¬¡ã¢ãã«(n>1)ã¯åŸæ¥ã®æ¡æ£ã¢ãã«ãšæ¯èŒããŠãã©ã€ãã·ãŒä¿è·ã®é¢ã§æããã«åªããŠãã å®éšçµæã¯ãn=2ãn=1ãšæ¯èŒããŠããåªãããã©ã€ãã·ãŒä¿è·ãšçæå質ãæããããšã瀺ããŠããïŒ
ãšãã㯠FID (n=1) FID (n=2) AUROC (n=1) AUROC (n=2) 30 91.65 77.50 0.503 0.597 60 94.31 62.57 0.686 0.481 90 102.50 65.20 0.869 0.525 180 89.18 57.43 0.949 0.696
ã¢ãã«æ¬¡æ°nã®åœ±é¿ ïŒnãå¢å ããã«ã€ããŠAUROCã¯èããäœäžãããã©ã€ãã·ãŒä¿è·ã匷åããã忣å åβã®åœ±é¿ ïŒãã倧ããβå€ã¯ããåªãããã©ã€ãã·ãŒä¿è·ãæäŸããæéååžåæ ïŒãã©ã€ãã·ãŒè匱æ§ã¯äž»ã«æ¡æ£éçšã®åææ®µéã«éäžããŠããCIFAR-10äžã®äºæããªãçµæ ïŒç»åããŒã¿ã»ããäžã§AUROCã0.5ã«è¿ãå€ã瀺ããé£ç¶æéæ¡æ£ã¢ãã«èªäœãMIAã«å¯Ÿãã匷ãèæ§ãæããŠããããšã瀺åããŠããé³å£°ããŒã¿ã®ç¹æ®æ§ ïŒã¡ã«ã¹ãã¯ããã°ã©ã ã¯ç»åãããããŒã¿æ¡åŒµãå°é£ã§ãããé³å£°ããŒã¿ãMIAæ»æãåãããããªã£ãŠããå質-ãã©ã€ãã·ãŒã®ãã¬ãŒããªã ïŒé«æ¬¡ã¢ãã«ã¯ããåªãããã©ã€ãã·ãŒä¿è·ãæäŸããªãããåæã«ããé«å質ã®çæãµã³ãã«ãçæã§ããSecMI ïŒé¢æ£æ¡æ£ã¢ãã«ã«å¯Ÿããæåã®MIAæ»æPIA (Proximal Initialization Attack) ïŒé£ç¶æéçã®MIAæ»æDPDM ïŒDP-SGDãšé£ç¶æéæ¡æ£ã¢ãã«ãçµã¿åãããå·®åãã©ã€ãã·ãŒæ¹æ³CLD (Critically-damped Langevin Dynamics) ïŒé床è£å©å€æ°ãå°å
¥TOLD (Third-Order Langevin Dynamics) ïŒå éåºŠå€æ°ã远å HOLD++ ïŒèšçæžè¡°é«éã©ã³ã°ãžã¥ãã³åååŠHOLD++ã¯æå¹ãªMIAé²åŸ¡ãæäŸ ïŒè£å©å€æ°å°å
¥ã«ããç¢ºçæ§ãã¡ã³ããŒã·ããæšè«æ»æã®æåçãèããäœäžãããçè«çä¿èšŒãšå®è·µçæ€èšŒã®äžèŽ ïŒRényiå·®åãã©ã€ãã·ãŒåæã¯å®éšçµæãšäžèŽããŠããå質-ãã©ã€ãã·ãŒã®äºéæ¹å ïŒå Žåã«ãã£ãŠã¯ã髿¬¡ã¢ãã«ã¯çæå質ãšãã©ã€ãã·ãŒä¿è·ã®äž¡æ¹ãåæã«æ¹åããèšç·Žè€éæ§ã®å¢å ïŒé«æ¬¡ã¢ãã«ã®èšç·Žã¯ããå°é£ã§ãããç¹ã«è€éãªããŒã¿ã»ããäžã§ã¯é¡èã§ãããã©ã¡ãŒã¿èª¿æŽã®è€éæ§ ïŒã¢ãã«æ¬¡æ°nã忣å åβããã©ã€ãã·ãŒãã©ã¡ãŒã¿Îµ_numã®éã§ãã©ã³ã¹ãåãå¿
èŠãããéå®çãªé«æ¬¡æ€èšŒ ïŒå®éã®ããŒã¿ã»ããäžã§ã¯n=2ãŸã§ã®ã¿æ€èšŒãããããé«ã次æ°ã®å¹æã¯ååã«æ€èšŒãããŠããªãããé«å¹çãªé«æ¬¡ã¢ãã«èšç·Žæ¹æ³ã®æ¢çŽ¢ ä»ã®çš®é¡ã®çæã¢ãã«ãžã®é«éåååŠå¿çšã®ç ç©¶ é©å¿çãã©ã¡ãŒã¿éžææŠç¥ã®éçº çè«ç驿°æ§ã匷ã ïŒé«éã©ã³ã°ãžã¥ãã³åååŠãšãã©ã€ãã·ãŒä¿è·ãå·§åŠã«çµã¿åãããæ°ããçè«çèŠç¹ãæäŸããŠããæ°åŠçåæãå³å¯ ïŒå®å
šãªRényiå·®åãã©ã€ãã·ãŒèšŒæãšãã©ã€ãã·ãŒæå€±äžçåæãæäŸããŠããå®éšèšèšãåçç ïŒããã¡ãããŒã¿ã»ããããå®éã®ããŒã¿ã»ãããžã®æ®µéçæ€èšŒæŠç¥ã¯ç§åŠçã§å¹æçã§ããå®çšäŸ¡å€ãé«ã ïŒåŸæ¥ã®å·®åãã©ã€ãã·ãŒä»¥å€ã®æ°ããé²åŸ¡ææ³ãæäŸããŠããå®éšèŠæš¡ãéå®ç ïŒ2ã€ã®ããŒã¿ã»ããäžã®ã¿ã§æ€èšŒãããå€§èŠæš¡ããŒã¿ã»ããäžã®å®éšãäžè¶³ããŠããèšç®ãªãŒããŒãããåæã®æ¬ èœ ïŒé«æ¬¡ã¢ãã«ããããã远å ã®èšç®ã³ã¹ãã«ã€ããŠè©³çްã«åæãããŠããªãä»ã®é²åŸ¡æ¹æ³ãšã®æ¯èŒãäžåå ïŒäž»ã«åŸæ¥ã®æ¡æ£ã¢ãã«ãšã®æ¯èŒã§ãããDPDMãªã©ã®æ¹æ³ãšã®çŽæ¥çãªæ¯èŒãäžè¶³ããŠãããã©ã¡ãŒã¿æåºŠåæãäžåå ïŒäž»èŠãªãã€ããŒãã©ã¡ãŒã¿ã®éžæã«é¢ããã¬ã€ãã³ã¹ãäžæç¢ºã§ããåŠè¡çè²¢ç® ïŒæ¡æ£ã¢ãã«ã®ãã©ã€ãã·ãŒä¿è·ã«æ°ããçè«çãã¬ãŒã ã¯ãŒã¯ãšå®è·µçæ¹æ³ãæäŸããŠããå®çšäŸ¡å€ ïŒå»çãéèãªã©ã®æ©å¯ããŒã¿é åã«ãããéèŠãªå¿çšå¯èœæ§ãæããŠããåçŸæ§ ïŒèè
ããªãŒãã³ãœãŒã¹ã³ãŒããæäŸããŠãããç ç©¶ã®åçŸãšæ¡åŒµã容æã§ããæ©å¯ããŒã¿çæ ïŒå»çç»åãé³å£°åæãªã©ããã©ã€ãã·ãŒãå«ãçæã¿ã¹ã¯ãã§ãã¬ãŒãããåŠç¿ç°å¢ ïŒããŒã¿ãã©ã€ãã·ãŒãä¿è·ããªããå調èšç·Žãè¡ãå¿
èŠãããå Žåç£æ¥å¿çš ïŒç¥ç財ç£ä¿è·ã«å³å¯ãªèŠä»¶ãããçæã¢ãã«ã®å±éæ¬è«æã¯ãæ¡æ£ã¢ãã«ã®åºç€çè«ãã¡ã³ããŒã·ããæšè«æ»ææ¹æ³ãå·®åãã©ã€ãã·ãŒæè¡ãããã³é«éã©ã³ã°ãžã¥ãã³åååŠãªã©ãäž»èŠé åã®ä»£è¡šçãª17ç¯ã®éèŠæç®ãåŒçšããŠãããç ç©¶ã«å
å®ãªçè«çåºç€ãæäŸããŠããã
ç·åè©äŸ¡ ïŒããã¯æ¡æ£ã¢ãã«ã®ãã©ã€ãã·ãŒä¿è·é åã«ãããŠéèŠãªé©æ°çæçŸ©ãæã€è«æã§ãããé«éã©ã³ã°ãžã¥ãã³åååŠãšã¡ã³ããŒã·ããæšè«æ»æé²åŸ¡ãçµã¿åãããããšã§ãæ°èŠã§å¹æçãªãœãªã¥ãŒã·ã§ã³ãæäŸããŠãããå®éšèŠæš¡ãšæè¡ç詳现ã®é¢ã§ãŸã æ¹åã®äœå°ããããããã®çè«çè²¢ç®ãšå®çšäŸ¡å€ã«ãããæ¬é åã®éèŠãªé²å±ãšãªã£ãŠããã