Spectral clustering is a fundamental method for graph partitioning, but its reliance on eigenvector computation limits scalability to massive graphs. Classical sparsification methods preserve spectral properties by sampling edges proportionally to their effective resistances, but require expensive preprocessing to estimate these resistances. We study whether uniform edge sampling-a simple, structure-agnostic strategy-can suffice for spectral clustering. Our main result shows that for graphs admitting a well-separated $k$-clustering, characterized by a large structure ratio $ÃÂ¥(k) = û_{k+1} / ÃÂ_G(k)$, uniform sampling preserves the spectral subspace used for clustering. Specifically, we prove that uniformly sampling $O(ó^2 n \log n / õ^2)$ edges, where $ó$ is the Laplacian condition number, yields a sparsifier whose top $(n-k)$-dimensional eigenspace is approximately orthogonal to the cluster indicators. This ensures that the spectral embedding remains faithful, and clustering quality is preserved. Our analysis introduces new resistance bounds for intra-cluster edges, a rank-$(n-k)$ effective resistance formulation, and a matrix Chernoff bound adapted to the dominant eigenspace. These tools allow us to bypass importance sampling entirely. Conceptually, our result connects recent coreset-based clustering theory to spectral sparsification, showing that under strong clusterability, even uniform sampling is structure-aware. This provides the first provable guarantee that uniform edge sampling suffices for structure-preserving spectral clustering.
è«æID : 2510.12669ã¿ã€ãã« : Structure-Aware Spectral Sparsification via Uniform Edge Samplingèè
: Kaiwen He (ããã¥ãŒå€§åŠ), Petros Drineas (ããã¥ãŒå€§åŠ), Rajiv Khanna (ããã¥ãŒå€§åŠ)åé¡ : cs.LG cs.DSçºè¡šäŒè° : 39th Conference on Neural Information Processing Systems (NeurIPS 2025)è«æãªã³ã¯ : https://arxiv.org/abs/2510.12669 ã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã¯ã°ã©ãåå²ã®åºç€çææ³ã§ããããåºæãã¯ãã«èšç®ãžã®äŸåæ§ã«ããå€§èŠæš¡ã°ã©ãäžã®ã¹ã±ãŒã©ããªãã£ãå¶éãããŠãããå€å
žçãªçåææ³ã¯æå¹æµææ¯äŸã«ãããµã³ããªã³ã°ã§ã¹ãã¯ãã«ç¹æ§ãä¿æãããããããã®æµæãæšå®ããããã®é«ã³ã¹ããªååŠçãå¿
èŠã§ãããæ¬è«æã§ã¯ãåçŽãªåäžèŸºãµã³ããªã³ã°æŠç¥ãã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã«ååã§ããããæ€èšãããäž»èŠãªçµæã¯ãè¯å¥œã«åé¢ãããkåã¯ã©ã¹ã¿ãæã€ã°ã©ãïŒå€§ããªæ§é æ¯Î¥(k) = λk+1/ÏG(k)ã§ç¹åŸŽä»ããããïŒã«å¯ŸããŠãåäžãµã³ããªã³ã°ãã¯ã©ã¹ã¿ãªã³ã°çšã®ã¹ãã¯ãã«éšå空éãä¿æããããšã瀺ããŠãããå
·äœçã«ã¯ãåäžãµã³ããªã³ã°O(γ²n log n/ε²)æ¬ã®èŸºïŒÎ³ã¯ã©ãã©ã·ã¢ã³æ¡ä»¶æ°ïŒã«ãããçåã°ã©ããåŸããããã®äžäœ(n-k)次å
åºæç©ºéãã¯ã©ã¹ã¿ãªã³ã°æç€ºãã¯ãã«ãšã»ãŒçŽäº€ããã¹ãã¯ãã«åã蟌ã¿ã®å¿ 宿§ãä¿èšŒãããã¯ã©ã¹ã¿ãªã³ã°å質ãç¶æãããã
ã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã¯ã°ã©ãå
ã®ç€ŸäŒæ§é ãçºèŠããåºç€çææ³ã§ããããå€§èŠæš¡ã°ã©ãåŠçæã«èšç®ããã«ããã¯ã«çŽé¢ããŠãããäž»ãªèª²é¡ã¯ä»¥äžã®éãã§ããïŒ
èšç®è€éæ§ ïŒã°ã©ãã©ãã©ã·ã¢ã³è¡åã®åºæãã¯ãã«èšç®ã¯å€§èŠæš¡ã°ã©ãäžã§èšç®ã³ã¹ããæ¥µããŠé«ãååŠçãªãŒããŒããã ïŒå€å
žçãªã¹ãã¯ãã«çåææ³ã¯æå¹æµæã®èšç®ãå¿
èŠãšããããèªäœãé«ã³ã¹ããªããã»ã¹ã§ããã¹ã±ãŒã©ããªãã£å¶é ïŒæ¢åææ³ã¯çŸäžèŠæš¡ã®ããŒããšèŸºãæã€ã°ã©ãã®åŠçãå°é£ã§ããèè
ã¯éèŠãªåé¡ãæèµ·ããïŒã©ã®ãããªæ¡ä»¶äžã§ãåçŽãªåäžèŸºãµã³ããªã³ã°ïŒãããªãéãååŠçãäžèŠïŒãã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã«å¿
èŠãªæ§é ãä¿æããã®ã«ååã§ãããïŒ
çŽæçã«ã¯ãã°ã©ãã«äžè²«ããã¯ã©ã¹ã¿ãªã³ã°æ§é ãååšããå Žåãæšæºçãªæå¹æµæããŒã¹ã®ãµã³ãã©ãŒã¯éå°ã§ããå¯èœæ§ããããæ¥µç«¯ãªå ŽåãåæãããŠãããäžè²«ããã¯ã©ã¹ã¿ãååšããå Žåãåäžãµã³ããªã³ã°ã¯æããã«ã¯ã©ã¹ã¿ãªã³ã°æ§é ãä¿æããã®ã«ååã§ããã
æå¹æµæãµã³ããªã³ã° ïŒé«å質ã®ã¹ãã¯ãã«çååšãçæã§ããããæå¹æµæã®æšå®ã«ã¯å€§èŠæš¡ã©ãã©ã·ã¢ã³ç·åœ¢ã·ã¹ãã ã®æ±è§£ãå¿
èŠã§ããèšç®ãªãŒããŒããã ïŒååŠçã®ã³ã¹ããçåã«ããèšç®å©åŸãçžæ®ºããå¯èœæ§ãããæ§é ç¡èŠ ïŒæ¢åææ³ã¯ã°ã©ãã®ã¯ã©ã¹ã¿ãªã³ã°æ§é æ
å ±ãååã«æŽ»çšããŠããªãæ§é èªèçåä¿èšŒ ïŒæšæºçãªã¯ã©ã¹ã¿ãªã³ã°å¯èœæ§ä»®èª¬ã®äžã§ãåäžãµã³ããªã³ã°ãã¯ã©ã¹ã¿ãªã³ã°æ§é ãä¿æããã¹ãã¯ãã«çååšãçæããããšã蚌æããã¯ã©ã¹ã¿å
èŸºã®æµæçé ïŒã¯ã©ã¹ã¿ãªã³ã°ã°ã©ãå
ã®èŸºã®æå¹æµæã«å¯Ÿããæ°ããçéãå°åºãã匷ãã¯ã©ã¹ã¿ãªã³ã°æ§é ãããã«èŸºã®ãã¹ãã¯ãã«å質ããå¶çŽããããå®éåããåºæç©ºéè¡åChernoffåæ ïŒäžäœ(n-k)åºæãã¯ãã«éšå空éã«å¯Ÿããè¡åChernofféäžè«èšŒãå°å
¥ããçè«çé£çµ ïŒæè¿ã®ã³ã¢ã»ããåºç€ã¯ã©ã¹ã¿ãªã³ã°çè«ãšã¹ãã¯ãã«çåãé£çµããå
¥å ïŒç¡åã°ã©ãG = (V,E)ãç®æšã¯ã©ã¹ã¿æ°k
åºå ïŒçåã°ã©ãGÌãå
ã®ã°ã©ãã®k-è·¯ã¯ã©ã¹ã¿ãªã³ã°æ§é ãä¿æ
ç®æš ïŒåäžèŸºãµã³ããªã³ã°ãçšããŠã¹ãã¯ãã«ä¿æã°ã©ãçåãå®çŸãã
æ§é æ¯Î¥(k) = λk+1/ÏG(k)ãå®çŸ©ãããããã§ïŒ
λk+1ïŒæ£èŠåã©ãã©ã·ã¢ã³è¡åã®ç¬¬(k+1)çªç®ã®åºæå€ ÏG(k)ïŒã°ã©ãã®k-è·¯æ¡åŒµå®æ° 倧ããªÎ¥(k)ã¯ã°ã©ããæç¢ºãªk-ã¯ã©ã¹ã¿ãªã³ã°æ§é ãæã€ããšã瀺ãã
å®çŸ©4.4 ïŒã°ã©ãGãäžããããL = VΣV^Tã鿣èŠåã©ãã©ã·ã¢ã³è¡åãšãããšãã以äžãå®çŸ©ããïŒ
Ln-k := Σ(i=k+1 to n) λi vi vi^T
Rn-k_eff(a,b) := âšÎŽa - ÎŽb, L+n-k(ÎŽa - ÎŽb)â©
æ§é å®çãæºããç¡éã¿ã°ã©ãGã«å¯ŸããŠãO(κ²n log(n)/ε²)æ¬ã®èŸºãåäžãµã³ããªã³ã°ããå ŽåïŒÎº = λn/λk+1ã¯rank(n-k)æ¡ä»¶æ°ïŒãåŸãããçåã©ãã©ã·ã¢ã³è¡åLÌã¯ä»¥äžãæºããïŒ
âṌn-k Ṍ^T n-k Câ²F †k(1/Î¥(k) + ε/(1-ε) κ)
ããã§á¹Œn-kã¯LÌã®äžäœn-kåã®åºæãã¯ãã«è¡åã§ããã
åäžã¯ã©ã¹ã¿å
ã®é ç¹å¯Ÿ{a,b}ã«å¯ŸããŠããã®rank-(n-k)æå¹æµæã¯ä»¥äžãæºããïŒ
2/λk+1 ⥠R^n-k_eff(a,b) ⥠(1/κ)(1-k/Υ(k)) · 2/λk+1
è¯å¥œãªã¯ã©ã¹ã¿ãªã³ã°ä»®èª¬ã®äžã§ãã¬ãã¬ããžã¹ã³ã¢ç¢ºçååžpeãšåäžååžpunifã¯ä»¥äžãæºããïŒ
(1-k/Î¥(k))(1-ÏG(k))/κ · punif †pe †κ/((1-k/Î¥(k))(1-ÏG(k))) · punif
O(κ²n log(n)/ε²)æ¬ã®èŸºãåäžãµã³ããªã³ã°ããããšã§ã以äžãä¿èšŒãããïŒ
(1-ε)x^T Lx †x^T LH x †(1+ε)x^T Lx
ãã¹ãŠã®x â span(vk+1,...,vn)ã«å¯ŸããŠæç«ããã
ã©ã³ãã ãããã¯ã¢ãã«(SBM) ïŒk=4åã®ã¯ã©ã¹ã¿ãåã¯ã©ã¹ã¿200ããŒãéå±€çã©ã³ãã ãããã¯ã¢ãã« ïŒ4åã®ãããã¬ãã«ã¯ã©ã¹ã¿ãšãµãã¯ã©ã¹ã¿ãåèš16åã®ã¯ã©ã¹ã¿LFRãã³ãããŒã¯ã°ã©ã ïŒ800ããŒãã®ãããã¯ãŒã¯ãã³ãããŒã¯ã°ã©ãäžäœk=4åã®åºæãã¯ãã«ãšçã®ã¯ã©ã¹ã¿ãªã³ã°æç€ºãã¯ãã«éã®æå€§äž»è§ã䜿çšïŒâsin Î(Ṍk, C)ââ
å°ããè§åºŠã¯ã¹ãã¯ãã«åã蟌ã¿ã§ã¯ã©ã¹ã¿ãªã³ã°æ§é ãããè¯ãä¿æãããŠããããšã瀺ãã
åäžèŸºãµã³ããªã³ã° ïŒæ¬è«æã§ææ¡ãããææ³æå¹æµæãµã³ããªã³ã° ïŒéèŠåºŠãµã³ããªã³ã°ã«åºã¥ãå€å
žçææ³è¯å¥œãªã¯ã©ã¹ã¿ãªã³ã°ã°ã©ã ïŒå€§ããªã¯ã©ã¹ã¿å
-ã¯ã©ã¹ã¿éèŸºç¢ºçæ¯åŒ±ãã¯ã©ã¹ã¿ãªã³ã°ã°ã©ã ïŒå°ããªã¯ã©ã¹ã¿å
-ã¯ã©ã¹ã¿éèŸºç¢ºçæ¯åå®éšã¯20åå®è¡ãå¹³åå€ãšæšæºåå·®ãå ±å è¯å¥œãªã¯ã©ã¹ã¿ãªã³ã°ã°ã©ã ïŒåäžãµã³ããªã³ã°ã¯åŒ·ãã¯ã©ã¹ã¿ãªã³ã°æ§é äžã§æå¹æµæãµã³ããªã³ã°ãšåçããããã«åªããæ§èœã瀺ã匱ãã¯ã©ã¹ã¿ãªã³ã°ã°ã©ã ïŒåŒ±ãã¯ã©ã¹ã¿ãªã³ã°èšå®ã§ããåäžãµã³ããªã³ã°ã¯æå¹æµæãµã³ããªã³ã°ãšåæ§ã®èª€å·®è»è·¡ã«åŸãéå±€æ§é ïŒéå±€çã©ã³ãã ãããã¯ã¢ãã«äžã§ãåäžãµã³ããªã³ã°ã¯åæ§ã«è¯å¥œãªæ§èœã瀺ãLFRãã³ãããŒã¯ ïŒå®ãããã¯ãŒã¯ãã³ãããŒã¯äžã§ææ³ã®æå¹æ§ãæ€èšŒããè¯å¥œãªã¯ã©ã¹ã¿ãªã³ã°ãæã€ã°ã©ãäžã§ãåäžãµã³ããªã³ã°ã¯å®éã«ã¯æå¹æµæãµã³ããªã³ã°ããããã«äžåã èè
ã¯ããããåäžãµã³ããªã³ã°ãã¯ã©ã¹ã¿é蟺ã®é床ãªãµã³ããªã³ã°ãé¿ããåŸåããããããã¯ã©ã¹ã¿ã¡ã³ããŒãã¯ãã«ãšã®ãã匷ãéšåç©ºéæŽåãçæããããšãåå ãšä»®å®ããŠãã æ§é å®ç ïŒPengãïŒÎ¥(k) = Ω(k²)ã®å Žåãäžäœkåã®ã©ãã©ã·ã¢ã³åºæãã¯ãã«ã®éšå空éãkåã®ã¯ã©ã¹ã¿æç€ºãã¯ãã«ã®éšå空éã«æ¥è¿ããããšã蚌æãã匱å仮説 ïŒMacgregorãšSunã¯ããã匱ãÎ¥(k)仮説ã®äžã§ãã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã®æåãä¿èšŒãããããšã蚌æããå€å
žççµæ ïŒSpielmanã¯æå¹æµææ¯äŸãµã³ããªã³ã°ã«ãã£ãŠÎµ-ã¹ãã¯ãã«çååšãçæããã¢ã«ãŽãªãºã ãå°å
¥ããç·åœ¢ãµã€ãºçååš ïŒBatsonãïŒO(n/ε)蟺ã®ç·åœ¢ãµã€ãºã¹ãã¯ãã«çååšã®ååšæ§ã蚌æããã¡ã¿å®ç ïŒBravermanãïŒããŒã¿æ§é ãè¯å¥œãªå Žåãåäžãµã³ããªã³ã°ãéèŠåºŠãµã³ããªã³ã°ãšåæ§ã«æå¹ãªã¯ã©ã¹ã¿ãªã³ã°ã³ã¢ã»ãããçæã§ããããšã瀺ãããã©ã³ã¹ã¯ã©ã¹ã¿ãªã³ã° ïŒHuangãšVishnoiïŒãã©ã³ã¹ã¯ã©ã¹ã¿ãªã³ã°ã«ãããåäžãµã³ããªã³ã°ã®åœ¹å²ãç ç©¶ããçè«çä¿èšŒ ïŒæ§é ä¿æã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã«ãããåäžèŸºãµã³ããªã³ã°ã®ååæ§ã«å¯ŸããŠãåããŠèšŒæå¯èœãªä¿èšŒãæäŸããå®çšçäŸ¡å€ ïŒã¹ãã¯ãã©ã«ã¯ã©ã¹ã¿ãªã³ã°ã«å¯ŸããŠåçŽã§ã¹ã±ãŒã©ãã«ãªååŠçã¹ããããæäŸããçè«çé£çµ ïŒã³ã¢ã»ããçè«ãšã¹ãã¯ãã«çåãé£çµãã仮説æ¡ä»¶ ïŒã°ã©ããè¯å¥œãªã¯ã©ã¹ã¿ãªã³ã°æ§é ãæã€å¿
èŠãããïŒå€§ããªÎ¥(k)ïŒæ¡ä»¶æ°äŸå ïŒãµã³ããªã³ã°è€éåºŠã¯æ¡ä»¶æ°Îºã«äŸåããç¹å®ã®ã°ã©ãäžã§ã¯å€§ãããªãå¯èœæ§ãããç¡éã¿ã°ã©ãå¶é ïŒçŸåšã®åæã¯äž»ã«ç¡éã¿ã°ã©ãã察象ãšããŠããæµæçéã®æé©å ïŒæµæçéãæ¹åããç¹ã«ÎºãšÎ¥(k)ãžã®äŸåæ§ãæ¹åããéã¿ã°ã©ããžã®æ¡åŒµ ïŒåæãéã¿ã°ã©ããŸãã¯éè€ã¯ã©ã¹ã¿ãªã³ã°ã«æ¡åŒµããä»ã®ã°ã©ãåé¡ ïŒåæ§ã®æ§é èªèåäžãµã³ããªã³ã°çµæãåæåž«ããåŠç¿ãªã©ã®ä»ã®ã°ã©ãåé¡ã«é©çšå¯èœããæ¢çŽ¢ããçè«ç驿° ïŒæ§é æ¡ä»¶äžã§ã®åäžãµã³ããªã³ã°ã®ååæ§ãåããŠèšŒæããçè«ç空çœãåããå®çšçäŸ¡å€ ïŒé«ã³ã¹ããªæµæèšç®ãæé€ããã¹ã±ãŒã©ããªãã£ã倧å¹
ã«åäžãããæè¡çè²¢ç® ïŒrank-(n-k)æå¹æµæãªã©ã®æ°ããåæããŒã«ãå°å
¥ããå®éšæ€èšŒ ïŒè€æ°ã®ã°ã©ãã¢ãã«äžã§çè«ççµæãæ€èšŒãããµã³ããªã³ã°è€é床 ïŒååŠçãåé¿ããŠãããããµã³ããªã³ã°è€é床ã¯äŸç¶ãšããŠé«ããç¹ã«Îºã倧ããå Žåæ§é 仮説 ïŒã°ã©ãæ§é ãžã®ä»®èª¬ã¯æ¯èŒç峿 Œã§ãé©çšç¯å²ãå¶éããŠãã宿°å å ïŒçè«ççéã®å®æ°å åã¯ååã«å³å¯ã§ãªãå¯èœæ§ãããåŠè¡çäŸ¡å€ ïŒã¹ãã¯ãã«çåçè«ã«æ°ããèŠç¹ãæäŸããç°ãªãç ç©¶åéãé£çµããå®çšçæçŸ© ïŒå€§èŠæš¡ã°ã©ãåæã«å¯ŸããŠããåçŽã§å¹æçãªããŒã«ãæäŸããåçºæ§ ïŒæ§é èªèãµã³ããªã³ã°ã«é¢ãããããªãç ç©¶ãåçºããå¯èœæ§ããããœãŒã·ã£ã«ãããã¯ãŒã¯åæ ïŒæç¢ºãªç€ŸäŒæ§é ãæã€ãœãŒã·ã£ã«ãããã¯ãŒã¯çç©ãããã¯ãŒã¯ ïŒã¿ã³ãã¯è³ªçžäºäœçšãããã¯ãŒã¯ãªã©ã®ã¢ãžã¥ãŒã«åæ§é ãæã€çç©ãããã¯ãŒã¯æšå¥šã·ã¹ãã ïŒãŠãŒã¶ãŒ-ã¢ã€ãã çžäºäœçšã°ã©ãã«ãããå調ãã£ã«ã¿ãªã³ã°æ¬è«æã¯ã¹ãã¯ãã©ã«ã°ã©ãçè«ãè¡åæåçè«ãã¯ã©ã¹ã¿ãªã³ã°åæãªã©è€æ°ã®åéã®éèŠãªç ç©¶ãåŒçšããŠããã以äžãå«ãïŒ
SpielmanãšSrivastavaã«ããã¹ãã¯ãã«çåã®éæçç ç©¶ Pengãã«ããã¯ã©ã¹ã¿ãªã³ã°å¯èœã°ã©ãæ§é å®çã®ç ç©¶ Davis-Kahanã®å®çãªã©ã®è¡åæåçè«ã®å€å
žççµæ ç·æ¬ ïŒæ¬è«æã¯ã¹ãã¯ãã©ã«ã°ã©ãçååéã§éèŠãªçè«çè²¢ç®ãè¡ããç¹å®ã®æ§é æ¡ä»¶äžã§ã®åçŽãªåäžãµã³ããªã³ã°ã®æå¹æ§ã蚌æãããããã€ãã®éçã¯ååšããããå€§èŠæš¡ã°ã©ãåæã«å¯ŸããŠæ°ããçè«çåºç€ãšå®çšçããŒã«ãæäŸããéèŠãªåŠè¡çããã³å¿çšç䟡å€ãæã€ã