2025-11-16T19:07:13.213602

SLoG-Net: Algorithm Unrolling for Source Localization on Graphs

Ye, Mateos

We present a novel model-based deep learning solution for the inverse problem of localizing sources of network diffusion. Starting from first graph signal processing (GSP) principles, we show that the problem reduces to joint (blind) estimation of the forward diffusion filter and a sparse input signal that encodes the source locations. Despite the bilinear nature of the observations in said blind deconvolution task, by requiring invertibility of the diffusion filter we are able to formulate a convex optimization problem and solve it using the alternating-direction method of multipliers (ADMM). We then unroll and truncate the novel ADMM iterations to arrive at a parameterized neural network architecture for Source Localization on Graphs (SLoG-Net), that we train in an end-to-end fashion using labeled data. This supervised learning approach offers several advantages such as interpretability, parameter efficiency, and controllable complexity during inference. Our reproducible numerical experiments corroborate that SLoG-Net exhibits performance on par with the iterative ADMM baseline, but with markedly faster inference times and without needing to manually tune step-size or penalty parameters. Overall, our approach combines the best of both worlds by incorporating the inductive biases of a GSP model-based solution within a data-driven, trainable deep learning architecture for blind deconvolution of graph signals.

academic

SLoG-Net: Algorithm Unrolling for Source Localization on Graphs

Basic Information

Paper ID: 2501.00442
Title: SLoG-Net: Algorithm Unrolling for Source Localization on Graphs
Authors: Chang Ye, Gonzalo Mateos (University of Rochester)
Classification: eess.SP (Signal Processing)
Submission Date: December 31, 2024 to arXiv
Paper Link: https://arxiv.org/abs/2501.00442

Abstract

This paper proposes a novel model-based deep learning solution for addressing the inverse problem of network diffusion source localization. Starting from first principles of graph signal processing (GSP), the authors reformulate the problem as joint (blind) estimation of the forward diffusion filter and sparse input signals encoding source locations. Despite the bilinear nature of observations in this blind deconvolution task, by imposing invertibility constraints on the diffusion filter, the problem can be formulated as a convex optimization problem and solved using the Alternating Direction Method of Multipliers (ADMM). Subsequently, the authors unroll and truncate the novel ADMM iterations to obtain a parameterized neural network architecture for source localization on graphs (SLoG-Net), trained end-to-end using labeled data. This supervised learning approach provides advantages including interpretability, parameter efficiency, and controllable inference-time complexity.

Research Background and Motivation

Problem Definition

Network diffusion source localization is an important inverse problem aimed at identifying source node locations in a network from observed diffusion signals. Specifically:

Input: Observed graph signal Y ∈ R^(N×P) with known graph topology
Output: Sparse source signal X ∈ R^(N×P) and unknown diffusion filter coefficients h
Constraints: Source signals exhibit sparsity (at most S ≪ N non-zero elements per column)

Significance

This problem has broad applications across multiple domains:

Sensor-based environmental monitoring
Opinion formation in social networks
Neural signal processing
Epidemiology
Misinformation propagation detection

Limitations of Existing Methods

Traditional GSP methods: Rely on matrix lifting techniques with high computational complexity on large graphs
Iterative solvers: Require careful tuning of step sizes and regularization parameters with slow convergence
Probabilistic models: Optimal only on specific graph structures (e.g., trees) or require restrictive dependency assumptions
Parameter tuning: Existing methods require expensive grid search for parameter selection

Core Contributions

Theoretical Contribution: Reformulates the blind graph filter identification problem as a convex optimization problem under invertibility constraints
Algorithmic Innovation: Develops a specialized ADMM algorithm to efficiently solve the convex optimization problem
Architecture Design: Proposes SLoG-Net by mapping ADMM iterations to trainable neural network layers through algorithm unrolling
Performance Enhancement: Achieves performance comparable to iterative ADMM with significantly faster inference time
Parameter Learning: Automatically learns step sizes and penalty parameters through end-to-end training, eliminating manual tuning

Methodology Details

Task Definition

Given graph G(V,A) and observed signal Y = HX, where:

H = Σ(l=0 to L-1) h_l S^l is an L-th order graph filter
S is the graph shift operator (e.g., normalized adjacency matrix)
X is the sparse source signal matrix

The objective is to jointly estimate filter coefficients h and sparse input X.

Model Architecture

1. Convex Reformulation

Under the filter invertibility assumption (Assumption 2), the problem is transformed to:

min ||X||_{1,1} = ||(Y^T V ⊙ V)g̃||_1
s.t. 1^T_N g̃ = 1

where g̃ is the frequency domain response of the inverse filter.

2. ADMM Algorithm

Using variable separation technique:

min ||x||_1
s.t. Zg̃ - x = 0, 1^T_N g̃ = c

where Z = Y^T V ⊙ V, x = vecX.

ADMM update rules:

Filter Update: g̃k+1 = Γ^(-1)Z^T(ρ_λxk - λk) + (ρ_μc - μk)1_N
Source Signal Update: xk+1 = S_{ρ_λ^(-1)}(Zg̃k+1 + λk/ρ_λ)
Lagrange Multiplier Update: λk+1 = λk + ρ_λ(Zg̃k+1 - xk+1)

3. SLoG-Net Architecture

Unrolls ADMM iterations into K neural network layers, each containing three sublayers:

Filter Sublayer G_k:

g̃[k+1] = (Z^T Z + ρ_2^(k) M^(k)M^(k)T)^(-1)[Z^T(x[k] - ρ_1^(k)λ[k]) + M^(k)(ρ_2^(k)m^(k) - ρ_1^(k)μ[k])]

Source Signal Sublayer X_k:

x[k+1] = S_{τ^(k)}(α_1^(k)Zg̃[k+1] + α_2^(k)λ[k])

Multiplier Sublayer M_k:

λ[k+1] = β_1^(k)λ[k] + β_2^(k)Zg̃[k+1] + β_3^(k)x[k+1]
μ[k+1] = γ^(k)μ[k] + M^(k)T g̃[k+1] + m^(k)

Technical Innovations

Learnable Constraints: Replaces fixed constraint 1^T g̃ = 1 with parameterized matrix M^(k) and vector m^(k)
Layer-wise Decoupling: Uses different parameters per layer rather than parameter sharing for enhanced expressiveness
Efficient Matrix Inversion: Exploits diagonal structure of Z^T Z and matrix inversion lemma for O(N^2) complexity
Residual Connections: ResNet-like data flow design with Z input to all layers

Experimental Setup

Datasets

Synthetic Data:
- Graph types: Erdős-Rényi, Stochastic Block Model (SBM), Barabási-Albert, Random Geometric Graphs
- Number of nodes: N = 20-100
- Sparsity: θ = 0.15
- Filter order: L = 5
Real Data:
- Dolphin social network (N=62)
- Zachary's karate club (N=34)
- Subgraph of Digg 2009 dataset (N=20)

Evaluation Metrics

Relative Error (RE): ||X̂ - X_test||_F / ||X_test||_F
Support Set Accuracy (ACC): Proportion of correctly identified source locations
Inference Time: Forward propagation duration

Comparison Methods

ADMM Baseline: Iterative ADMM algorithm
GNN Methods: Convolutional graph neural networks
IVGD: Invertible validity-aware graph diffusion neural network

Implementation Details

Network layers: K = 5
Training set size: |T| = 200k
Batch size: P = 400
Optimizer: Adam
Training epochs: 30
Constraint parameter dimension: d = 2

Experimental Results

Main Results

1. Comparison with ADMM

Noise Robustness: SLoG-Net outperforms ADMM across different noise levels
Inference Speed: SLoG-Net inference time ~0.009s, ADMM requires 1.99-7.42s
Effect of Parameter Count: SLoG-Net significantly outperforms ADMM when observed signal count P < 160

2. Performance Across Different Graph Types

Graph Type	N	MRE of X̂	MRE of ĝ	ACC
ER	20	0.149	0.164	0.953
SBM	20	0.219	0.215	0.914
RG	20	0.383	0.377	0.869
BA	20	0.579	0.537	0.772
karate	34	0.454	0.452	0.958
dolphins	62	0.719	0.578	0.841

3. Computational Complexity Comparison

N	SLoG-Net	ADMM
20	0.95×10^-2s	2.04s
40	1.09×10^-2s	5.70s
60	1.27×10^-2s	9.41s
80	1.42×10^-2s	12.29s
100	1.64×10^-2s	14.62s

Ablation Studies

Training Set Size: Performance stabilizes at |T| ≥ 160k
Network Layers: K=5 is optimal
Constraint Parameter Dimension: d=2 shows significant improvement over d=1

Real Data Experiments

On Digg 2009 dataset:

SLoG-Net average AUC: 0.56
IVGD baseline AUC: 0.51
Although absolute performance is limited, SLoG-Net still outperforms comparison methods on this challenging task

Graph Signal Processing Methods

Traditional GSP methods use matrix lifting and convex programming
Limitations: High computational complexity, difficult parameter tuning

Probabilistic Models

Maximum likelihood estimation approaches
Optimal only on specific graph structures

Deep Learning Methods

Graph neural networks (GNNs)
Algorithm unrolling techniques in signal processing

Conclusions and Discussion

Main Conclusions

SLoG-Net successfully combines model-driven GSP methods with data-driven deep learning
Achieves performance comparable to ADMM with 2-3 orders of magnitude faster inference
Automatically learns optimization parameters through end-to-end training, eliminating manual tuning
Demonstrates good robustness in noisy environments

Limitations

Scalability: Currently validated primarily on small-scale graphs (N ≤ 100)
Training Data Requirements: Requires large amounts of labeled data (200k samples)
Graph Structure Dependence: Performance closely related to spectral properties of the graph
Filter Invertibility: Relies on strong invertibility assumptions

Future Directions

Large-Scale Graphs: Develop scalable versions for large-scale networks
Transfer Learning: Study model generalization across different graph structures
Theoretical Analysis: Establish theoretical guarantees for stability and transferability
Application Extensions: Extend to neuroscience, seismology, epidemiology, and other domains

In-Depth Evaluation

Strengths

Solid Theoretical Foundation: Based on GSP theory with rigorous mathematical derivations
Strong Methodological Innovation: First application of ADMM unrolling to graph source localization
Comprehensive Experiments: Covers synthetic and real data with multiple graph types and metrics
Engineering Practicality: Significant speed improvements provide practical application value
Good Interpretability: Network architecture directly corresponds to optimization algorithm, facilitating understanding

Weaknesses

Scale Limitations: Experiments primarily on small-scale graphs; large-scale applicability unknown
Strong Assumptions: Filter invertibility assumption may not hold in practical applications
Incomplete Comparisons: Lacks comparison with more recent deep learning methods
Insufficient Theoretical Analysis: Lacks theoretical guarantees for convergence and generalization

Impact

Academic Value: Provides new insights for algorithm unrolling applications in graph signal processing
Practical Value: Has application potential in network monitoring, opinion analysis, and related fields
Reproducibility: Authors provide complete code implementation

Applicable Scenarios

Source localization tasks on small to medium-scale networks
Applications with high real-time requirements
Environments with known and relatively stable graph structures
Supervised learning scenarios where training data is available

References

The paper cites 46 relevant references spanning graph signal processing, optimization theory, and deep learning, providing a solid theoretical foundation for the research.

Overall Assessment: This is a high-quality academic paper that successfully combines optimization theory with deep learning to address the important problem of source localization on graphs. While there is room for improvement in scalability and theoretical analysis, its innovation and practical value make it a significant contribution to the field.