2025-11-15T02:43:10.578367

On Random Sampling of Diffused Graph Signals with Sparse Inputs on Vertex Domain

Lai, Chai, Xu
The sampling of graph signals has recently drawn much attention due to the wide applications of graph signal processing. While a lot of efficient methods and interesting results have been reported to the sampling of band-limited or smooth graph signals, few research has been devoted to non-smooth graph signals, especially to sparse graph signals, which are also of importance in many practical applications. This paper addresses the random sampling of non-smooth graph signals generated by diffusion of sparse inputs. We aim to present a solid theoretical analysis on the random sampling of diffused sparse graph signals, which can be parallel to that of band-limited graph signals, and thus present a sufficient condition to the number of samples ensuring the unique recovery for uniform random sampling. Then, we focus on two classes of widely used binary graph models, and give explicit and tighter estimations on the sampling numbers ensuring unique recovery. We also propose an adaptive variable-density sampling strategy to provide a better performance than uniform random sampling. Finally, simulation experiments are presented to validate the effectiveness of the theoretical results.
academic

On Random Sampling of Diffused Graph Signals with Sparse Inputs on Vertex Domain

Basic Information

  • Paper ID: 2412.20041
  • Title: On Random Sampling of Diffused Graph Signals with Sparse Inputs on Vertex Domain
  • Authors: Yingcheng Lai, Li Chai, Jinming Xu (School of Control Science and Engineering, Zhejiang University)
  • Classification: eess.SP (Electrical Engineering and Systems Science - Signal Processing)
  • Publication Date: December 28, 2024
  • Paper Link: https://arxiv.org/abs/2412.20041

Abstract

Graph signal sampling has received considerable attention due to the widespread applications of graph signal processing. While numerous efficient methods and interesting results have been developed for bandlimited or smooth graph signals, research on non-smooth graph signals, particularly sparse graph signals, remains limited despite their importance in many practical applications. This paper investigates the random sampling problem of non-smooth graph signals generated by diffusion from sparse inputs, aiming to provide rigorous theoretical analysis for random sampling of diffused sparse graph signals and establish sufficient conditions on the number of samples required by uniform random sampling to ensure unique recovery. The paper focuses on analyzing two classes of widely-used binary graph models, providing explicit and tighter estimates of the sampling numbers ensuring unique recovery, and proposes an adaptive variable-density sampling strategy to achieve better performance than uniform random sampling.

Research Background and Motivation

Problem Background

  1. Importance of Graph Signal Processing: Graphs serve as a powerful framework for modeling unstructured data and complex interactions, with widespread applications in social networks, brain networks, transportation networks, and other domains
  2. Challenges in Sampling: The number of vertices in real-world networks is typically enormous, making it extremely difficult to collect information from all vertices; thus, sampling and recovery problems have become central issues in graph signal processing

Limitations of Existing Methods

  1. Research Bias: Most existing research concentrates on sampling and recovery of smooth graph signals (bandlimited graph signals), assuming that graph signals possess approximate bandlimited characteristics in the graph Fourier basis
  2. Insufficient Study of Non-smooth Signals: Limited research exists on non-smooth graph signals generated by local diffusion from sparse inputs, despite their importance in practical applications
  3. Lack of Theoretical Guarantees: Explicit theoretical guarantees for diffused sparse graph signals are lacking, particularly regarding the relationship between sampling numbers and network structure

Research Motivation

The paper addresses three key questions:

  1. For a given graph diffusion model, how many vertices must be sampled to ensure accurate reconstruction of sparse inputs?
  2. What is the relationship between the number of samples and network structure?
  3. Do alternative sampling strategies exist that provide higher recovery accuracy than uniform random sampling?

Core Contributions

  1. Theoretical Guarantees: Establishes sufficient conditions ensuring unique reconstruction under uniform random sampling, revealing the relationship between sampling numbers and the incoherence parameter μ, sparse condition number κ(Γ), and sparsity k
  2. Network Structure Analysis:
    • For Erdős-Rényi (ER) random networks, proves that approximately ~log n samples suffice to ensure recovery uniqueness
    • Reveals the relationship between connection probability and required sampling numbers, discovering that the minimum sampling requirement occurs at connection probability 0.5
    • For small-world networks, characterizes the relationship between required sampling numbers and rewiring probability
  3. Novel Sampling Strategy: Proposes an adaptive variable-density sampling strategy utilizing variable-density sampling techniques to provide performance guarantees with fewer samples than uniform random sampling
  4. Experimental Validation: Verifies the effectiveness of theoretical results through simulation experiments

Methodology Details

Problem Formulation

Consider the diffused graph signal model:

x = Hα

where:

  • H is the graph diffusion matrix
  • α ∈ ℝⁿ is the sparse input with sparsity k (i.e., ‖α‖₀ ≤ k)
  • The objective is to recover the sparse input α through random sampling of a certain number of vertices

Sampling Model

Let M = {ω₁, ..., ωₘ} denote the sampling set, with sampling matrix Cₘ defined as:

[Cₘ]ᵢ,ⱼ = {1, if j = ωᵢ; 0, otherwise}

The observed signal is:

y = CₘHα = Hₘα

where Hₘ = CₘH.

Core Theoretical Results

Main Theorem (Theorem 1)

Under uniform random sampling, when the following condition is satisfied, problem (P1) has a unique minimizing solution with probability 1-e⁻ᵋ-3/n:

m ≥ C(1+ε)μ²kκ(Γ)(log n + log μ)

where:

  • μ is the incoherence parameter
  • κ(Γ) is the sparse condition number
  • Γ = mEH*ₘHₘ⁻¹

Key Definitions

  1. Incoherence Parameter (Definition 1):
max₁≤ᵢ,ⱼ≤ₙ{|hᵢ,ⱼ|} ≤ μ, max₁≤ᵢ,ⱼ≤ₙ{|[HΓ]ᵢ,ⱼ|} ≤ μ
  1. Sparse Condition Number (Definition 2):
κ(X) = max{cond(k,X), cond(k,X⁻¹)}

Analysis of Specific Network Models

Erdős-Rényi Random Networks

For ER random networks with connection probability b under the binary diffusion model H = I + δA:

m ≥ C(1+ε)k³/²(log n - log δ²(b-b²))/(δ²(b-b²))

Key Findings:

  • The required sampling number is minimized when b = 0.5
  • As b approaches 0 or 1, all vertices must be sampled

Small-World Networks

For small-world networks with degree d and rewiring probability b:

m ≥ C(1+ε)kμ²·Δκ·(log n + log μ)

where Δκ is related to the adjacency matrix of d-regular graphs and decreases with increasing rewiring probability b.

Variable-Density Sampling Strategy

Define the weight for each vertex i as:

φᵢ = max_{j=1,...,n}{|hᵢ,ⱼ|}
φ̃ᵢ = max_{j=1,...,n}{|[HΓ]ᵢ,ⱼ|}

The sampling probability is:

pᵢ = √(φᵢφ̃ᵢ)/Σⱼ√(φⱼφ̃ⱼ)

The sampling upper bound for this strategy is:

m ≥ C(1+ε)φ̄²kκ(Γ)(log n + log φ̄)

where φ̄ is the average weight, which is always no greater than the incoherence parameter μ.

Experimental Setup

Experimental Configuration

  • Graph generation using the GSP toolbox for different graph types
  • Basis pursuit algorithm employed to solve problem (P1)
  • Evaluation metric: normalized average recovery error ‖α-α̂‖₂/‖α̂‖₂
  • 500 iterations performed for each sampling number with averaged results

Experimental Scenarios

  1. Example I: Comparison of different graph types (regular graphs, ER random graphs, star graphs)
  2. Example II: ER random networks with different connection probabilities
  3. Example III: Small-world networks with different rewiring probabilities
  4. Example IV: Variable-density sampling under doubly stochastic matrix diffusion model

Experimental Results

Main Findings

Comparison of Different Graph Types

  • ER random graphs require fewer samples compared to regular graphs and star graphs
  • This is consistent with theoretical analysis: ER random graphs possess smaller μ and κ(Γ) values

ER Random Network Analysis

  • Recovery performance is similar for connection probabilities b = 0.3 and b = 0.7, confirming the theoretical prediction of symmetry
  • Extreme connection probabilities (approaching 0 or 1) require more samples

Small-World Network Analysis

  • Required sampling numbers decrease with increasing rewiring probability
  • Validates the theoretical analysis conclusion that condition numbers decrease with rewiring probability

Variable-Density Sampling Performance

  • The variable-density sampling strategy achieves improved recovery performance compared to uniform random sampling with fewer samples

Numerical Results

Experimental results demonstrate:

  • For ER random networks, approximately ~log n samples indeed suffice to ensure sparse signal recovery
  • Network structure parameters (such as connection probability and rewiring probability) significantly affect required sampling numbers
  • Variable-density sampling offers advantages in practical applications

Graph Signal Processing Sampling Theory

  1. Smooth Signal Sampling: Pioneering work by Pesenson et al.; necessary and sufficient conditions by Anis et al.
  2. Sampling Strategies: Deterministic sampling (greedy optimization) and random sampling (variable-density probability sampling)
  3. Extended Research: Directed graphs, special graph signal types

Compressed Sensing Theory

  1. Classical Results: RIP conditions, mutual incoherence conditions
  2. Dictionary Learning: Compressed sensing with redundant dictionaries
  3. Application Domains: MRI reconstruction, channel coding, data compression
  1. Known Diffusion Models: Recovery algorithm design by Ramírez et al.
  2. Unknown Diffusion Models: Joint estimation methods for blind identification problems
  3. Application Background: Rumor source identification in social networks, inverse problems in biological signal analysis

Conclusions and Discussion

Main Conclusions

  1. Theoretical Contribution: Establishes a complete theoretical framework for random sampling of diffused sparse graph signals
  2. Network Structure Insights: Reveals the deep relationship between sampling performance and network topology
  3. Algorithm Improvement: The proposed variable-density sampling strategy possesses theoretical guarantees and practical advantages

Limitations

  1. Assumption Requirements: Requires the assumption that EH*ₘHₘ is invertible (Assumption 1)
  2. Computational Complexity: Calculation of sparse condition number κ(Γ) may be computationally intensive
  3. Practical Application: The constant C in theoretical results may require further optimization in practical applications

Future Directions

  1. Network Structure Exploration: How to design better recovery algorithms by exploiting network structure
  2. Algorithm Optimization: Specialized algorithm design for specific network types
  3. Application Extension: Validation of theoretical results in more practical scenarios

In-Depth Evaluation

Strengths

  1. Theoretical Rigor: Provides a complete mathematical proof framework utilizing mature compressed sensing theory tools
  2. Practical Value: Delivers explicit sampling number estimates for two important network models
  3. Innovation: First systematic analysis of random sampling for diffused sparse graph signals
  4. Sufficient Experiments: Validates theoretical results through multiple experimental scenarios

Weaknesses

  1. Constant Optimization: The constant C in theoretical results may not be sufficiently tight, potentially requiring fewer samples in practical applications
  2. Computational Efficiency: Insufficient analysis of computational complexity for sparse condition number calculation
  3. Network Model Limitations: Primarily analyzes binary diffusion models with limited extensibility to other diffusion models

Impact

  1. Academic Contribution: Fills an important gap in non-smooth signal sampling theory for graph signal processing
  2. Practical Value: Provides theoretical guidance for signal sampling in large-scale networks
  3. Reproducibility: Clear experimental setup, detailed theoretical derivations, and good reproducibility

Applicable Scenarios

  1. Social Network Analysis: Rumor propagation source identification, opinion diffusion analysis
  2. Epidemiology: Epidemic source tracking, transmission pathway analysis
  3. Brain Network Research: Sparse representation and sampling of neural signals
  4. Urban Sensing: Optimal deployment of sensor networks in smart cities

References

The paper cites 44 related references, primarily including:

  • Foundational graph signal processing theory (Ortega et al., Shuman et al.)
  • Compressed sensing theory (Candès & Tao, Baraniuk et al.)
  • Graph sampling theory (Pesenson, Anis et al.)
  • Related work on diffusion models (Ramírez et al., Segarra et al.)

This paper provides systematic theoretical analysis and practical algorithms for the important problem of sparse diffused signal sampling in practical applications, building upon the theoretical foundations of graph signal processing, and represents a significant contribution to the field.