2025-11-24T00:19:17.738116

Overlap Gap and Computational Thresholds in the Square Wave Perceptron

Benedetti, Bogdanov, Malatesta et al.
Square Wave Perceptrons (SWPs) form a class of neural network models with oscillating activation function that exhibit intriguing ``hardness'' properties in the high-dimensional limit at a fixed constraint density $α= O(1)$. In this work, we examine two key aspects of these models. The first is related to the so-called \emph{overlap-gap property}, that is a disconnectivity feature of the geometry of the solution space of combinatorial optimization problems proven to cause the failure of a large family of solvers, and conjectured to be a symptom of algorithmic hardness. We identify, both in the storage and in the teacher-student settings, the emergence of an overlap gap at a threshold $α_{\mathrm{OGP}}(δ)$, which can be made arbitrarily small by suitably increasing the frequency of oscillations $1/δ$ of the activation. This suggests that in this small-$δ$ regime, typical instances of the problem are hard to solve even for small values of $α$. Second, in the teacher-student setup, we show that the recovery threshold of the planted signal for message-passing algorithms can be made arbitrarily large by reducing $δ$. These properties make SWPs both a challenging benchmark for algorithms and an interesting candidate for cryptographic applications.
academic

Overlap Gap and Computational Thresholds in the Square Wave Perceptron

Basic Information

  • Paper ID: 2506.05197
  • Title: Overlap Gap and Computational Thresholds in the Square Wave Perceptron
  • Authors: Marco Benedetti, Andrej Bogdanov, Enrico M. Malatesta, Marc Mézard, Gianmarco Perrupato, Alon Rosen, Nikolaj I. Schwartzbach, Riccardo Zecchina
  • Classification: cond-mat.dis-nn (Condensed Matter - Disordered Systems and Neural Networks)
  • Publication Date: October 13, 2025 (arXiv preprint)
  • Paper Link: https://arxiv.org/abs/2506.05197

Abstract

Square Wave Perceptrons (SWPs) are a class of neural network models with oscillatory activation functions that exhibit interesting "hardness" properties in the high-dimensional limit at fixed constraint density α = O(1). This study examines two key aspects of these models: first, the overlap-gap property, a disconnectivity feature of the solution space geometry in combinatorial optimization problems that has been proven to cause numerous solvers to fail and is conjectured to be a symptom of algorithmic hardness; second, in the teacher-student setting, the recovery threshold of message-passing algorithms can become arbitrarily large by decreasing δ. These properties make SWPs both challenging benchmarks for algorithms and interesting candidates for cryptographic applications.

Research Background and Motivation

Problem Background

This research focuses on the computational complexity of the perceptron problem, particularly hardness analysis at the intersection of statistical physics and theoretical computer science. The perceptron, as the most fundamental neural network model, has important implications for understanding computational limitations of more complex neural networks through its learning and storage problems.

Core Problems

  1. Statistical-Computational Gap: In many constraint satisfaction problems, there exists a gap between the parameter region that is information-theoretically feasible and the region where known polynomial-time algorithms can operate
  2. Geometric Hardness Characteristics: How the geometric structure of solution spaces affects algorithmic computational complexity
  3. Overlap-Gap Property: Geometric disconnectivity as a predictor of algorithmic hardness

Research Motivation

  • Existing binary perceptrons (such as ABP, SBP), while exhibiting statistical-computational gaps, have relatively fixed hardness thresholds
  • A model capable of tuning hardness properties is needed to better understand the geometric causes of algorithmic failure
  • Exploring the potential application of models with extreme hardness properties in cryptography

Core Contributions

  1. Introduction of Square Wave Perceptron Model: Proposes a novel perceptron model with oscillatory activation function φ_δ(h) = -sgn(sin(πh/δ))
  2. Overlap-Gap Threshold Analysis: Identifies the overlap-gap threshold α_OGP(δ) in storage and teacher-student settings, which can become arbitrarily small by increasing oscillation frequency 1/δ
  3. Extreme Hardness Properties: Proves that in the δ→0 limit, overlap-gap exists for any α>0, indicating the problem is difficult even at small constraint densities
  4. Tunable Recovery Threshold: Demonstrates in the teacher-student setting that the recovery threshold of message-passing algorithms can become arbitrarily large by decreasing δ
  5. Cryptographic Application Prospects: Provides theoretical foundation for perceptron-based cryptographic constructions (such as collision-resistant hash functions)

Detailed Methodology

Task Definition

Storage Problem

Given dataset D = {(x^μ, y^μ)}^P_{μ=1}, find weight vector w ∈ {-1,+1}^N such that:

y^μ = φ(1/√N · w · x^μ) ∀μ ∈ [P]

where x^μ_i ~ N(0,1) are i.i.d., and y^μ = ±1 with equal probability.

Teacher-Student Problem

There exists a "teacher" weight w* ∈ {-1,+1}^N, with labels generated by the teacher:

y^μ = φ(1/√N · w* · x^μ) ∀μ ∈ [P]

The goal is to recover the teacher weights or find any valid solution.

Model Architecture

Square Wave Activation Function

φ_δ(h) = -sgn(sin(πh/δ))

This activation function has period T = 2δ, with oscillation frequency controlled by parameter δ.

Fourier Representation

φ_δ(h) = -4/π ∑_{n=0}^∞ 1/(2n+1) sin((2n+1)πh/δ)

Overlap-Gap Property Analysis

m-OGP Definition

For a given instance D, define the set S_m(q,ε,D) containing all sets of m configurations {w^1,...,w^m} satisfying:

  1. Each w^a satisfies the constraints
  2. For any a≠b: q < (w^a·w^b)/N < q+ε

The m-OGP property holds if Pr_DS_m(q,ε,D) ≠ ∅ → 0 (as N→∞).

Cloned Partition Function

N_m(q;D) = ∑_{w^1,...,w^m} ∏_{a=1}^m X_D(w^a) ∏_{a<b} δ(q - w^a·w^b/N)

Annealed Free Entropy

φ^{ann}_m(q) = lim_{N→∞} 1/(mN) ln E_D N_m(q;D)

Technical Innovations

  1. Tunable Hardness: Continuously adjust problem computational hardness through parameter δ, reaching extreme hardness as δ→0
  2. Geometric Analysis: Utilize statistical physics methods to analyze solution space geometry
  3. Multi-scale Analysis: Combine annealed approximation and replica symmetry analysis for predictions at different precision levels
  4. Small δ Limit Analytical Treatment: Precisely analyze limiting cases through perturbative expansion

Experimental Setup

Theoretical Analysis Methods

  • Annealed Computation: Provides upper bound estimates of OGP thresholds
  • Replica Symmetry (RS) Analysis: More precise free entropy calculations
  • Small δ Expansion: Perturbative analysis for the δ→0 limit

Numerical Experiments

  • System Size: N = 4000-5000
  • Sample Count: Average of 80 independent samples per data point
  • Algorithm Testing: Reinforced Approximate Message Passing (RAMP) algorithm

Evaluation Metrics

  • Satisfiability Threshold: α_c(δ) - critical constraint density where solutions exist
  • OGP Threshold: α_OGP(m,δ) - threshold where m-overlap-gap appears
  • Teacher Threshold: α_T(δ) - threshold where teacher becomes unique solution
  • Algorithm Threshold: α_alg(δ) - threshold where RAMP algorithm fails

Experimental Results

Main Results

Satisfiability Threshold

  • For δ→∞, recovers ABP capacity α^{ABP}_c ≈ 0.833
  • For δ→0, capacity approaches upper bound α_c(0) = 1
  • RS estimates coincide with annealed upper bounds in small δ limit

Overlap-Gap Threshold

In the δ→0 limit:

α^{ann}_{OGP}(m) = 1/m

Therefore α_(∞) = 0, meaning overlap-gap exists for any α > 0.

Teacher-Student Problem Results

  • Teacher threshold α_T(δ) approaches 1 as δ→0
  • Recovery threshold α_r(δ) can become arbitrarily large by decreasing δ
  • Diverging recovery threshold achieved in the reverse wedge perceptron

Algorithm Performance Analysis

RAMP algorithm performance testing shows:

  • Algorithm threshold α_alg(δ) decreases as δ decreases
  • Gap exists between RS-estimated OGP threshold and algorithm threshold
  • For δ = 1.5, gap approaches zero, consistent with known ABP results

Case Study: Reverse Wedge Perceptron

By designing activation function:

φ(h) = sgn((h-γ)h(h+γ))

At γ = γ* = √(2log2), recovery threshold diverges:

α_r ≈ (π/8)(log2)^{-3/2} ε^{-1}

where ε = |γ - γ*|.

Perceptron Theory

  • Classical Results: Gardner-Derrida capacity analysis
  • Binary Perceptrons: Hardness properties of ABP and SBP models
  • Statistical-Computational Gap: Discrepancy between Bansal-Spencer algorithms and information-theoretic limits

Overlap-Gap Property

  • Definition and Development: Original definition by Gamarnik-Zadik
  • Algorithm Failure Proofs: Universal results for stable algorithm classes
  • Application Examples: Random graphs, satisfiability problems, etc.

Statistical Physics Methods

  • Replica Method: Computing partition functions and phase transitions
  • Geometric Phase Transitions: Changes in solution space clustering structure
  • Message Passing: Theoretical analysis of BP and AMP algorithms

Conclusions and Discussion

Main Conclusions

  1. Extreme Hardness: SWP exhibits extreme computational hardness in the δ→0 limit, with overlap-gap existing for any positive constraint density
  2. Tunability: Problem hardness properties can be continuously adjusted through parameter δ
  3. Cryptographic Potential: These properties make SWP a strong candidate for cryptographic applications
  4. Geometric Understanding: Oscillatory activation functions disrupt solution space connectivity, causing algorithmic failure

Limitations

  1. Replica Symmetry: RS analysis may be inaccurate in certain parameter regions, requiring higher-order replica symmetry breaking
  2. Finite-Size Effects: Theoretical analysis is based on thermodynamic limit; actual finite systems may show deviations
  3. Algorithm Limitations: Only RAMP algorithm tested; performance of other algorithms remains to be studied
  4. Parameter Dependence: Results strongly depend on choice of parameter δ

Future Directions

  1. Full RSB Analysis: Develop complete replica symmetry breaking theory
  2. Other Algorithms: Test spectral methods, SDP relaxations, and other algorithm classes
  3. Cryptographic Implementation: Develop concrete cryptographic protocols based on SWP
  4. Generalizations: Study similar properties for other oscillatory activation functions

In-Depth Evaluation

Strengths

  1. Theoretical Innovation: First systematic study of computational hardness with oscillatory activation functions, providing new theoretical perspectives
  2. Rigorous Methodology: Combines statistical physics and theoretical computer science methods with comprehensive and in-depth analysis
  3. Profound Results: Discovers new mechanisms for tunable hardness, with important implications for understanding algorithmic limits
  4. Application Prospects: Provides new theoretical tools for cryptography

Weaknesses

  1. Technical Complexity: Analysis methods are relatively complex, potentially limiting accessibility of results
  2. Experimental Verification: Relies primarily on theoretical analysis with limited numerical experiments
  3. Practical Feasibility: Realizability of extreme parameters (δ→0) in practice is questionable
  4. Generalizability: Universality of results and applicability to other problems require further verification

Impact

  1. Theoretical Contribution: Provides new analytical tools and perspectives for computational complexity theory
  2. Interdisciplinary Value: Connects statistical physics, machine learning, and cryptography
  3. Inspirational Significance: May inspire further research on geometric hardness
  4. Practical Potential: Has application prospects in cryptography and algorithm design

Applicable Scenarios

  1. Theoretical Research: Computational complexity theory and statistical physics research
  2. Algorithm Analysis: Understanding limits and failure mechanisms of message-passing algorithms
  3. Cryptographic Design: Developing new cryptographic primitives based on hardness assumptions
  4. Machine Learning: Understanding computational obstacles in neural network training

References

The paper cites 75 related references, primarily including:

  1. Rosenblatt (1958): Original definition of the perceptron
  2. Gardner & Derrida (1989): Classical results on perceptron storage capacity
  3. Gamarnik & Zadik (2019): Definition of overlap-gap property
  4. Baldassi et al. (2015): Algorithm threshold analysis for binary perceptrons
  5. Mézard & Montanari (2009): Systematic introduction of statistical physics methods

This paper makes important contributions at the intersection of theoretical computer science and statistical physics. By introducing the square wave perceptron model with tunable hardness, it provides new tools and perspectives for understanding the geometric origins of algorithmic hardness. The discovered extreme hardness properties not only have theoretical value but also open new possibilities for cryptographic applications.