2025-11-24T00:19:17.738116

Overlap Gap and Computational Thresholds in the Square Wave Perceptron

Benedetti, Bogdanov, Malatesta et al.

Square Wave Perceptrons (SWPs) form a class of neural network models with oscillating activation function that exhibit intriguing ``hardness'' properties in the high-dimensional limit at a fixed constraint density $Î±= O(1)$. In this work, we examine two key aspects of these models. The first is related to the so-called \emph{overlap-gap property}, that is a disconnectivity feature of the geometry of the solution space of combinatorial optimization problems proven to cause the failure of a large family of solvers, and conjectured to be a symptom of algorithmic hardness. We identify, both in the storage and in the teacher-student settings, the emergence of an overlap gap at a threshold $Î±_{\mathrm{OGP}}(Î´)$, which can be made arbitrarily small by suitably increasing the frequency of oscillations $1/Î´$ of the activation. This suggests that in this small-$Î´$ regime, typical instances of the problem are hard to solve even for small values of $Î±$. Second, in the teacher-student setup, we show that the recovery threshold of the planted signal for message-passing algorithms can be made arbitrarily large by reducing $Î´$. These properties make SWPs both a challenging benchmark for algorithms and an interesting candidate for cryptographic applications.

academic

Overlap Gap and Computational Thresholds in the Square Wave Perceptron

Basic Information

Paper ID: 2506.05197
Title: Overlap Gap and Computational Thresholds in the Square Wave Perceptron
Authors: Marco Benedetti, Andrej Bogdanov, Enrico M. Malatesta, Marc Mézard, Gianmarco Perrupato, Alon Rosen, Nikolaj I. Schwartzbach, Riccardo Zecchina
Classification: cond-mat.dis-nn (Condensed Matter - Disordered Systems and Neural Networks)
Publication Date: October 13, 2025 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2506.05197

Abstract

Square Wave Perceptrons (SWPs) are a class of neural network models with oscillatory activation functions that exhibit interesting "hardness" properties in the high-dimensional limit at fixed constraint density α = O(1). This study examines two key aspects of these models: first, the overlap-gap property, a disconnectivity feature of the solution space geometry in combinatorial optimization problems that has been proven to cause numerous solvers to fail and is conjectured to be a symptom of algorithmic hardness; second, in the teacher-student setting, the recovery threshold of message-passing algorithms can become arbitrarily large by decreasing δ. These properties make SWPs both challenging benchmarks for algorithms and interesting candidates for cryptographic applications.

Research Background and Motivation

Problem Background

This research focuses on the computational complexity of the perceptron problem, particularly hardness analysis at the intersection of statistical physics and theoretical computer science. The perceptron, as the most fundamental neural network model, has important implications for understanding computational limitations of more complex neural networks through its learning and storage problems.

Core Problems

Statistical-Computational Gap: In many constraint satisfaction problems, there exists a gap between the parameter region that is information-theoretically feasible and the region where known polynomial-time algorithms can operate
Geometric Hardness Characteristics: How the geometric structure of solution spaces affects algorithmic computational complexity
Overlap-Gap Property: Geometric disconnectivity as a predictor of algorithmic hardness

Research Motivation

Existing binary perceptrons (such as ABP, SBP), while exhibiting statistical-computational gaps, have relatively fixed hardness thresholds
A model capable of tuning hardness properties is needed to better understand the geometric causes of algorithmic failure
Exploring the potential application of models with extreme hardness properties in cryptography

Core Contributions

Introduction of Square Wave Perceptron Model: Proposes a novel perceptron model with oscillatory activation function φ_δ(h) = -sgn(sin(πh/δ))
Overlap-Gap Threshold Analysis: Identifies the overlap-gap threshold α_OGP(δ) in storage and teacher-student settings, which can become arbitrarily small by increasing oscillation frequency 1/δ
Extreme Hardness Properties: Proves that in the δ→0 limit, overlap-gap exists for any α>0, indicating the problem is difficult even at small constraint densities
Tunable Recovery Threshold: Demonstrates in the teacher-student setting that the recovery threshold of message-passing algorithms can become arbitrarily large by decreasing δ
Cryptographic Application Prospects: Provides theoretical foundation for perceptron-based cryptographic constructions (such as collision-resistant hash functions)

Detailed Methodology

Task Definition

Storage Problem

Given dataset D = {(x^μ, y^μ)}^P_{μ=1}, find weight vector w ∈ {-1,+1}^N such that:

y^μ = φ(1/√N · w · x^μ) ∀μ ∈ [P]

where x^μ_i ~ N(0,1) are i.i.d., and y^μ = ±1 with equal probability.

Teacher-Student Problem

There exists a "teacher" weight w* ∈ {-1,+1}^N, with labels generated by the teacher:

y^μ = φ(1/√N · w* · x^μ) ∀μ ∈ [P]

The goal is to recover the teacher weights or find any valid solution.

Model Architecture

Square Wave Activation Function

φ_δ(h) = -sgn(sin(πh/δ))

This activation function has period T = 2δ, with oscillation frequency controlled by parameter δ.

Fourier Representation

φ_δ(h) = -4/π ∑_{n=0}^∞ 1/(2n+1) sin((2n+1)πh/δ)

Overlap-Gap Property Analysis

m-OGP Definition

For a given instance D, define the set S_m(q,ε,D) containing all sets of m configurations {w^1,...,w^m} satisfying:

Each w^a satisfies the constraints
For any a≠b: q < (w^a·w^b)/N < q+ε

The m-OGP property holds if Pr_DS_m(q,ε,D) ≠ ∅ → 0 (as N→∞).

Cloned Partition Function

N_m(q;D) = ∑_{w^1,...,w^m} ∏_{a=1}^m X_D(w^a) ∏_{a<b} δ(q - w^a·w^b/N)

Annealed Free Entropy

φ^{ann}_m(q) = lim_{N→∞} 1/(mN) ln E_D N_m(q;D)

Technical Innovations

Tunable Hardness: Continuously adjust problem computational hardness through parameter δ, reaching extreme hardness as δ→0
Geometric Analysis: Utilize statistical physics methods to analyze solution space geometry
Multi-scale Analysis: Combine annealed approximation and replica symmetry analysis for predictions at different precision levels
Small δ Limit Analytical Treatment: Precisely analyze limiting cases through perturbative expansion

Experimental Setup

Theoretical Analysis Methods

Annealed Computation: Provides upper bound estimates of OGP thresholds
Replica Symmetry (RS) Analysis: More precise free entropy calculations
Small δ Expansion: Perturbative analysis for the δ→0 limit

Numerical Experiments

System Size: N = 4000-5000
Sample Count: Average of 80 independent samples per data point
Algorithm Testing: Reinforced Approximate Message Passing (RAMP) algorithm

Evaluation Metrics

Satisfiability Threshold: α_c(δ) - critical constraint density where solutions exist
OGP Threshold: α_OGP(m,δ) - threshold where m-overlap-gap appears
Teacher Threshold: α_T(δ) - threshold where teacher becomes unique solution
Algorithm Threshold: α_alg(δ) - threshold where RAMP algorithm fails

Experimental Results

Main Results

Satisfiability Threshold

For δ→∞, recovers ABP capacity α^{ABP}_c ≈ 0.833
For δ→0, capacity approaches upper bound α_c(0) = 1
RS estimates coincide with annealed upper bounds in small δ limit

Overlap-Gap Threshold

In the δ→0 limit:

α^{ann}_{OGP}(m) = 1/m

Therefore α_(∞) = 0, meaning overlap-gap exists for any α > 0.

Teacher-Student Problem Results

Teacher threshold α_T(δ) approaches 1 as δ→0
Recovery threshold α_r(δ) can become arbitrarily large by decreasing δ
Diverging recovery threshold achieved in the reverse wedge perceptron

Algorithm Performance Analysis

RAMP algorithm performance testing shows:

Algorithm threshold α_alg(δ) decreases as δ decreases
Gap exists between RS-estimated OGP threshold and algorithm threshold
For δ = 1.5, gap approaches zero, consistent with known ABP results

Case Study: Reverse Wedge Perceptron

By designing activation function:

φ(h) = sgn((h-γ)h(h+γ))

At γ = γ* = √(2log2), recovery threshold diverges:

α_r ≈ (π/8)(log2)^{-3/2} ε^{-1}

where ε = |γ - γ*|.

Perceptron Theory

Classical Results: Gardner-Derrida capacity analysis
Binary Perceptrons: Hardness properties of ABP and SBP models
Statistical-Computational Gap: Discrepancy between Bansal-Spencer algorithms and information-theoretic limits

Overlap-Gap Property

Definition and Development: Original definition by Gamarnik-Zadik
Algorithm Failure Proofs: Universal results for stable algorithm classes
Application Examples: Random graphs, satisfiability problems, etc.

Statistical Physics Methods

Replica Method: Computing partition functions and phase transitions
Geometric Phase Transitions: Changes in solution space clustering structure
Message Passing: Theoretical analysis of BP and AMP algorithms

Conclusions and Discussion

Main Conclusions

Extreme Hardness: SWP exhibits extreme computational hardness in the δ→0 limit, with overlap-gap existing for any positive constraint density
Tunability: Problem hardness properties can be continuously adjusted through parameter δ
Cryptographic Potential: These properties make SWP a strong candidate for cryptographic applications
Geometric Understanding: Oscillatory activation functions disrupt solution space connectivity, causing algorithmic failure

Limitations

Replica Symmetry: RS analysis may be inaccurate in certain parameter regions, requiring higher-order replica symmetry breaking
Finite-Size Effects: Theoretical analysis is based on thermodynamic limit; actual finite systems may show deviations
Algorithm Limitations: Only RAMP algorithm tested; performance of other algorithms remains to be studied
Parameter Dependence: Results strongly depend on choice of parameter δ

Future Directions

Full RSB Analysis: Develop complete replica symmetry breaking theory
Other Algorithms: Test spectral methods, SDP relaxations, and other algorithm classes
Cryptographic Implementation: Develop concrete cryptographic protocols based on SWP
Generalizations: Study similar properties for other oscillatory activation functions

In-Depth Evaluation

Strengths

Theoretical Innovation: First systematic study of computational hardness with oscillatory activation functions, providing new theoretical perspectives
Rigorous Methodology: Combines statistical physics and theoretical computer science methods with comprehensive and in-depth analysis
Profound Results: Discovers new mechanisms for tunable hardness, with important implications for understanding algorithmic limits
Application Prospects: Provides new theoretical tools for cryptography

Weaknesses

Technical Complexity: Analysis methods are relatively complex, potentially limiting accessibility of results
Experimental Verification: Relies primarily on theoretical analysis with limited numerical experiments
Practical Feasibility: Realizability of extreme parameters (δ→0) in practice is questionable
Generalizability: Universality of results and applicability to other problems require further verification

Impact

Theoretical Contribution: Provides new analytical tools and perspectives for computational complexity theory
Interdisciplinary Value: Connects statistical physics, machine learning, and cryptography
Inspirational Significance: May inspire further research on geometric hardness
Practical Potential: Has application prospects in cryptography and algorithm design

Applicable Scenarios

Theoretical Research: Computational complexity theory and statistical physics research
Algorithm Analysis: Understanding limits and failure mechanisms of message-passing algorithms
Cryptographic Design: Developing new cryptographic primitives based on hardness assumptions
Machine Learning: Understanding computational obstacles in neural network training

References

The paper cites 75 related references, primarily including:

Rosenblatt (1958): Original definition of the perceptron
Gardner & Derrida (1989): Classical results on perceptron storage capacity
Gamarnik & Zadik (2019): Definition of overlap-gap property
Baldassi et al. (2015): Algorithm threshold analysis for binary perceptrons
Mézard & Montanari (2009): Systematic introduction of statistical physics methods

This paper makes important contributions at the intersection of theoretical computer science and statistical physics. By introducing the square wave perceptron model with tunable hardness, it provides new tools and perspectives for understanding the geometric origins of algorithmic hardness. The discovered extreme hardness properties not only have theoretical value but also open new possibilities for cryptographic applications.