2025-11-10T03:04:02.700056

Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training

Zhao, Xiao, Descos et al.

Partial differential equation (PDE) is an important math tool in science and engineering. This paper experimentally demonstrates an optical neural PDE solver by leveraging the back-propagation-free on-photonic-chip training of physics-informed neural networks.

academic

Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training

Basic Information

Paper ID: 2501.00742
Title: Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training
Authors: Yequan Zhao, Xian Xiao, Antoine Descos, Yuan Yuan, Xinling Yu, Geza Kurczveil, Marco Fiorentino, Zheng Zhang, Raymond G. Beausoleil
Categories: cs.LG cs.AR physics.optics
Publication Date: January 1, 2025
Paper Link: https://arxiv.org/abs/2501.00742

Abstract

Partial differential equations (PDEs) are important mathematical tools in science and engineering. This paper experimentally demonstrates an optical neural PDE solver through backpropagation-free photonic chip training of physics-informed neural networks (PINNs).

Research Background and Motivation

1. Problem to be Addressed

PDE solving holds significant importance in science and engineering. Traditional discretization-based solution methods (such as finite difference and finite element methods) require substantial digital computational resources and runtime. Physics-informed neural networks (PINNs) provide a discretization-free unsupervised approach, but current PINN training typically requires running on powerful GPUs for hours to tens of hours.

2. Problem Significance

PDEs are widely applied in:

Electromagnetic modeling and IC chip thermal analysis
Medical imaging
Autonomous system safety verification

These critical fields therefore require efficient real-time solution schemes.

3. Limitations of Existing Methods

Traditional numerical methods: Large computational resource requirements, long runtime
GPU-based PINN training: High energy consumption, unsuitable for edge device deployment
Backpropagation training of optical neural networks: Requires additional memory and photonic hardware to implement backward computation graphs, necessitates fully differentiable neural network models

4. Research Motivation

Optical neural networks (ONN) offer high throughput, low energy consumption, low latency, and high parallelism solutions. However, training PINNs on photonic chips faces significant challenges, particularly in implementing backpropagation.

Core Contributions

First experimental demonstration of an optical neural PDE solver based on backpropagation-free photonic chip training
Proposed zeroth-order optimization method to address the non-differentiability of optical neural networks without requiring calibration
Implemented on-chip PINN training on a 1×4 microring resonator (MRR) weight group
Verified concept feasibility with ℓ2 error of 5E-3 on one-dimensional heat equation solving

Detailed Methodology

Task Definition

This paper investigates the task of training PINNs on photonic chips to solve partial differential equations. Specifically, using a one-dimensional heat equation as an example:

PDE: ∂u/∂t = (1/π²)∂²u/∂x²
Initial condition: u(x,0) = sin(πx)
Boundary conditions: u(0,t) = 0; u(1,t) = 0
Objective: Learn neural network parameters θ such that û(x,t) = f_θ(x,t) approximates the true solution u(x,t)

Model Architecture

PINN Architecture

Network structure: Feedforward neural network with two hidden layers
Weight matrix dimensions: 2×4, 4×4, 4×4, 4×1
Loss function: L = L_r + L_0 + L_b
- L_r: PDE residual loss
- L_0: Initial condition loss
- L_b: Boundary condition loss

Optical Implementation

1×4 MRR weight group: Utilizing wavelength division multiplexing (WDM) technology
Time division multiplexing: Computing different weight matrices through different clock cycles
Weight encoding: Controlling MRR weight values (range 0 to 1) through thermal tuning voltage

Technical Innovations

1. Zeroth-Order Optimization Replacing Backpropagation

Traditional backpropagation method:

∂L/∂θ = (∂L/∂f)(∂f/∂θ)
Requires differentiable neural network models
Requires additional backward computation hardware

Zeroth-order optimization method:

Gradient estimation: ∂L/∂θ ≈ (L⁺ - L⁻)/(2μ)
Requires only forward propagation
Treats optical neural network as a black box

2. Zeroth-Order Method for Derivative Computation

Spatial derivatives: ∂²û/∂x² = (û₊ + û₋ - 2û)/(2μ) Temporal derivatives: ∂û/∂t = (û₊ - û₋)/(2μ)

Where û₊ and û₋ correspond to network outputs after adding and subtracting perturbations δ to the inputs, respectively.

3. Calibration-Free Training

Directly optimizes tunable parameters (voltage values) of photonic devices
Manufacturing errors and environmental noise can be automatically mitigated during training
No pre-calibration process required

Experimental Setup

Hardware Platform

Chip: 1×4 MRR weight group chip manufactured by AMF foundry
Laser: Tunable laser
Detector: Photodetector
Control: Four voltage sources for thermal tuning

Network Configuration

Input dimension: 2 (spatial coordinate x and temporal coordinate t)
Hidden layers: 2 layers with 4 neurons each
Output dimension: 1 (PDE solution û)
Activation function: σ (sigmoid)

Evaluation Metrics

ℓ2 error: Computed as the ℓ2 norm error between learned and true solutions on test point sets
Visualization: Heat maps showing the value distribution of PDE solution u(x,t)

Training Configuration

Optimization algorithm: Zeroth-order stochastic gradient descent (ZO-SGD)
Training iterations: 1000
Perturbation parameter: μ for gradient estimation

Experimental Results

Main Results

1. Numerical Simulation Results

Using experimentally measured voltage-to-weight lookup tables for simulation:

8-bit precision: Limited expressiveness prevents learning physical laws
10-bit and higher precision: Effectively learns PDE solutions

2. Hardware Experimental Results

Final ℓ2 error: 5E-3
Training convergence: ℓ2 error curves show gradual capture of physical laws described by PDEs during training
Bit precision: Experimental chip training precision falls between 8-bit and 10-bit

Ablation Studies

The paper compares performance across different bit precisions:

8-bit: Poor performance, ineffective learning
10-bit: Significantly improved performance
Hardware experiment: Falls between 8-bit and 10-bit

Experimental Findings

Robustness to manufacturing errors: Zeroth-order optimization method effectively trains despite manufacturing errors
Environmental noise adaptability: Training process adapts to environmental noise
Importance of bit precision: Limited bit precision in simulation constrains the accuracy of final learned solutions

1. Physics-Informed Neural Networks (PINN)

PINN framework proposed by Raissi et al. provides deep learning methods for solving forward and inverse problems
Traditional PINN training relies on backpropagation with large computational overhead

2. Optical Neural Network Training

Stochastic zeroth-order optimization: FLOPS method proposed by Gu et al.
In-situ backpropagation: In-situ backpropagation of photonic neural networks implemented by Pai et al.
Tensor compression methods: TT-PINN and TONN enable large-scale applications

3. Advantages of This Work

First application of zeroth-order optimization to optical PINN training
Experimental verification of concept feasibility
No need for complex calibration procedures

Conclusions and Discussion

Main Conclusions

This paper successfully demonstrates an optical neural PDE solver based on zeroth-order optimization, achieving optical implementation of PINNs through backpropagation-free chip training, providing a new solution for real-time edge PDE solving.

Limitations

Scale limitations: Current experiments conducted only on 1×4 MRR weight groups, relatively small scale
Precision limitations: Limited bit precision in simulation affects final accuracy
PDE types: Verified only on one-dimensional heat equation, requires extension to more complex PDEs
Convergence speed: Zeroth-order optimization converges slower than first-order methods

Future Directions

Scale expansion: Implement thousand-neuron scale using tensor-decomposed PINN (TT-PINN) and tensorized optical neural networks (TONN)
Precision improvement: Enhance simulation precision
Application extension: Extend to more types of PDE problems

In-Depth Evaluation

Strengths

Strong innovation: First chip-level training of optical PINN, pioneering work
Clear technical approach: Zeroth-order optimization cleverly solves the non-differentiability problem of optical devices
Sufficient experimental validation: Complete verification chain from simulation to hardware experiments
High practical value: Provides new insights for edge computing PDE solving

Weaknesses

Limited scale: 1×4 weight group scale is too small, significant gap remains before practical deployment
Insufficient precision: 5E-3 error may be insufficiently accurate for certain applications
Missing complexity analysis: Lacks detailed computational complexity and energy consumption analysis
Insufficient generalization verification: Verified only on a simple PDE, generalization capability unknown

Impact

Academic value: Opens new direction combining optical computing with scientific computing
Technology advancement: Provides important reference for practical implementation of optical neural networks
Application prospects: Potential applications in edge computing, real-time simulation, and other fields

Applicable Scenarios

Edge computing environments: Resource-constrained scenarios requiring real-time PDE solving
Low-power applications: Energy-sensitive mobile devices or IoT devices
Concept verification: Technology verification platform for optical neural computing

References

This paper cites important works in PINN, optical neural networks, zeroth-order optimization and related fields, particularly:

Pioneering work on PINN by Raissi et al.
Related research on optical neural network training
Recent advances in tensor-compressed neural networks

Overall Assessment: This is pioneering work at the intersection of optical computing and scientific computing. Although current scale and precision are limited, it establishes important foundations for future optical PDE solver development. The technical approach is sound, experimental validation is comprehensive, and it possesses significant academic value and application prospects.