2025-11-13T12:43:11.038101

Knowledge-aware equation discovery with automated background knowledge extraction

Ivanchik, Hvatov

In differential equation discovery algorithms, a priori expert knowledge is mainly used implicitly to constrain the form of the expected equation, making it impossible for the algorithm to truly discover equations. Instead, most differential equation discovery algorithms try to recover the coefficients for a known structure. In this paper, we describe an algorithm that allows the discovery of unknown equations using automatically or manually extracted background knowledge. Instead of imposing rigid constraints, we modify the structure space so that certain terms are likely to appear within the crossover and mutation operators. In this way, we mimic expertly chosen terms while preserving the possibility of obtaining any equation form. The paper shows that the extraction and use of knowledge allows it to outperform the SINDy algorithm in terms of search stability and robustness. Synthetic examples are given for Burgers, wave, and Korteweg--De Vries equations.

academic

Knowledge-aware equation discovery with automated background knowledge extraction

Basic Information

Paper ID: 2501.00444
Title: Knowledge-aware equation discovery with automated background knowledge extraction
Authors: Elizaveta Ivanchik, Alexander Hvatov (ITMO University)
Category: cs.AI
Publication Date: January 3, 2025 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2501.00444

Abstract

In differential equation discovery algorithms, prior expert knowledge is primarily used implicitly to constrain the form of desired equations, preventing algorithms from truly discovering equations. Instead, most differential equation discovery algorithms attempt to recover coefficients of known structures. This paper describes an algorithm that enables the discovery of unknown equations using automatically or manually extracted background knowledge. Rather than imposing rigid constraints, the algorithm modifies the structure space to make certain terms more likely to appear in crossover and mutation operators. In this manner, the algorithm simulates expert term selection while preserving the possibility of obtaining any equation form. Experiments demonstrate that knowledge extraction and utilization make it superior to the SINDy algorithm in terms of search stability and robustness.

Research Background and Motivation

Problem Definition

Differential equation discovery is an important task for extracting interpretable physical models from observational data. Current differential equation discovery methods face the following challenges:

Over-reliance on Prior Knowledge: Existing methods such as SINDy primarily constrain equation form through predefined term libraries, essentially performing coefficient recovery rather than true equation discovery
Structure Space Limitations: Gradient-based optimization methods can only search within fixed structure spaces, limiting the ability to discover novel equations
Rigid Knowledge Utilization: Existing methods either completely avoid using background knowledge or impose overly strict structural constraints

Research Motivation

The core motivation of this paper is to develop a differential equation discovery algorithm that can:

Automatically extract and utilize background knowledge
Guide the search process while maintaining structural flexibility
Improve the stability and robustness of equation discovery

Core Contributions

Proposed a Knowledge-Aware Equation Discovery Framework: Developed an improved EPDE-based algorithm that leverages background knowledge by modifying probability distributions rather than imposing hard constraints
Designed an Automated Knowledge Extraction Mechanism: Automatically generates initial guesses based on an improved SymNet architecture and converts them into term importance distributions
Implemented Soft Knowledge Guidance: Modifies the probability distributions of crossover and mutation operators to guide the optimization process while maintaining the integrity of the search space
Validated Method Effectiveness: Experiments on Burgers equation, wave equation, and KdV equation demonstrate that the method outperforms SINDy in stability and robustness

Methodology Details

Task Definition

Given observational data $X = \{x^{(i)}\}_{i=1}^N$ on a discrete grid and corresponding observations $U = \{u^{(i)}\}_{i=1}^N$ , the goal is to discover a differential equation model describing the data:

$M(S, P, x) \rightarrow u(x) : M(S, P, x^{(i)}) \rightarrow u(x_i) \sim u^{(i)}$

where $S$ denotes structure and $P$ denotes parameters.

Model Architecture

1. Baseline EPDE Algorithm

The EPDE algorithm uses parameterized tokens as basic building blocks: $t = t(\pi_1, ..., \pi_n)$

Tokens combine to form terms: $T = t_1 \cdot ... \cdot t_{T_{length}}$ , with model form: $M(S, \{C,P\}) = \sum_{j=1}^{N_{terms}} C_j T_j$

2. Knowledge-Aware Improvements

The key innovation involves introducing term importance distributions to guide evolutionary operators:

Improved Crossover Operator: Selects terms participating in crossover according to term importance distribution rather than uniform selection.

Improved Mutation Operator:

Token replacement: Selects new tokens according to importance distribution
Term generation: Generates new terms using importance distribution

3. Automated Knowledge Extraction

Generates initial guesses using an improved SymNet architecture:

SymNet Modifications: Extends the original architecture to support arbitrary temporal derivative forms: $U_t = F(t, x, U, U_x, U_{xx}, U_{tt}, U_{ttt}, ...)$ $U_{tt} = F(t, x, U, U_x, U_t, U_{xx}, U_{ttt}, ...)$

Probability Distribution Computation:

Maps SymNet output to EPDE term space
Applies coefficient smoothing (mixing factor mf controls this)
Normalizes to obtain probability distribution

Technical Innovations

Soft Constraint Mechanism: Introduces background knowledge through probability distributions rather than hard constraints, preserving search space completeness
Adaptive Knowledge Extraction: Automatically extracts term importance from initial guesses without manual definition
Mixing Factor Adjustment: Balances the credibility of initial guesses through mixing factor to prevent over-reliance on inaccurate guesses

Experimental Setup

Datasets

Experiments use five classical partial differential equations:

Burgers Equation (Inviscid): $u_t + uu_x = 0$
Burgers Equation (Viscous): $u_t + uu_x - 0.1u_{xx} = 0$
Wave Equation: $u_{tt} - \frac{1}{25}u_{xx} = 0$
KdV Equation: $u_t + 6uu_x + u_{xxx} = 0$
Inhomogeneous KdV Equation: $u_t + 6uu_x + u_{xxx} = \cos t \sin x$

Evaluation Metrics

Mean Absolute Error (MAE): Computes error between discovered and true equation coefficients
Structural Hamming Distance (SHD): Measures difference between discovered and true equation structures
Success Rate: Proportion of successful equation discoveries among 50 runs
Convergence Time: Time required for algorithm to reach convergence

Comparison Methods

Classical EPDE Algorithm: Serves as baseline method
PySINDy Framework: Current mainstream differential equation discovery method
SymNet: Used to evaluate initial guess quality

Implementation Details

Each experiment runs 50 times for statistical results
Noise levels: 0%, 25%, 50%, 75%, 100% (relative to limit noise level)
Mixing factor: Default value 2.4, with optimization via KL divergence also tested

Experimental Results

Main Results

1. Comparison with SINDy

Experiments on multiple equations demonstrate:

Improved Stability: The improved algorithm shows more stable performance under high-noise conditions
Accuracy Advantages: Achieves lower MAE in most cases
Enhanced Robustness: Performance degrades more slowly as noise increases

2. Success Rate Improvement

According to results in Tables A.3 and A.4:

Complex Equations: Most significant success rate improvement for inhomogeneous KdV equation, reaching up to 72%
Simple Equations: Limited improvement for simple equations already achieving high success rates
Average Improvement: Average noise robustness improvement of 12.5%, ranging from 2% to 32%

3. Time Consumption

Classical EPDE: Approximately 5 seconds
Improved Algorithm: Approximately 15 seconds
PySINDy: Approximately 0.01 seconds

Ablation Studies

Mixing Factor Sensitivity Analysis

Tests impact of different mixing factors (2.4, 3.0, 3.6, 4.5):

Mixing factors optimized via KL divergence typically perform best
Appropriate mixing factor adjustment can provide additional 30% improvement in discovery rate

SymNet Initial Guess Quality

SymNet performance varies significantly across equations:

Simple Equations: Burgers equation MAE = 0.0058 ± 0.0008
Complex Equations: Inhomogeneous KdV equation MAE = 0.1497 ± 0.0214

Case Analysis

Using the wave equation as an example, the improved algorithm can discover second-order temporal derivative equations that PySINDy cannot handle, demonstrating the structural flexibility of the method.

Classification of Equation Discovery Methods

The paper categorizes existing methods into two types:

Type I (Gradient Optimization): Fixed structure, optimized parameters (e.g., SINDy, PDE-Net)
Type II (Genetic Programming): Simultaneous optimization of structure and parameters (e.g., EPDE, PySR)

Knowledge Integration Approaches

Syntactic Rules: Expert-defined syntactic constraints
Bayesian Methods: Knowledge integration based on prior distributions
Structural Constraints: Hard constraints from predefined term libraries

This paper's method represents an improvement to Type II through soft knowledge guidance via probability distributions.

Conclusions and Discussion

Main Conclusions

Soft Constraints are Effective: Introducing background knowledge through probability distributions proves more effective than hard constraints
Automated Knowledge Extraction is Feasible: The automated knowledge extraction mechanism based on SymNet improves search performance
Complex Equations Benefit More: The method shows more significant improvements for complex differential equations

Limitations

Computational Overhead: Computation time significantly increases compared to SINDy
Initial Guess Dependency: Method performance is affected by SymNet initial guess quality
Parameter Sensitivity: Key parameters such as mixing factor require careful tuning

Future Directions

Optimize Computational Efficiency: Reduce SymNet invocations and improve overall efficiency
Improve Initial Guesses: Develop more accurate initial equation guess methods
Expand Application Domains: Test method effectiveness on more equation types

In-Depth Evaluation

Strengths

Innovative Knowledge Integration Mechanism: Proposes a novel approach to leverage background knowledge through probability distribution modification rather than hard constraints
Complete Automated Pipeline: End-to-end automation from knowledge extraction to equation discovery
Comprehensive Experimental Validation: Thorough testing on multiple classical equations, including noise robustness analysis
Solid Theoretical Foundation: Explains method rationality from the perspective of probability measure geometry

Weaknesses

Computational Efficiency Issues: Significant computational overhead compared to existing methods, limiting practical applications
Method Complexity: Involves multiple components (SymNet, EPDE, probability distribution computation), increasing implementation difficulty
Parameter Tuning Requirements: Key parameters such as mixing factor require problem-specific adjustment
Limited Theoretical Analysis: Lacks theoretical guarantees for convergence and optimality

Impact

Academic Contribution: Provides a new paradigm for knowledge integration in differential equation discovery
Practical Value: Demonstrates advantages in handling complex, high-noise data
Reproducibility: Provides open-source code and detailed experimental settings

Applicable Scenarios

This method is particularly suitable for:

Complex differential equation discovery tasks
Equation recovery in high-noise environments
Applications requiring structural flexibility
Situations with partial prior knowledge but uncertain complete structure

References

The paper cites major works in the differential equation discovery field, including:

SINDy series methods 8, 10, 26, 28
PDE-Net series 12, 32
EPDE algorithm 14, 25, 30, 31
Symbolic regression methods 15, 29
Knowledge extraction related work 1-6, 16-24

Overall Assessment: This is a high-quality research paper that proposes an innovative knowledge-aware differential equation discovery method. While it has limitations in computational efficiency, it demonstrates excellent performance in methodological innovation, experimental completeness, and practical effectiveness, making a valuable contribution to the field's development.