2025-11-10T02:49:06.456849

Dynamics-aware Diffusion Models for Planning and Control

Gadginmath, Pasqualetti

This paper addresses the problem of generating dynamically admissible trajectories for control tasks using diffusion models, particularly in scenarios where the environment is complex and system dynamics are crucial for practical application. We propose a novel framework that integrates system dynamics directly into the diffusion model's denoising process through a sequential prediction and projection mechanism. This mechanism, aligned with the diffusion model's noising schedule, ensures generated trajectories are both consistent with expert demonstrations and adhere to underlying physical constraints. Notably, our approach can generate maximum likelihood trajectories and accurately recover trajectories generated by linear feedback controllers, even when explicit dynamics knowledge is unavailable. We validate the effectiveness of our method through experiments on standard control tasks and a complex non-convex optimal control problem involving waypoint tracking and collision avoidance, demonstrating its potential for efficient trajectory generation in practical applications. Our code repository is available at www.github.com/darshangm/dynamics-aware-diffusion.

academic

Dynamics-aware Diffusion Models for Planning and Control

Basic Information

Paper ID: 2504.00236
Title: Dynamics-aware Diffusion Models for Planning and Control
Authors: Darshan Gadginmath, Fabio Pasqualetti (University of California Riverside)
Categories: cs.RO (Robotics), math.OC (Optimization and Control)
Publication Date: April 2024 (arXiv v3: October 14, 2025)
Paper Link: https://arxiv.org/abs/2504.00236

Abstract

This paper addresses the challenge of generating dynamically feasible trajectories using diffusion models in complex environments, particularly in scenarios where system dynamics are critical for practical applications. The paper proposes a novel framework that directly integrates system dynamics into the denoising process of diffusion models through sequential prediction and projection mechanisms. This mechanism aligns with the diffusion model's noise schedule, ensuring that generated trajectories are both consistent with expert demonstrations and comply with underlying physical constraints. The method can generate maximum likelihood trajectories and accurately recover trajectories generated by linear feedback controllers, even when explicit dynamics knowledge is unavailable.

Research Background and Motivation

Problem Definition

Core Issue: Traditional diffusion models lack explicit dynamics-awareness in trajectory generation, and generated trajectories frequently violate physical constraints specific to particular systems
Practical Challenges: In robotics, datasets typically contain demonstration data from different robots with varying dynamics characteristics, hindering the model's generalization ability to individual robot behaviors
Safety Considerations: In safety-critical applications, constraint violations can lead to system failures, requiring computationally expensive post-processing or real-time corrections

Research Motivation

While existing diffusion models excel at learning complex data distributions, they suffer from dynamics consistency issues in control and robotics applications
There is a need for methods that ensure physical feasibility of generated trajectories while maintaining the generative capabilities of diffusion models
The approach should be applicable to both known and unknown system dynamics scenarios

Core Contributions

Dynamics-aware Denoising Mechanism: Proposes a novel algorithm that integrates system dynamics into diffusion models by incorporating projection steps during the denoising process
Applicability to Known and Unknown Systems: Demonstrates the method's effectiveness in solving complex control problems under both known and unknown system dynamics scenarios
Theoretical Guarantees: Provides theoretical proof that the method can recover trajectories generated by linear feedback controllers and generate maximum likelihood trajectories
Computational Efficiency: Implements projection through simple matrix multiplication, avoiding nonlinear optimization problems in each denoising iteration

Methodology Details

Task Definition

Consider a stochastic discrete-time LTI system:

x(t+1) = Ax(t) + Bu(t) + w(t)

where x(t) ∈ ℝⁿ is the state vector, u(t) ∈ ℝᵐ is the control input, and w(t) ∈ ℝⁿ is zero-mean process noise.

The objective is to solve the control problem:

max R(x(0:T), u(0:T-1), E)
s.t. x(t+1) = Ax(t) + Bu(t) + w(t)
     x(0) = x_init

Model Architecture

1. Known Dynamics Case (Algorithm 1)

For linear systems, the state-control trajectory can be expressed as:

τ = F[x(0); u(0:T-1)] + F_w w(0:T-1)

where F is the system matrix composed of the free response matrix A and forced response matrix C_T.

Core Algorithm Flow:

Prediction Step: τ̂_ = μ_θ(τ'_i, i, x_init, E) + √β_i ε_i
Projection Step: τ' = (√(1-β)FF† + √β_I)τ̂_

2. Unknown Dynamics Case (Algorithm 2)

Based on Willems' Fundamental Lemma, using Hankel matrix construction:

τ = [H_{T+1}(x); H_T(u)]g

where H_{T+1}(x) and H_T(u) are Hankel matrices constructed from long-term experimental data.

Technical Innovations

Sequential Projection Mechanism: Unlike existing methods that solve nonlinear optimization at each iteration, this paper implements projection through simple matrix multiplication
Noise Schedule Alignment: Projection strength aligns with the diffusion model's noise schedule β_i, ensuring gradual convergence to the feasible trajectory space
Data-driven Extension: Handles unknown systems through Hankel matrices without requiring explicit system identification

Experimental Setup

Datasets

LQR Experiments:
- 4-dimensional double integrator system
- 10,000 synthetic trajectories of length T=30
- Initial states sampled from U-1,1⁴, target states from U-4,4⁴
Waypoint Tracking and Obstacle Avoidance:
- Non-convex optimal control problems
- 10,000 different environmental conditions
- Contains V waypoints and O circular obstacles

Evaluation Metrics

State Error: ∥x(t) - x_LQR(t)∥₂
Control Error: ∥u(t) - u_LQR(t)∥₂
Trajectory deviation from numerical optimal solutions

Baseline Methods

Vanilla Diffusion: Standard diffusion model without dynamics-awareness
Algorithm 1: Proposed method with known dynamics
Algorithm 2: Proposed method with unknown dynamics

Implementation Details

Neural Network: Encoder-decoder architecture with 3 convolutional layers, 256 hidden units
Training: Adam optimizer, 30,000 epochs, batch size 64
Diffusion Settings: Linear noise schedule β_i = 0.001i, L=1000 steps

Experimental Results

Main Results

LQR Task Performance

State Error: Proposed method (known/unknown dynamics) significantly outperforms vanilla diffusion
Control Error: Maintains low error throughout the control horizon
Numerical Comparison: Average error reduction of approximately 60-70%

Waypoint Tracking and Obstacle Avoidance

Trajectory Quality: Generates smooth, physically feasible trajectories
Constraint Satisfaction: Successfully avoids obstacles and passes through specified waypoints
Error Analysis: Significant error reduction at waypoints t=5 and t=33

Key Findings

Importance of Dynamics Constraints: Methods without dynamics-awareness produce infeasible trajectories
Known vs. Unknown Dynamics: Known dynamics method slightly outperforms, but the gap is minimal
Convergence: Sequential projection ensures trajectories gradually converge to the feasible space

Physics-aware Diffusion Models

Existing methods primarily integrate physical constraints through penalty terms in loss functions
Limitation: Cannot guarantee strict adherence to system dynamics

Diffusion Models in Motion Planning

Methods like Diffuser lack explicit dynamics integration
Typically require additional controllers for correction

Diffusion Models in Control

Most relevant works include methods projecting to feasible sets
Advantages of this work: No complete dynamics knowledge required, higher computational efficiency

Conclusions and Discussion

Main Conclusions

Successfully integrates system dynamics into the denoising process of diffusion models
Generates feasible trajectories under both known and unknown dynamics scenarios
Provides theoretical guarantees for recovering trajectories from linear feedback controllers

Limitations

Current framework primarily targets linear systems
Nonlinear systems require additional linearization transformations
Highly nonlinear systems may require more complex projection mechanisms

Future Directions

Extension to nonlinear systems
Exploration of more efficient projection mechanisms
Investigation of accelerated sampling processes for real-time control

In-depth Evaluation

Strengths

Theoretical Rigor: Provides comprehensive theoretical analysis including proofs of Lemma 1 and Theorem 2
Methodological Innovation: The sequential projection mechanism is cleverly designed, avoiding computationally expensive nonlinear optimization
Strong Practicality: Addresses both known and unknown dynamics scenarios with broad applicability
Comprehensive Experiments: Full validation from simple LQR to complex non-convex problems

Weaknesses

System Limitations: Primarily targets linear systems; nonlinear extensions require further research
Computational Complexity: While avoiding nonlinear optimization, each step still requires matrix operations
Noise Assumptions: Zero-mean process noise assumption may not hold in practical applications

Impact

Academic Contribution: Provides new perspectives for physics-aware generative models
Practical Value: Direct application potential in robot trajectory planning and control
Reproducibility: Provides complete algorithm descriptions and code repository

Applicable Scenarios

Robot trajectory planning and control
Autonomous vehicle path generation
Unmanned aerial vehicle navigation
Trajectory optimization in industrial automation

References

The paper cites important works in diffusion models, physics-aware AI, and data-driven control, particularly:

Willems' Fundamental Lemma (theoretical foundation for data-driven control)
Denoising Diffusion Probabilistic Models (DDPM foundational theory)
Related constraint-aware generative model works

Overall Assessment: This is a high-quality research paper that successfully integrates physical constraints into diffusion models, providing valuable contributions to the control and robotics fields. The method demonstrates strong innovation, rigorous theoretical analysis, comprehensive experimental validation, and good practical value with significant academic impact.