
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions

Kim, Nair, Borrelli

We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian duality, and 2) a reduced Stochastic MPC problem that eliminates irrelevant collision avoidance constraints, enhancing computational efficiency. Our approach is demonstrated in a simulated traffic intersection with interactive surrounding vehicles, showcasing a 12x speed-up in solving the motion planning problem. A video demonstrating the proposed architecture in multiple complex traffic scenarios can be found here: https://youtu.be/-pRiOnPb9_c. GitHub: https://github.com/MPC-Berkeley/hmpc_raidnet


Basic Information

Paper ID: 2402.01116
Title: Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Authors: Hansung Kim, Siddharth H. Nair, Francesco Borrelli
Classification: cs.RO cs.LG cs.SY eess.SY
Publication Date/Venue: arXiv preprint (submitted February 2024, last updated March 2025)
Paper Link: https://arxiv.org/abs/2402.01116

Abstract

This paper proposes a hierarchical architecture for achieving scalable real-time model predictive control (MPC) in complex multi-modal traffic scenarios. The architecture comprises two key components: 1) RAID-Net, an attention-based recurrent neural network that uses Lagrangian duality to predict relevant interactions between autonomous vehicles and surrounding vehicles within the MPC prediction horizon; 2) a simplified stochastic MPC formulation that enhances computational efficiency by eliminating irrelevant collision avoidance constraints. The method is validated in simulated traffic intersection environments, achieving a 12-fold acceleration in motion planning problem solving.

Research Background and Motivation

Core Problem

In complex urban driving scenarios, autonomous vehicles must navigate safely in highly uncertain environments while observing and responding to the behavior of heterogeneous traffic participants (human-driven and autonomous vehicles). These traffic participants exhibit multi-modal prediction characteristics, and considering these factors in motion planning presents significant challenges.

Problem Significance

Existing motion planning methods face dual challenges of scalability and real-time performance:

Hierarchical prediction and planning approaches: While capable of handling multi-modal predictions, they lack real-time scalability in complex scenarios
Model-based integrated planning methods: Game-theoretic approaches suffer from excessive computational complexity in multi-vehicle scenarios
End-to-end learning approaches: While scalable, they lack interpretability and safety guarantees

Limitations of Existing Methods

The number of collision avoidance constraints in traditional MPC grows exponentially with the number of vehicles V, given M modalities per vehicle, scaling as O(NM^V) over a horizon of N steps
In complex traffic scenarios, most collision avoidance constraints are actually inactive
Lack of effective constraint screening mechanisms to identify truly relevant vehicle interactions
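To make the O(NM^V) growth concrete, the following minimal sketch counts collision avoidance constraints under one simplifying assumption that is not taken from the paper: one constraint per time step, per target vehicle, per joint scenario. The function name and the choice M = 2 are hypothetical; N = 14 matches the horizon reported in the implementation details.

```python
def collision_constraint_count(horizon: int, num_vehicles: int, num_modes: int) -> int:
    """Count constraints assuming one per time step, per vehicle, per joint scenario.

    This counting rule is an illustrative assumption, not the paper's exact formula.
    """
    num_scenarios = num_modes ** num_vehicles  # M^V joint mode combinations
    return horizon * num_vehicles * num_scenarios


# Adding a vehicle multiplies the scenario count by M, so the total grows exponentially in V.
for v in (1, 2, 3):
    print(v, collision_constraint_count(horizon=14, num_vehicles=v, num_modes=2))
```

Because the scenario count is multiplied by M for each added vehicle, screening out inactive constraints pays off more as the scene gets busier.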

Core Contributions

Proposed RAID-Net architecture: An attention-based recurrent neural network capable of predicting relevant interactions between autonomous vehicles and surrounding vehicles within the MPC prediction horizon
Established duality-based interaction prediction theory: Utilizing Lagrangian duality and sensitivity analysis to identify active constraints
Designed hierarchical MPC framework: Significantly reduces computational complexity through constraint screening, achieving 12-fold solving acceleration
Constructed traffic intersection simulation environment: For training and evaluating the proposed algorithm

Methodology Details

Task Definition

Input: Current environmental observation $ob_t$, including ego vehicle state, surrounding vehicle states, and semantic information
Output: Control input $u_t$ to safely guide the ego vehicle to the target location
Constraints: State-input constraints and multi-modal collision avoidance constraints

Model Architecture

1. Stochastic MPC Problem Formulation

Consider V target vehicles, each with M modalities, which yields M^V scenario configurations. The optimization problem is:

$\min_{\theta_t} \sum_{m=1}^{M^V} \mathbb{E}\left[\sum_{k=t}^{t+N-1} \|Q(x_{k+1|t,m}-x^{ref}_k)\|_2^2 + \|R(u_{k|t,m}-u^{ref}_k)\|_2^2\right]$

Subject to:

System dynamics constraints
State-input constraints: $P((x_{k|t,m}, u_{k|t,m}) \notin XU_k) \leq \epsilon$
Collision avoidance constraints: $P((x_{k|t,m}, o^i_{k|t,\bar{j}(i,m)}) \notin CA^i_{k|t,\bar{j}(i,m)}) \leq \epsilon$
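The scenario index m and the mode lookup $\bar{j}(i,m)$ used in the constraints can be pictured as an enumeration of joint mode assignments. The sketch below is one possible way to build that enumeration; the helper names and the 0-based indexing are assumptions made for illustration, not the authors' implementation.

```python
from itertools import product


def enumerate_scenarios(num_vehicles: int, num_modes: int):
    """Return all M^V joint mode assignments, one tuple per scenario."""
    return list(product(range(num_modes), repeat=num_vehicles))


def mode_of(scenarios, vehicle: int, scenario: int) -> int:
    """Hypothetical stand-in for j-bar(i, m): the mode of a vehicle in a scenario."""
    return scenarios[scenario][vehicle]


scenarios = enumerate_scenarios(num_vehicles=3, num_modes=2)
print(len(scenarios))            # number of joint scenarios, M^V
print(mode_of(scenarios, 2, 1))  # mode of vehicle 2 in scenario 1 (0-based)
```

Each scenario tuple fixes one mode per target vehicle, so a collision avoidance constraint for vehicle i in scenario m only needs the obstacle prediction for that single mode.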

2. Duality Theory Analysis

The MPC problem is converted into a second-order cone program (SOCP): $\min_{\theta_t} \frac{1}{2}\|Q_t\theta_t\|_2^2 + C_t^T\theta_t \quad \text{s.t.} \quad A_t\theta_t + R_t \in K$

The corresponding dual problem is: $\min_{\mu_t,\eta_t} [\mu_t^T \eta_t^T]R_t + \frac{1}{2}\|Q_t^{-1}(A_t^T[\mu_t^T \eta_t^T]^T - C_t)\|_2^2$

By the KKT conditions, if the optimal dual variable satisfies $[\mu_t^*]_s = 0$, the corresponding constraint can be eliminated without changing the optimal solution.
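The screening idea can be illustrated on a toy problem that is much simpler than the SOCP above. The sketch below, which is an assumption-laden stand-in and not the paper's formulation, minimizes $\frac{1}{2}\|x - c\|_2^2$ subject to $x_i \leq b_i$. For this separable problem the KKT conditions give the closed form $x_i = \min(c_i, b_i)$ and $\mu_i = \max(c_i - b_i, 0)$, so the effect of dropping zero-multiplier constraints can be checked directly.

```python
def solve_projection(c, b, enforced):
    """Minimize 0.5 * sum((x_i - c_i)^2) s.t. x_i <= b_i for i in `enforced`.

    Returns the primal solution x and the dual variables mu from the KKT
    conditions (stationarity x = c - mu, complementary slackness).
    """
    x, mu = [], []
    for i, (ci, bi) in enumerate(zip(c, b)):
        if i in enforced and ci > bi:
            x.append(bi)        # constraint is active: x_i sits on the bound
            mu.append(ci - bi)  # strictly positive multiplier
        else:
            x.append(ci)        # unconstrained minimizer is already feasible
            mu.append(0.0)      # zero multiplier
    return x, mu


c = [1.0, 3.0, -2.0, 0.5]
b = [2.0, 1.0, 0.0, 0.5]

x_full, mu_full = solve_projection(c, b, enforced=set(range(len(c))))
relevant = {i for i, m in enumerate(mu_full) if m > 0.0}  # keep nonzero duals only
x_reduced, _ = solve_projection(c, b, enforced=relevant)

print(relevant, x_full == x_reduced)
```

Only one of the four constraints carries a nonzero multiplier here, and the reduced problem returns the same minimizer. The last constraint is tight but has a zero multiplier, which shows that the dual variable, rather than tightness alone, decides whether a constraint can be dropped.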

3. RAID-Net Architecture Design

Input Encoding:

Time-to-collision (TTC) based ego-centric graph encoding
Transformer encoder generates scenario representation feature vector $f_i \in \mathbb{R}^{d_{em}}$

Network Structure:

N decoder sequences with shared parameters
Multi-head attention mechanism captures different types of vehicle interactions
Gated recurrent units (GRU) process temporal dependencies
MLP layers learn complex interaction patterns

Output: Binary classification prediction of dual variables $\tilde{\mu}_t \in \{0,1\}^{n_c}$
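Once the binary vector $\tilde{\mu}_t$ is available, building the reduced MPC amounts to keeping only the flagged constraints. The following sketch shows that selection step and the resulting constraint ratio; the function name and the example vector are hypothetical and only illustrate the bookkeeping.

```python
def select_constraints(predicted_mu):
    """Return indices of constraints to enforce and the fraction kept.

    `predicted_mu` is a 0/1 list: 1 means the constraint is predicted relevant.
    """
    kept = [s for s, flag in enumerate(predicted_mu) if flag == 1]
    ratio = len(kept) / len(predicted_mu)
    return kept, ratio


# Hypothetical prediction over 8 constraints, 2 of which are flagged as relevant.
kept, ratio = select_constraints([0, 1, 0, 0, 1, 0, 0, 0])
print(kept, ratio)
```

A false positive in this vector only costs extra solve time, whereas a false negative removes a constraint the full problem would have needed.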

Technical Innovations

Duality-guided constraint screening: First application of Lagrangian duality theory to MPC constraint screening
Attention-based interaction modeling: Captures complex interaction patterns between vehicles through multi-head attention
Horizon independence of the recurrent architecture: Achieves independence from prediction horizon length through parameter sharing
Safety guarantees via sensitivity analysis: Quantifies constraint violation impact on cost through shadow price theory

Experimental Setup

Dataset

Simulation Environment: Custom unsignalized traffic intersection environment
Vehicle Configuration: 1 ego vehicle + 1-3 target vehicles
Modality Settings: Total of 16 modality configurations, resulting in 624 collision avoidance constraints
Data Scale: 120,315 data points, 85% training set, 15% test set

Evaluation Metrics

Feasibility: Percentage of MPC problems with feasible solutions
Collision Rate: Percentage of time steps with collisions against target vehicles
Constraint Ratio: Average percentage of enforced constraints
Solving Time: Average MPC problem solving time
Task Completion Time: Normalized time to reach target location

Comparison Methods

Full MPC: Complete MPC problem with all constraints
MLP Baseline: Comparison method using multi-layer perceptron

Implementation Details

Prediction Horizon: N = 14, sampling time Δt = 0.2s
Network Parameters: Learning rate 0.001, batch size 1024, 3000 training epochs
Loss Weights: wp = 4 (biased toward positive class prediction for improved safety)
Solver: Gurobi for SOCP problem solving
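The positive-class weight wp = 4 can be read as making a missed relevant constraint four times as costly as a spurious one during training. The sketch below assumes a standard positively weighted binary cross-entropy; this summary does not give the paper's exact loss expression, so the form used here is an assumption.

```python
import math


def weighted_bce(probs, labels, pos_weight=4.0, eps=1e-12):
    """Mean binary cross-entropy with the positive-class term scaled by pos_weight."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(pos_weight * y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


false_negative = weighted_bce([0.1], [1])  # relevant constraint predicted as unlikely
false_positive = weighted_bce([0.9], [0])  # irrelevant constraint predicted as likely
print(false_negative / false_positive)
```

With symmetric confidence, the false negative is penalized about four times as heavily, which is consistent with the safety-oriented bias described above.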

Experimental Results

Main Results

Performance Metric	Full MPC	HMPC
Feasibility (%)	98.97	99.79
Collision Rate (%)	0	4.0
Average Constraint Enforcement Rate (%)	100	17.45
Average Solving Time (s)	0.92 ± 0.18	0.063 ± 0.073
RAID-Net Query Time (s)	-	0.013 ± 0.003
Total Computation Time (s)	0.92 ± 0.18	0.076 ± 0.076
Normalized Task Completion Time	1	0.91

Key Findings

12-fold Acceleration: HMPC achieves a roughly 12-fold speedup in total computation time (0.076 s vs. 0.92 s), including the RAID-Net query time
High Recall Rate: RAID-Net achieves 94% recall on test set, correctly predicting 98.1% of interaction dual variables
Conservative Prediction: 17.45% constraint enforcement rate compared to 1.52% actual active constraint rate reflects safety-oriented conservative strategy
Minor Safety Trade-off: 4% collision rate primarily caused by false negative classifications
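The speedup figure can be reproduced from the mean times in the results table. The short calculation below uses only those reported means; the variable names are illustrative.

```python
full_mpc_total = 0.92   # s, Full MPC total computation time
hmpc_solve = 0.063      # s, reduced MPC solving time
raid_net_query = 0.013  # s, RAID-Net query time

hmpc_total = hmpc_solve + raid_net_query
speedup_total = full_mpc_total / hmpc_total  # includes the network query
speedup_solve = full_mpc_total / hmpc_solve  # solver time only

print(round(speedup_total, 1), round(speedup_solve, 1))  # 12.1 14.6
```

The 12x figure therefore corresponds to the end-to-end time including the network query, while the solver alone is about 14.6x faster on average.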

Ablation Studies

RAID-Net achieves a better loss distribution than the standard MLP baseline, supporting the effectiveness of the attention mechanisms and recurrent structure.

Main Research Directions

Hierarchical Prediction and Planning: Methods like Trajectron++ focusing on complex prediction models
Model-based Integrated Methods: Game-theoretic and joint optimization approaches
End-to-end Learning Methods: Deep learning approaches such as Social Attention

Advantages of This Work

Compared to existing methods, this work achieves a significant computational efficiency improvement (about 12-fold) at the cost of a small safety trade-off, and provides an interpretable interaction prediction mechanism.

Conclusions and Discussion

Main Conclusions

Proposed the first duality-theory-based MPC constraint screening framework
RAID-Net effectively predicts vehicle interactions and meets real-time performance requirements
Achieves 12-fold computational acceleration in complex traffic scenarios

Limitations

Insufficient Safety Guarantees: Lacks theoretical safety guarantees for screened constraints
Policy Over-parameterization: Feedback policy in MPC may suffer from over-parameterization
Limited Generalization: RAID-Net's generalization capability to different intersection topologies requires further verification
Collision Risk: 4% collision rate requires further improvement

Future Directions

Utilize DAgger algorithm to address distribution shift in behavioral cloning
Test RAID-Net generalization performance on real traffic datasets
Apply duality-based interaction prediction to multi-vehicle coordinated path planning
Provide stronger theoretical safety guarantees

In-Depth Evaluation

Strengths

Theoretical Innovation: Innovative application of Lagrangian duality theory to MPC constraint screening
Practical Value: Significant computational acceleration enables real-time MPC in complex scenarios
Architecture Design: Well-designed RAID-Net with attention mechanisms and recurrent structure
Comprehensive Experiments: Thorough performance evaluation in simulation environments

Weaknesses

Safety Trade-off: 4% collision rate may be unacceptable for safety-critical applications
Simulation Limitations: Validation only in simplified traffic intersection scenarios, lacking complex urban environment testing
Insufficient Theoretical Analysis: Theoretical analysis and bounds on constraint screening errors need strengthening
Missing Real-world Validation: Lacks verification on actual vehicle platforms

Impact

This work provides novel solutions to real-time MPC problems in autonomous driving, and the application of duality theory may inspire efficient solving methods for other optimization problems.

Applicable Scenarios

Autonomous driving decision-making at urban traffic intersections
Multi-robot coordination control
Other multi-agent systems requiring real-time optimization

References

The paper cites key works from multiple important domains, including:

Trajectron++: Dynamically feasible trajectory prediction
Stochastic MPC theoretical foundations
Attention mechanisms and Transformer architectures
Convex optimization and duality theory

This paper makes significant contributions in both theoretical innovation and practical value, providing effective solutions to real-time MPC problems in autonomous driving, though further improvements are needed in safety guarantees and practical deployment.