We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian duality, and 2) a reduced Stochastic MPC problem that eliminates irrelevant collision avoidance constraints, enhancing computational efficiency. Our approach is demonstrated in a simulated traffic intersection with interactive surrounding vehicles, showcasing a 12x speed-up in solving the motion planning problem. A video demonstrating the proposed architecture in multiple complex traffic scenarios can be found here: https://youtu.be/-pRiOnPb9_c. GitHub: https://github.com/MPC-Berkeley/hmpc_raidnet
- Paper ID: 2402.01116
- Title: Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
- Authors: Hansung Kim, Siddharth H. Nair, Francesco Borrelli
- Classification: cs.RO cs.LG cs.SY eess.SY
- Publication Date/Venue: arXiv preprint (submitted February 2024, last updated March 2025)
- Paper Link: https://arxiv.org/abs/2402.01116
This paper proposes a hierarchical architecture for achieving scalable real-time model predictive control (MPC) in complex multi-modal traffic scenarios. The architecture comprises two key components: 1) RAID-Net, an attention-based recurrent neural network that uses Lagrangian duality to predict relevant interactions between autonomous vehicles and surrounding vehicles within the MPC prediction horizon; 2) a simplified stochastic MPC formulation that enhances computational efficiency by eliminating irrelevant collision avoidance constraints. The method is validated in simulated traffic intersection environments, achieving a 12-fold acceleration in motion planning problem solving.
In complex urban driving scenarios, autonomous vehicles must navigate safely in highly uncertain environments while observing and responding to the behavior of heterogeneous traffic participants (human-driven and autonomous vehicles). These traffic participants exhibit multi-modal prediction characteristics, and considering these factors in motion planning presents significant challenges.
Existing motion planning methods face dual challenges of scalability and real-time performance:
- Hierarchical prediction and planning approaches: While capable of handling multi-modal predictions, they lack real-time scalability in complex scenarios
- Model-based integrated planning methods: Game-theoretic approaches suffer from excessive computational complexity in multi-vehicle scenarios
- End-to-end learning approaches: While scalable, they lack interpretability and safety guarantees
- The number of constraints in traditional MPC grows exponentially with vehicle count and modality count (O(NM^V))
- In complex traffic scenarios, most collision avoidance constraints are actually inactive
- Lack of effective constraint screening mechanisms to identify truly relevant vehicle interactions
- Proposed RAID-Net architecture: An attention-based recurrent neural network capable of predicting relevant interactions between autonomous vehicles and surrounding vehicles within the MPC prediction horizon
- Established duality-based interaction prediction theory: Utilizing Lagrangian duality and sensitivity analysis to identify active constraints
- Designed hierarchical MPC framework: Significantly reduces computational complexity through constraint screening, achieving 12-fold solving acceleration
- Constructed traffic intersection simulation environment: For training and evaluating the proposed algorithm
Input: Current environmental observation obt, including ego vehicle state, surrounding vehicle states, and semantic information
Output: Control input ut to safely guide the ego vehicle to the target location
Constraints: State-input constraints and multi-modal collision avoidance constraints
Considering V target vehicles, each with M modalities, yielding M^V scenario configurations. The optimization problem is:
minθt∑m=1MVE[∑k=tt+N−1∥Q(xk+1∣t,m−xkref)∥22+∥R(uk∣t,m−ukref)∥22]
Subject to:
- System dynamics constraints
- State-input constraints: P((xk∣t,m,uk∣t,m)∈/XUk)≤ϵ
- Collision avoidance constraints: P((xk∣t,m,ok∣t,jˉ(i,m)i)∈/CAk∣t,jˉ(i,m)i)≤ϵ
Converting the MPC problem into second-order cone programming (SOCP):
minθt21∥Qtθt∥22+CtTθts.t.Atθt+Rt∈K
The corresponding dual problem is:
minμt,ηt[μtTηtT]Rt+21∥Qt−1(AtT[μtTηtT]T−Ct)∥22
Through KKT conditions, if [μt∗]s=0, the corresponding constraint can be eliminated.
Input Encoding:
- Time-to-collision (TTC) based ego-centric graph encoding
- Transformer encoder generates scenario representation feature vector fi∈Rdem
Network Structure:
- N decoder sequences with shared parameters
- Multi-head attention mechanism captures different types of vehicle interactions
- Gated recurrent units (GRU) process temporal dependencies
- MLP layers learn complex interaction patterns
Output: Binary classification prediction of dual variables μ~t∈{0,1}nc
- Duality-guided constraint screening: First application of Lagrangian duality theory to MPC constraint screening
- Attention-based interaction modeling: Captures complex interaction patterns between vehicles through multi-head attention
- Temporal independence of recursive architecture: Achieves independence from prediction horizon length through parameter sharing
- Safety guarantees via sensitivity analysis: Quantifies constraint violation impact on cost through shadow price theory
- Simulation Environment: Custom unsignalized traffic intersection environment
- Vehicle Configuration: 1 ego vehicle + 1-3 target vehicles
- Modality Settings: Total of 16 modality configurations, resulting in 624 collision avoidance constraints
- Data Scale: 120,315 data points, 85% training set, 15% test set
- Feasibility: Percentage of MPC problems with feasible solutions
- Collision Rate: Percentage of time steps with collisions against target vehicles
- Constraint Ratio: Average percentage of enforced constraints
- Solving Time: Average MPC problem solving time
- Task Completion Time: Normalized time to reach target location
- Full MPC: Complete MPC problem with all constraints
- MLP Baseline: Comparison method using multi-layer perceptron
- Prediction Horizon: N = 14, sampling time Δt = 0.2s
- Network Parameters: Learning rate 0.001, batch size 1024, 3000 training epochs
- Loss Weights: wp = 4 (biased toward positive class prediction for improved safety)
- Solver: Gurobi for SOCP problem solving
| Performance Metric | Full MPC | HMPC |
|---|
| Feasibility (%) | 98.97 | 99.79 |
| Collision Rate (%) | 0 | 4.0 |
| Average Constraint Enforcement Rate (%) | 100 | 17.45 |
| Average Solving Time (s) | 0.92 ± 0.18 | 0.063 ± 0.073 |
| RAID-Net Query Time (s) | - | 0.013 ± 0.003 |
| Total Computation Time (s) | 0.92 ± 0.18 | 0.076 ± 0.076 |
| Normalized Task Completion Time | 1 | 0.91 |
- 12-fold Acceleration: HMPC algorithm achieves 12-fold speedup in solving time
- High Recall Rate: RAID-Net achieves 94% recall on test set, correctly predicting 98.1% of interaction dual variables
- Conservative Prediction: 17.45% constraint enforcement rate compared to 1.52% actual active constraint rate reflects safety-oriented conservative strategy
- Minor Safety Trade-off: 4% collision rate primarily caused by false negative classifications
RAID-Net demonstrates superior performance compared to standard MLP networks in loss distribution, validating the effectiveness of attention mechanisms and recursive structure.
- Hierarchical Prediction and Planning: Methods like Trajectron++ focusing on complex prediction models
- Model-based Integrated Methods: Game-theoretic and joint optimization approaches
- End-to-end Learning Methods: Deep learning approaches such as Social Attention
Compared to existing methods, this work achieves significant computational efficiency improvements while maintaining safety and provides interpretable interaction prediction mechanisms.
- Proposed the first duality-theory-based MPC constraint screening framework
- RAID-Net effectively predicts vehicle interactions, achieving real-time performance requirements
- Achieves 12-fold computational acceleration in complex traffic scenarios
- Insufficient Safety Guarantees: Lacks theoretical safety guarantees for screened constraints
- Policy Over-parameterization: Feedback policy in MPC may suffer from over-parameterization
- Limited Generalization: RAID-Net's generalization capability to different intersection topologies requires further verification
- Collision Risk: 4% collision rate requires further improvement
- Utilize DAgger algorithm to address distribution shift in behavioral cloning
- Test RAID-Net generalization performance on real traffic datasets
- Apply duality-based interaction prediction to multi-vehicle coordinated path planning
- Provide stronger theoretical safety guarantees
- Theoretical Innovation: Innovative application of Lagrangian duality theory to MPC constraint screening
- Practical Value: Significant computational acceleration enables real-time MPC in complex scenarios
- Architecture Design: Well-designed RAID-Net with attention mechanisms and recursive structure
- Comprehensive Experiments: Thorough performance evaluation in simulation environments
- Safety Trade-off: 4% collision rate may be unacceptable for safety-critical applications
- Simulation Limitations: Validation only in simplified traffic intersection scenarios, lacking complex urban environment testing
- Insufficient Theoretical Analysis: Theoretical analysis and bounds on constraint screening errors need strengthening
- Missing Real-world Validation: Lacks verification on actual vehicle platforms
This work provides novel solutions to real-time MPC problems in autonomous driving, and the application of duality theory may inspire efficient solving methods for other optimization problems.
- Autonomous driving decision-making at urban traffic intersections
- Multi-robot coordination control
- Other multi-agent systems requiring real-time optimization
The paper cites key works from multiple important domains, including:
- Trajectron++: Dynamically feasible trajectory prediction
- Stochastic MPC theoretical foundations
- Attention mechanisms and Transformer architectures
- Convex optimization and duality theory
This paper makes significant contributions in both theoretical innovation and practical value, providing effective solutions to real-time MPC problems in autonomous driving, though further improvements are needed in safety guarantees and practical deployment.