
Optimal Assignment and Motion Control in Two-Class Continuum Swarms

Emerick, Patterson, Bamieh

We consider optimal swarm control problems where two different classes of agents are present. Continuum idealizations of large-scale swarms are used where the dynamics describe the evolution of the spatially-distributed densities of each agent class. The problem formulation we adopt is motivated by applications where agents of one class are assigned to agents of the other class, which we refer to as demand and resource agents respectively. Assignments have costs related to the distances between mutually assigned agents, and the overall cost of an assignment is quantified by a Wasserstein distance between the densities of the two agent classes. When agents can move, the assignment cost can decrease at the expense of a physical motion cost, and this tradeoff sets up a nonlinear infinite-dimensional optimal control problem. We show that in one spatial dimension, this problem can be converted to an infinite-dimensional, but decoupled, linear-quadratic (LQ) tracking problem when expressed in terms of the quantile functions of the respective agent densities. Solutions are given in the general one-dimensional case, as well as in the special cases of constant and periodically time-varying demands.



Basic Information

Paper ID: 2407.18159
Title: Optimal Assignment and Motion Control in Two-Class Continuum Swarms
Authors: Max Emerick, Stacy Patterson, Bassam Bamieh
Categories: eess.SY (Systems and Control), cs.SY (Systems and Control), math.OC (Optimization and Control)
Publication date/venue: Submitted on July 24, 2024; revised on October 10, 2025
Paper link: https://arxiv.org/abs/2407.18159

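Abstract

The paper studies optimal swarm control problems involving two different classes of agents. It uses continuum idealizations of large-scale swarms, in which the dynamics describe the evolution of the spatially distributed density of each agent class. The formulation is motivated by applications where agents of one class must be assigned to agents of the other class; the two classes are called demand agents and resource agents, respectively. The cost of an assignment depends on the distances between mutually assigned agents, and the total assignment cost is quantified by the Wasserstein distance between the densities of the two classes. When agents can move, the assignment cost can be reduced at the expense of a physical motion cost, and this tradeoff gives rise to a nonlinear infinite-dimensional optimal control problem. The paper shows that in one spatial dimension, when expressed in terms of the quantile functions of the agent densities, the problem can be converted into an infinite-dimensional but decoupled linear-quadratic (LQ) tracking problem. Solutions are given for the general one-dimensional case and for the special cases of constant and periodically time-varying demand.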

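Research Background and Motivation

Problem Background

With advances in low-cost sensing, processing, and communication hardware, autonomous robot swarms are being widely deployed in emergency response, transportation, logistics, data collection, defense, and other fields. Large-scale swarms offer significant advantages in efficiency and robustness, but motion planning and coordination among agents become increasingly difficult as swarm size grows.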

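Application Scenarios

The paper's mathematical model is motivated in part by applications in edge computing and mobile cloud computing:

Demand agents: lightweight devices (such as camera-equipped drones) with limited computing and storage capacity but high mobility
Resource agents: heavy devices (such as mobile edge computing servers) with strong computing capacity but lower mobility
Typical application: video surveillance in disaster relief, where demand agents collect data and resource agents process it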

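Research Motivation

Scale challenge: traditional discrete-agent models become computationally too expensive for large-scale swarms
Continuum advantage: modeling the swarm as a density distribution significantly reduces model complexity and gives insight into macroscopic behavior
Coupling of assignment and motion: task assignment and physical motion must be optimized jointly, and there is an inherent tradeoff between them
Theoretical gap: existing work lacks a systematic theoretical analysis of this kind of coupled problem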

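Core Contributions

Novel problem formulation: combines dynamic matching with spatiotemporal control for the first time, establishing an optimal control model for continuum swarms with two classes of agents
Key mathematical transformation: shows that in one dimension, a quantile-function transformation converts the nonlinear infinite-dimensional problem into a decoupled linear-quadratic tracking problem
Analytical solution: provides an explicit analytical solution for the general one-dimensional case, which is extremely rare for problems of this type
In-depth analysis of special cases:
- Static demand: the solution follows a Wasserstein geodesic, but the time schedule is determined by the optimal control problem
- Periodic demand: the solution can be expressed as a filtered version of the tracking signal
Theoretical insight: reveals the geometric structure of the optimal solution and the nature of the performance limits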

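Method Details

Task Definition

Given an initial resource distribution $R_0$ and a time-varying demand distribution $D_t$, solve on the time interval $[0,T]$: $\min_{R,V} \int_0^T \left( W_2^2(R_t, D_t) + \alpha^2 \int_\Omega \|V_t(x)\|_2^2 R_t(x) dx \right) dt$ subject to: $\partial_t R_t(x) = -\nabla \cdot (R_t(x)V_t(x))$

where:

$W_2^2(R_t, D_t)$: the squared 2-Wasserstein distance, which quantifies the assignment cost
$V_t(x)$: the velocity field (the control variable)
$\alpha > 0$: the tradeoff parameter weighting motion cost against assignment cost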

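Model Architecture

1. Five core components

Demand distribution $D_t(x)$: may contain both continuous and discrete parts
Resource distribution $R_t(x)$: likewise may contain both continuous and discrete parts
Assignment plan $K_t(x,y)$: a two-dimensional distribution that satisfies the marginal constraints
Resource dynamics: the continuity partial differential equation
Performance objective: the tradeoff between assignment cost and motion cost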

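2. Key mathematical transformation

Quantile-function transformation: for a one-dimensional density $\mu$, define

Cumulative distribution function: $F_\mu(x) = \int_{-\infty}^x \mu(\xi) d\xi$
Quantile function: $Q_\mu(z) = \inf\{x : F_\mu(x) \geq z\}$

Core lemma: in one dimension, the squared 2-Wasserstein distance can be written as $W_2^2(\mu, \nu) = \int_0^1 (Q_\nu(z) - Q_\mu(z))^2 dz$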
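The lemma can be illustrated with a minimal sketch for empirical distributions. It assumes two samples of equal size with equal weights (a simplification that is not in the paper, which allows unequal masses); the function name `w2_squared_1d` is hypothetical. Sorting a sample yields its empirical quantile function on a uniform grid of levels $z$, so the integral over $z$ becomes an average of squared gaps.

```python
def w2_squared_1d(xs, ys):
    """Squared 2-Wasserstein distance between two equal-size, equal-weight 1D samples.

    Sorting each sample gives its empirical quantile function on a uniform grid
    of levels z, so the integral over z reduces to a mean of squared gaps.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample sizes")
    qx, qy = sorted(xs), sorted(ys)
    return sum((a - b) ** 2 for a, b in zip(qx, qy)) / len(qx)


resource = [0.0, 2.0, 1.0]
demand = [3.0, 1.0, 2.0]
print(w2_squared_1d(resource, demand))  # every quantile shifts by 1, so this prints 1.0
```

No optimization over assignment plans is needed here: in one dimension the monotone (sorted) matching is optimal for the quadratic cost.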

3. Transformation of the dynamics

Original bilinear dynamics: $\partial_t R(x,t) = -\partial_x(V(x,t)R(x,t))$

Equivalent quantile-function dynamics: $\partial_t Q_R(z,t) = U(z,t)$, where $U(z,t) = V(Q_R(z,t), t)$
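A short derivation of why the continuity equation collapses to an integrator, sketched under the assumptions that $R$ is strictly positive at $x = Q_R(z,t)$ and that the flux $V R$ vanishes as $x \to -\infty$ (neither assumption is stated in this summary). Start from the identity $F_R(Q_R(z,t), t) = z$ and differentiate with respect to $t$:

$\partial_t F_R(Q_R(z,t), t) + R(Q_R(z,t), t)\, \partial_t Q_R(z,t) = 0$

Integrating the continuity equation from $-\infty$ to $x$ gives $\partial_t F_R(x,t) = -V(x,t) R(x,t)$. Substituting $x = Q_R(z,t)$ and dividing by $R(Q_R(z,t), t)$:

$\partial_t Q_R(z,t) = V(Q_R(z,t), t) = U(z,t)$

The same change of variables, $x = Q_R(z,t)$ with $dz = R(x,t)\,dx$, turns the motion cost into a plain quadratic in $U$:

$\int_\Omega V^2(x,t) R(x,t)\, dx = \int_0^1 U^2(z,t)\, dz$

Combined with the quantile form of $W_2^2$, both the dynamics and the cost become linear-quadratic in $(Q_R, U)$.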

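Technical Innovations

1. Isometry with the space of quantile functions

There is an isometry between the $L^2$ space of quantile functions and the space of densities equipped with the 2-Wasserstein metric, so a complicated optimal transport problem becomes a simple $L^2$ problem in the space of quantile functions.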

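2. Decoupling of the infinite-dimensional problem

Using a level-set partitioning technique, the infinite-dimensional LQ tracking problem is decomposed into infinitely many independent scalar LQ tracking problems: $\min_{r_i,u_i} \int_0^T \left( (r_i(t) - d_i(t))^2 + \alpha^2 u_i^2(t) \right) dt$ subject to: $\dot{r}_i(t) = u_i(t)$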

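3. Construction of the explicit solution

The optimal control for each scalar problem has a feedback-feedforward structure: $u_i(t) = -\frac{1}{\alpha^2}(p(t)r_i(t) + y_i(t))$

where:

Feedback gain: $p(t) = \alpha \tanh((T-t)/\alpha)$
Feedforward term: $y_i(t) = \int_t^T \phi_y(t,\tau) d_i(\tau) d\tau$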
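The feedback-feedforward law can be sketched numerically for one scalar problem. The kernel $\phi_y$ is not defined in this summary, so the sketch below does not reproduce it. Instead it assumes the feedforward term satisfies the ODE $\dot{y}_i = d_i + p\,y_i/\alpha^2$ with $y_i(T) = 0$, which follows from the standard LQ tracking derivation under the sign convention of the control law above. The function name `solve_scalar_lq`, the Heun (second-order) time stepping, and the grid size are all illustrative choices, not the paper's.

```python
import math


def solve_scalar_lq(r0, d, alpha, T, n=4000):
    """Simulate r' = u with u = -(p*r + y)/alpha**2 on [0, T].

    d is a callable returning the demand quantile d_i(t).
    Returns the time grid and the state trajectory.
    """
    dt = T / n
    ts = [k * dt for k in range(n + 1)]
    p = [alpha * math.tanh((T - t) / alpha) for t in ts]  # feedback gain from the text
    a2 = alpha ** 2

    # Backward sweep (assumed ODE): y' = d + p*y/alpha^2 with y(T) = 0.
    y = [0.0] * (n + 1)
    for k in range(n, 0, -1):
        f1 = d(ts[k]) + p[k] * y[k] / a2
        y_pred = y[k] - dt * f1
        f0 = d(ts[k - 1]) + p[k - 1] * y_pred / a2
        y[k - 1] = y[k] - 0.5 * dt * (f1 + f0)

    # Forward sweep for the state under the feedback-feedforward control.
    r = [r0] * (n + 1)
    for k in range(n):
        u0 = -(p[k] * r[k] + y[k]) / a2
        r_pred = r[k] + dt * u0
        u1 = -(p[k + 1] * r_pred + y[k + 1]) / a2
        r[k + 1] = r[k] + 0.5 * dt * (u0 + u1)
    return ts, r


# Constant demand d = 1 starting from r0 = 0, with alpha = 2 and T = 10.
ts, r = solve_scalar_lq(r0=0.0, d=lambda t: 1.0, alpha=2.0, T=10.0)
print(r[-1])  # compare with the closed form 1 - 1/math.cosh(5.0)
```

For constant demand the assumed ODE gives $y_i = -p\,d_i$, so the control reduces to $u_i = -(p/\alpha^2)(r_i - d_i)$. This provides a closed-form check on the simulation, and the gain vanishes at $t = T$, so the state does not quite reach the demand.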

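Experimental Setup

Numerical Validation Scenarios

The paper validates the method mainly through theoretical analysis and numerical examples rather than large-scale experimental evaluation.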

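Static Demand Case

Resource distribution: 11 discrete agents of unequal mass
Demand distribution: a continuous static distribution
Parameter settings: $\alpha = 2$, $T = 10$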

Periodic Demand Case

Demand function: Gaussian mixture model $D(x,t) = (1 + \sin(2\pi t))\mathcal{N}(2.5, 1) + (1 - \sin(2\pi t))\mathcal{N}(7.5, 1)$
Parameter variation: $\alpha \in \{0.08, 1, >1\}$
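In the quantile formulation, the tracking signal for level $z$ is the demand quantile $Q_D(z,t)$. The sketch below shows one way to evaluate it for the Gaussian mixture above by inverting the CDF with bisection. It assumes the mixture is normalized to unit mass by halving both weights (the formula as written has total mass 2) and that both components have unit standard deviation. The function names and the bracketing interval are illustrative, not taken from the paper.

```python
import math


def normal_cdf(x, mean, std=1.0):
    """CDF of a normal distribution, via the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def demand_cdf(x, t):
    # Assumption: weights are halved so that the mixture has unit total mass.
    w = 0.5 * (1.0 + math.sin(2.0 * math.pi * t))
    return w * normal_cdf(x, 2.5) + (1.0 - w) * normal_cdf(x, 7.5)


def demand_quantile(z, t, lo=-20.0, hi=30.0, iters=80):
    """Invert the demand CDF at level 0 < z < 1 by bisection on [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if demand_cdf(mid, t) < z:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Median of the demand at two phases of the period.
print(demand_quantile(0.5, 0.0))   # equal weights: symmetric about x = 5
print(demand_quantile(0.5, 0.25))  # all mass on the component centred at 2.5
```

Sampling `demand_quantile` on a grid of levels $z_i$ and times yields the signals $d_i(t)$ that the scalar LQ problems track.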

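Evaluation Metrics

Optimal cost function value
Trajectory convergence: how closely the resource distribution approaches the demand distribution
Geometric properties: verifying whether the solution follows a Wasserstein geodesic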

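Experimental Results

Main Results

Static Demand Case

Geometric structure: the optimal trajectory is a straight line in the space of quantile functions, which corresponds to a Wasserstein geodesic in the space of densities
Time schedule: unlike the constant rate of classical dynamic optimal transport, the rate here is determined by $\phi_r(t,0)$
Cost decomposition: $J = W_2^2(R_0, \bar{D}) \alpha \tanh(T/\alpha) + T W_2^2(D, \bar{D})$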