Distributed Thompson sampling under constrained communication
Zerefa, Ren, Ma et al.
In Bayesian optimization, a black-box function is maximized via the use of a surrogate model. We apply distributed Thompson sampling, using a Gaussian process as a surrogate model, to approach the multi-agent Bayesian optimization problem. In our distributed Thompson sampling implementation, each agent receives sampled points from neighbors, where the communication network is encoded in a graph; each agent utilizes their own Gaussian process to model the objective function. We demonstrate theoretical bounds on Bayesian average regret and Bayesian simple regret, where the bound depends on the structure of the communication graph. Unlike in batch Bayesian optimization, this bound is applicable in cases where the communication graph amongst agents is constrained. When compared to sequential single-agent Thompson sampling, our bound guarantees faster convergence with respect to time as long as the communication graph is connected. We confirm the efficacy of our algorithm with numerical simulations on traditional optimization test functions, demonstrating the significance of graph connectivity on improving regret convergence.
academic
Distributed Thompson sampling under constrained communication
This paper investigates multi-agent Bayesian optimization under communication constraints. The authors propose a distributed Thompson sampling algorithm using Gaussian processes as surrogate models. In this implementation, each agent receives sampling points from neighbors, with the communication network encoded as a graph structure; each agent utilizes its own Gaussian process to model the objective function. The paper establishes theoretical bounds on Bayesian average regret and Bayesian simple regret, which depend on the structure of the communication graph. Unlike batch Bayesian optimization, these bounds apply to scenarios with restricted communication graphs between agents. Compared to sequential single-agent Thompson sampling, the algorithm guarantees faster convergence rates whenever the communication graph is connected.
The core problem addressed in this paper is black-box function optimization in multi-agent systems with communication constraints. Specifically:
Black-box Stochastic Optimization Challenge: Finding the maximum of a function when the objective function is unknown and can only be accessed through noisy evaluations
Multi-agent Collaboration Requirements: Multiple agents can sample the objective function in parallel, but their mutual communication may be restricted
Communication Constraint Realism: In practical applications (e.g., multi-robot source seeking, sensor networks), agents may not have access to information from all other agents
Centralized Approaches Inapplicable: Require a central coordinator to manage all agent data, impractical in distributed scenarios
Batch Bayesian Optimization Assumptions Too Strong: Assumes all agents have access to identical information, inconsistent with communication-constrained reality
Existing Theoretical Guarantees Demanding: Prior literature providing theoretical guarantees for distributed Bayesian optimization requires fully connected communication graphs
The authors' starting point is to develop a distributed Bayesian optimization algorithm that works under arbitrary communication graph structures and provides corresponding theoretical guarantees.
For a compact set X⊂Rd, consider an unknown continuous function f:X→R, with the goal of finding its maximum. There are M agents, each capable of querying f and receiving noisy observations y=f(x)+ϵ, where ϵ∼N(0,σϵ2).
The communication network is described by a graph G=(V,E), where ∣V∣=M, and an edge (i,j)∈E indicates that agents i and j can communicate. The data accessible to agent i at time t is Dt,i={(xτ,j,yτ,j)}j∈N(i)∪{i},τ<t.
1. Set GP prior on f
2. Initialization: For i=1,...,M, set initial data D_{1,i} and GP_{0,i}
3. For t=1,...,T:
For i=1,...,M:
a) Update posterior GP_{t,i} based on D_{t,i}
b) Sample function realization from GP_{t,i}: f̂_{t,i} ~ GP_{t,i}
c) Select query point: x_{t,i} = argmax_x f̂_{t,i}(x)
d) Observe y_{t,i}
e) Broadcast (x_{t,i}, y_{t,i}) to neighbors
f) Collect evaluations C_{t,i} from neighbors
g) Update data history: D_{t+1,i} = D_{t,i} ∪ C_{t,i} ∪ {(x_{t,i}, y_{t,i})}
Coordinator-Free Design: Each agent independently maintains its own GP model, avoiding bottlenecks of centralized approaches
Communication Graph Structure Exploitation: Theoretical analysis cleverly decomposes the communication graph into disjoint cliques and analyzes the performance of each clique separately
Information-Theoretic Analysis Framework: Utilizes information-theoretic concepts such as maximum information gain (MIG) to bound algorithm performance
Experimental results strongly support theoretical predictions:
Connectivity Impact: On both Rosenbrock and Ackley functions, graphs with higher connection probability (0.6 > 0.4 > 0.2) achieve better regret convergence performance
Consistent Performance: This trend is verified across both instantaneous simple regret and average regret metrics
Algorithm Effectiveness: Distributed Thompson sampling successfully locates extrema of both test functions
Theorem 3.1 (Bayesian Average Regret Bound):
Let {Gk}k∈{1,...,n} be a set of n disjoint cliques of communication graph G. Then the Bayesian average regret after t steps satisfies:
RAB(t)≤M1∑k=1n∣Vk∣(t∣Vk∣C1+t∣Vk∣C2ξ∣Vk∣βtΨt∣Vk∣)
Corollary 3.2: Choosing n as the clique cover number θ(G) of graph G, we obtain:
RAB(t)=O~(Mtθ(G))
Theorem 3.3 (Bayesian Simple Regret Bound):
Let Gs=(Vs,Es) be a clique of G. Then:
RSB(t)≤t∣Vs∣C1+t∣Vs∣C2ξ∣Vs∣βtΨt∣Vs∣
Corollary 3.4: Choosing Gmax as the maximum clique, we obtain:
RSB(t)=O~(t∣Vmax∣1)
This paper is the first to provide theoretical guarantees for distributed Bayesian optimization under communication constraints (non-fully connected graphs), filling an important gap in the field.
Collaborative learning in edge computing environments
Parallel optimization problems with limited communication bandwidth
Overall Assessment: This is a high-quality paper with significant theoretical contributions to the field of distributed Bayesian optimization. The authors cleverly combine graph theory, information theory, and Bayesian optimization to provide theoretical guarantees for communication-constrained scenarios commonly encountered in practice. While there is room for improvement in practical applicability, its theoretical value and guidance for future research are substantial.