Adaptive Decentralized Queue Disclosure for Impatient Tenants in Edge and Non-terrestrial Systems
Kiggundu, Han, Schotten
We study how queue-state information disclosures affect impatient tenants in multi-tenant edge systems. We propose an information-bulletin strategy in which each queue periodically broadcasts two Markov models. One is a model of steady-state service-rate behavior and the other a model of the queue length inter-change times. Tenants autonomously decide to renege or jockey based on this information. The queues observe tenant responses and adapt service rates via a learned, rule-based predictive policy designed for decentralized, partially-observed, and time-varying environments. We compare this decentralized, information-driven policy to the classical, centralized Markov Decision Process (MDP) hedging-point policy for M/M/2 systems. Numerical experiments quantify the tradeoffs in average delay, impatience and robustness to stale information. Results show that when full, instantaneous state information and stationarity hold, the hedging-point policy yields less impatience but this diminishes as information becomes partial or stale. The rule-based predictive policy on the other hand is more robust to staleness in dispatched information, making it conducive for conditions typical of edge cloud and non-terrestrial deployments.
academic
Adaptive Decentralized Queue Disclosure for Impatient Tenants in Edge and Non-terrestrial Systems
This paper investigates how queue state information disclosure affects impatient tenants in multi-tenant edge systems. The authors propose an information announcement strategy where each queue periodically broadcasts two Markov models: one characterizing steady-state service rate behavior and another modeling queue length change timing. Tenants autonomously decide whether to abandon or switch queues based on this information. Queues observe tenant responses and adapt service rates through a learning-based rule-driven prediction strategy designed for decentralized, partially observable, and time-varying environments. Numerical experiments quantify trade-offs between average delay, impatience levels, and robustness to stale information.
In heterogeneous 5G/6G deployments, multi-tenant resource sharing is driven not only by static configurations but increasingly by autonomous tenant decisions (e.g., whether to offload tasks to remote queues or process locally). Queue state disclosure (such as queue length, waiting time estimates, or service statistics) can significantly alter tenant behavior and trigger resource competition through queue jockeying and reneging.
Modern multi-access edge computing (MEC) and non-terrestrial network (NTN) environments are decentralized, characterized by partial and stale state broadcasts, and exhibit time-varying channels and mobility. In such environments, assuming a single central controller with instantaneous global state is unrealistic. However, existing disclosure rules and heuristics are typically developed for static or lightly mobile settings and fail to address three fundamental questions in decentralized control:
Traditional centralized optimization methods (such as hedging point policies) assume complete, instantaneous state information and stationarity conditions that often do not hold under typical edge cloud and non-terrestrial deployment conditions. Existing approaches show significantly degraded performance when information becomes partial or stale.
Information Announcement Concept: Introduces the information announcement concept for multi-tenant queues and formalizes two Markov descriptors (service rate distribution and change timing) as tunable state summaries suitable for resource-constrained control channels.
Theoretical Analysis: Derives closed-form expressions for queue jockeying and reneging probabilities under these descriptors and formulates a joint impatience minimization problem balancing delay, jockeying, and reneging. Proves the optimization problem is analytically intractable.
Practical Strategy: Proposes a practical rule-based prediction strategy that learns service rate vectors from tenant responses and adapts service rates online.
Comprehensive Evaluation: Provides extensive numerical evaluation quantifying the value of different announcement models and distribution intervals, demonstrating the robustness of the learning strategy under heterogeneous workloads.
Consider an M/M/2 queueing system with two queues i and j. New arrivals follow a Poisson distribution with total arrival rate λ = λᵢ + λⱼ. Each queue distributes its state information to tenants at interval r seconds, introducing staleness. The objective is to minimize a composite performance measure of average delay, jockeying events, and reneging (tenant impatience).
The service rate distribution of queue i or j in steady state follows a K-state continuous-time Markov chain (CTMC) with service rates {μᵢ}ᵢ₌₁ᴷ and {μⱼ}ⱼ₌₁ᴷ. The effective service rate is defined as:
This model quantifies the frequency of transitions in the queue system. For a queue in state n, only arrival events change state when n=0, while both arrival and departure events may occur when n≥1. The Markov model is defined as:
Better queues are determined by comparing cumulative distribution functions FX(μₖ) and FY(μₖ). X first-order stochastically dominates Y if PX > x ≥ PY > x ∀x ∈ ℝ.
Information Model Comparison: The Markov service rate model produces less impatient behavior than the queue length change timing model, as it provides direct mapping of processing speed.
Distribution Frequency Optimization: Optimal performance is achieved at 5-7 second intervals, where impatience is minimized and system stability is maintained, particularly when requests obtain service rate information.
Strategy Comparison:
Hedging point policy: More stable but higher reneging and jockeying rates
Rule-based strategy: More variable but may record lower rates at smaller intervals
Optimization Effectiveness: The optimized strategy is statistically robust, producing lower and more consistent objective values (mean=0.53 vs 1.78 without optimization).
When strategies are embedded, waiting times for reneging and jockeying requests are significantly reduced, with greater optimality observed particularly when distributing the Markov service rate model.
The paper cites important literature from queueing theory, behavior modeling, and edge computing, including:
Y. Ouyang and D. Teneketzis's research on decentralized routing signaling
B. Lin et al.'s work on optimal policies for dual-server queueing systems
3GPP technical specifications on network slicing management and orchestration
Overall Assessment: This is a high-quality research paper at the intersection of queueing theory and edge computing, proposing innovative information disclosure strategies to address tenant impatience in decentralized environments. Despite certain limitations, its theoretical contributions and practical value make it an important advance in the field.