Optimising Virtual Resource Mapping in Multi-Level NUMA Disaggregated Systems
Lakew, Svärd, Elmroth et al.
Disaggregated systems have a novel architecture motivated by the requirements of resource-intensive applications such as social networking, search, and in-memory databases. The total amount of resources such as memory and CPU cores is very large in such systems. However, the distributed topology of disaggregated server systems results in non-uniform access latency and performance, with NUMA effects inside each box as well as additional access latency for remote resources. In this work, we study the effects of complex NUMA topologies on application performance and propose a method for improved, NUMA-aware mapping for virtualized environments running on disaggregated systems. Our mapping algorithm is based on pinning of virtual cores and/or migration of memory across a disaggregated system and takes into account application performance, resource contention, and utilization. The proposed method is evaluated on a system with 288 cores and around 1 TB of memory, composed of six disaggregated commodity servers, through a combination of benchmarks and real applications such as memory-intensive graph databases. Our evaluation demonstrates significant improvement over the vanilla resource mapping methods. Overall, the mapping algorithm significantly improves performance compared to the default Linux scheduler used in the system.
This paper investigates the impact of complex NUMA topologies on application performance in disaggregated systems and proposes an improved NUMA-aware mapping method. The approach is based on pinning virtual cores and migrating memory, and it takes application performance, resource contention, and utilization into account. Evaluation on a disaggregated system comprising six commodity servers with 288 cores and approximately 1 TB of memory demonstrates significant performance improvements compared to the default Linux scheduler.
Disaggregated System Architecture Challenges: Disaggregated systems support resource-intensive applications (such as social networks, search engines, and in-memory databases) by aggregating resources from multiple physical servers, but distributed topologies introduce non-uniform access latencies and performance issues
Multi-Level NUMA Complexity: The system simultaneously exhibits intra-machine NUMA characteristics and cross-machine remote resource access latencies, forming a complex multi-level NUMA topology
Virtualization Environment Optimization: Existing Linux schedulers cannot effectively handle such complex resource mapping scenarios
Modern applications' computational resource demands exceed single-machine capabilities, making disaggregated systems an important development direction
Resource mapping strategies directly impact application performance; improper mapping can cause severe performance degradation
Optimization therefore has to consider resource contention, locality, and the degree of interference together
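The multi-level NUMA topology described above can be illustrated with a small sketch. It distinguishes three access levels (same NUMA node, another node in the same server, a remote server) and computes a weighted cost for a placement. The names Location, access_level, and placement_cost are hypothetical, and the relative cost values are illustrative assumptions, not measurements from the paper.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Location:
    server: int  # index of the physical server (box)
    node: int    # NUMA node inside that server

# Three access levels of a two-tier (multi-level) NUMA topology.
LOCAL, INTRA_SERVER, REMOTE = 0, 1, 2

# Illustrative relative access costs (assumed, not taken from the paper).
RELATIVE_COST = {LOCAL: 1.0, INTRA_SERVER: 2.0, REMOTE: 5.0}

def access_level(cpu: Location, mem: Location) -> int:
    """Classify a CPU-to-memory access by how far apart the two are."""
    if cpu.server != mem.server:
        return REMOTE
    if cpu.node != mem.node:
        return INTRA_SERVER
    return LOCAL

def placement_cost(cpu: Location, pages: dict) -> float:
    """Average relative cost per page for a vCPU pinned at `cpu`.

    `pages` maps a memory Location to the number of pages stored there.
    """
    total = sum(pages.values())
    if total == 0:
        return 0.0
    weighted = sum(RELATIVE_COST[access_level(cpu, loc)] * n
                   for loc, n in pages.items())
    return weighted / total
```

A mapper could compare placement_cost for several candidate pinning locations and prefer the lowest one, subject to the contention and utilization constraints discussed in the paper.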
First In-Depth Empirical Study of Disaggregated Systems: Conducts comprehensive measurements on real disaggregated hardware, considering resource contention, locality, and the degree of interference
Application Classification and Performance Metrics Framework: Classifies applications using the Animal Classes scheme and uses IPC (instructions per cycle) and MPI (misses per instruction) as performance indicators
NUMA-Aware Mapping Algorithm: Proposes an online mapping algorithm considering application classification, resource proximity, and runtime hardware performance counters
Significant Performance Improvements: Achieves an average 50× performance improvement on a real system
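As a rough illustration of the two performance indicators named above, the following sketch derives IPC and MPI from raw hardware counter values. It assumes MPI denotes last-level cache misses per instruction; the CounterSample type and its field names are hypothetical, and how the counters are collected (for example via perf) is outside the sketch.

```python
from dataclasses import dataclass

@dataclass
class CounterSample:
    """Hypothetical snapshot of hardware counters for one VM over an interval."""
    instructions: int  # retired instructions
    cycles: int        # elapsed CPU cycles
    llc_misses: int    # last-level cache misses (assumed meaning of "misses")

def ipc(sample: CounterSample) -> float:
    """Instructions per cycle; higher usually means better throughput."""
    return sample.instructions / sample.cycles if sample.cycles else 0.0

def mpi(sample: CounterSample) -> float:
    """Misses per instruction; higher suggests more memory pressure."""
    return sample.llc_misses / sample.instructions if sample.instructions else 0.0
```

Both functions return 0.0 for an empty interval instead of raising, which keeps a periodic monitoring loop simple.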
if VMi is a new arrival then
    if a free slot is suitable for VMi given ci, ai then
        Map VMi directly
    else
        Reshuffle existing VMs to create a suitable slot
        Map VMi to the new slot
for each VMi do
    if (expected_perf - measured_perf) / expected_perf ≥ Threshold then
        Add VMi to the affected list
for each affected VM do
    Build a potential neighbor list based on class compatibility
    Compute a new configuration with minimal reshuffle
    Remap if beneficial
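The monitoring part of the pseudocode above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the VM type, the function names, and especially the compatibility table are assumptions. The class names follow the usual Animal Classes labels (turtle, sheep, rabbit, devil), but which classes may share resources is invented here for illustration only.

```python
from dataclasses import dataclass

@dataclass
class VM:
    name: str
    app_class: str        # Animal Classes label, e.g. "rabbit"
    expected_perf: float  # e.g. IPC measured when running alone
    measured_perf: float  # e.g. IPC measured under the current mapping

# Illustrative compatibility table (an assumption, not taken from the paper):
# each class maps to the classes it is allowed to sit next to.
ILLUSTRATIVE_COMPATIBILITY = {
    "turtle": {"turtle", "sheep", "rabbit", "devil"},
    "sheep": {"turtle", "sheep", "rabbit", "devil"},
    "rabbit": {"turtle", "sheep"},
    "devil": {"turtle", "sheep"},
}

def degradation(vm: VM) -> float:
    """Relative performance loss: (expected - measured) / expected."""
    return (vm.expected_perf - vm.measured_perf) / vm.expected_perf

def affected_vms(vms, threshold):
    """VMs whose relative loss reaches the threshold, as in the pseudocode."""
    return [vm for vm in vms
            if vm.expected_perf > 0 and degradation(vm) >= threshold]

def potential_neighbors(vm, vms, compatibility):
    """Other VMs whose class is compatible with the affected VM's class."""
    allowed = compatibility.get(vm.app_class, set())
    return [other for other in vms
            if other is not vm and other.app_class in allowed]
```

Computing the new configuration with minimal reshuffle and deciding whether a remap is beneficial are left out, since the summary does not specify how the paper performs those steps.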
This paper cites 26 relevant references covering important research achievements in multiple domains including disaggregated systems, NUMA optimization, and virtualization technologies, providing a solid theoretical foundation for the research work.
Overall Assessment: This is a paper with significant contributions to the field of resource management in disaggregated systems. Through rigorous experimental design and comprehensive performance evaluation, it demonstrates the effectiveness of NUMA-aware mapping algorithms. Despite certain limitations, its practical value and academic contributions are substantial, laying a solid foundation for further development in this field.