Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?
By: Xinglei Dou, Lei Liu, Limin Xiao
Potential Business Impact:
Makes computers run faster and use less power.
Making it intelligent is a promising way in System/OS design. This paper proposes OSML+, a new ML-based resource scheduling mechanism for co-located cloud services. OSML+ intelligently schedules the cache and main memory bandwidth resources at the memory hierarchy and the computing core resources simultaneously. OSML+ uses a multi-model collaborative learning approach during its scheduling and thus can handle complicated cases, e.g., avoiding resource cliffs, sharing resources among applications, enabling different scheduling policies for applications with different priorities, etc. OSML+ can converge faster using ML models than previous studies. Moreover, OSML+ can automatically learn on the fly and handle dynamically changing workloads accordingly. Using transfer learning technologies, we show our design can work well across various cloud servers, including the latest off-the-shelf large-scale servers. Our experimental results show that OSML+ supports higher loads and meets QoS targets with lower overheads than previous studies.
Similar Papers
Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning
Distributed, Parallel, and Cluster Computing
Makes computer clouds work smarter and cheaper.
Learning Virtual Machine Scheduling in Cloud Computing through Language Agents
Machine Learning (CS)
Helps computers pack more tasks into cloud servers.
Machine learning-based cloud resource allocation algorithms: a comprehensive comparative review
Distributed, Parallel, and Cluster Computing
Makes computers use cloud power smarter and cheaper.