论文标题
flex:缩小用法和分配之间的差距
Flex: Closing the Gaps between Usage and Allocation
论文作者
论文摘要
数据中心是互联网数据和服务的巨型工厂。与航空业相比,全球数据中心消耗能源和发射排放量更多。不幸的是,大多数数据中心都被大大未被充分利用。主要原因之一是真实用法和配置资源之间的巨大差距,因为用户倾向于过度估计其需求和数据中心运营商通常依赖用户对资源分配的请求。在本文中,我们首先对Google群集跟踪进行了深入的分析,以揭示出低利用率的根本原因,并突出了改善它的巨大潜力。然后,我们开发了一个在线资源经理Flex,以最大程度地利用群集利用率,同时满足服务质量(QoS)。基于现实世界痕迹的大规模评估表明,与传统调度程序相比,FLEX在维持QoS的同时,flex的要求多达1.74倍,并且使用率高1.6倍。
Data centers are giant factories of Internet data and services. Worldwide data centers consume energy and emit emissions more than airline industry. Unfortunately, most of data centers are significantly underutilized. One of the major reasons is the big gaps between the real usage and the provisioned resources because users tend to over-estimate their demand and data center operators often rely on users' requests for resource allocation. In this paper, we first conduct an in-depth analysis of a Google cluster trace to unveil the root causes for low utilization and highlight the great potential to improve it. We then developed an online resource manager Flex to maximize the cluster utilization while satisfying the Quality of Service (QoS). Large-scale evaluations based on real-world traces show that Flex admits up to 1.74x more requests and 1.6x higher utilization compared to tradition schedulers while maintaining the QoS.