论文标题

擦除编码存储系统中延迟的建模和优化

Modeling and Optimization of Latency in Erasure-coded Storage Systems

论文作者

Aggarwal, Vaneet, Lan, Tian

论文摘要

随着消费者越来越多地从事社交网络和电子商务活动,企业不断依靠大数据分析来智力,而传统的IT基础设施继续迁移到云和边缘,这些趋势会导致分布式数据存储需求以前所未有的速度上升。擦除编码已迅速成为一种有前途的技术,可以降低存储成本,同时提供与复制系统相似的可靠性,该系统被Facebook,Microsoft和Google等公司广泛采用。但是,当在分布式存储中使用擦除代码时,它还带来了表征和优化访问延迟的新挑战。该专着的目的是对采用擦除代码进行分布式存储的系统进行审查(理论和实用)。 在这本专着中,我们将首先确定研究问题的关键挑战和分类法,然后概述已开发出来量化和模拟擦除编码存储的延迟的不同方法。这包括最近利用MDS保留,叉子加入,概率和延迟重新发布的调度策略的工作,以及它们的应用来表征擦除编码的分布式存储系统的访问潜伏期(例如,平均,尾巴,渐近,渐近延迟)。我们还将将问题扩展到用户正在从擦除编码的分布式存储系统流式传输视频的情况下。接下来,我们弥合理论与实践之间的差距,并讨论从原型实施中学到的经验教训。特别是,我们将讨论擦除编码存储的示例性实施,阐明了关键的自由和权衡设计程度,并总结了实际存储系统中的剩余挑战,例如内容交付和缓存。在每章的结尾讨论未来研究的开放问题。

As consumers are increasingly engaged in social networking and E-commerce activities, businesses grow to rely on Big Data analytics for intelligence, and traditional IT infrastructures continue to migrate to the cloud and edge, these trends cause distributed data storage demand to rise at an unprecedented speed. Erasure coding has seen itself quickly emerged as a promising technique to reduce storage cost while providing similar reliability as replicated systems, widely adopted by companies like Facebook, Microsoft and Google. However, it also brings new challenges in characterizing and optimizing the access latency when erasure codes are used in distributed storage. The aim of this monograph is to provide a review of recent progress (both theoretical and practical) on systems that employ erasure codes for distributed storage. In this monograph, we will first identify the key challenges and taxonomy of the research problems and then give an overview of different approaches that have been developed to quantify and model latency of erasure-coded storage. This includes recent work leveraging MDS-Reservation, Fork-Join, Probabilistic, and Delayed-Relaunch scheduling policies, as well as their applications to characterize access latency (e.g., mean, tail, asymptotic latency) of erasure-coded distributed storage systems. We will also extend the problem to the case when users are streaming videos from erasure-coded distributed storage systems. Next, we bridge the gap between theory and practice, and discuss lessons learned from prototype implementation. In particular, we will discuss exemplary implementations of erasure-coded storage, illuminate key design degrees of freedom and tradeoffs, and summarize remaining challenges in real-world storage systems such as in content delivery and caching. Open problems for future research are discussed at the end of each chapter.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源