论文标题
低潜伏期基于弹性云的复制
Resilient Cloud-based Replication with Low Latency
论文作者
论文摘要
在地理复制环境中忍受拜占庭断层的现有方法需要系统在广阔的区域链接上执行复杂的协议协议,因此通常与高响应时间有关。在本文中,我们用Spider解决了这个问题,Spider是地理分布式系统的弹性复制体系结构,利用当今的公共云基础架构的可用性特征,以最大程度地减少复杂性并减少延迟。 Spider模拟系统作为一个松散耦合复制组的集合,其成员托管在同一地理区域的不同云提供的故障域(即可用性区域)中。这个结构性组织可以通过将复制组靠近客户近处实现较低的响应时间,同时仍可以使组的复制品在短途链接上进行交互。为了处理强大一致性蜘蛛所需的组间通信,使用了一个可靠的组到组消息通道,该通道具有首先出局的语义和内置流量控制,从而大大简化了系统设计。
Existing approaches to tolerate Byzantine faults in geo-replicated environments require systems to execute complex agreement protocols over wide-area links and consequently are often associated with high response times. In this paper we address this problem with Spider, a resilient replication architecture for geo-distributed systems that leverages the availability characteristics of today's public-cloud infrastructures to minimize complexity and reduce latency. Spider models a system as a collection of loosely coupled replica groups whose members are hosted in different cloud-provided fault domains (i.e., availability zones) of the same geographic region. This structural organization makes it possible to achieve low response times by placing replica groups in close proximity to clients while still enabling the replicas of a group to interact over short-distance links. To handle the inter-group communication necessary for strong consistency Spider uses a reliable group-to-group message channel with first-in-first-out semantics and built-in flow control that significantly simplifies system design.