论文标题
云本机平台用于状态流媒体
A Cloud Native Platform for Stateful Streaming
论文作者
论文摘要
我们介绍了云本机版本的IBM流的体系结构,其中Kubernetes作为我们的目标平台。流是一种通用流系统,其平台用于管理应用程序和执行这些应用程序的计算群集。云本地流用Kubernetes代替该平台。通过使用Kubernetes作为其平台,Streams能够卸载作业管理,生命周期跟踪,地址翻译,容忍度和调度。这种卸载是可能的,因为我们定义了定制的资源,这些自定义资源将其本地集成到Kubernetes中,从而允许流将Kubernetes的Eventing System用作自己的自定义资源。我们使用四种设计模式来实施我们的系统:控制器,导体,协调员和因果链。组合控制器,导体和协调器使我们能够从异步分布式系统中构建确定性状态机。由此产生的实现消除了原始平台代码的75%。我们的实验结果表明,在大多数情况下,Kubernetes的性能是足够的替代品,但是它在超额订购,网络延迟,垃圾收集和POD恢复方面存在问题。
We present the architecture of a cloud native version of IBM Streams, with Kubernetes as our target platform. Streams is a general purpose streaming system with its own platform for managing applications and the compute clusters that execute those applications. Cloud native Streams replaces that platform with Kubernetes. By using Kubernetes as its platform, Streams is able to offload job management, life cycle tracking, address translation, fault tolerance and scheduling. This offloading is possible because we define custom resources that natively integrate into Kubernetes, allowing Streams to use Kubernetes' eventing system as its own. We use four design patterns to implement our system: controllers, conductors, coordinators and causal chains. Composing controllers, conductors and coordinators allows us to build deterministic state machines out of an asynchronous distributed system. The resulting implementation eliminates 75% of the original platform code. Our experimental results show that the performance of Kubernetes is an adequate replacement in most cases, but it has problems with oversubscription, networking latency, garbage collection and pod recovery.