论文标题

边缘和雾生态系统的自动化控制平面

A Self-stabilizing Control Plane for the Edge and Fog Ecosystems

论文作者

Georgiou, Zacharias, Georgiou, Chryssis, Pallis, George, Schiller, Elad Michael, Trihinas, Demetris

论文摘要

现在,随着传感设备(又称“事物”)和延迟敏感服务之间的计算和连通性差距的主要范式,雾计算正在出现。但是,随着雾部部署通过积累了与高度动态和动荡的网络织物相互联系的众多设备来扩展,在存在故障的情况下,需要自我配置和自我修复的需求比以往任何时候都更加明显。我们使用自动稳定化的流行方法,为分布式控制平面提出了一个容忍故障的框架,使雾服务能够应对并从非常广泛的故障模型中恢复。具体而言,我们的模型考虑了网络不确定性,数据包下降,节点故障停止故障以及违反系统设计为操作的假设,例如系统状态的任意损坏。我们的自稳定算法可以保证在恒定数量的通信回合中自动恢复,而无需外部(人类)干预。为了展示该框架的有效性,提出的自动化算法过程的正确性证明伴随着全面的评估,该评估采用了智能运输域中的现实世界数据开放且可重现的测试床。结果表明,我们的框架可确保在恒定时间内从故障中恢复雾生态系统,分析得分正确,而系统控制平面的开销则朝着IoT负载线性缩放。

Fog Computing is now emerging as the dominating paradigm bridging the compute and connectivity gap between sensing devices (a.k.a. "things") and latency-sensitive services. However, as fog deployments scale by accumulating numerous devices interconnected over highly dynamic and volatile network fabrics, the need for self-configuration and self-healing in the presence of failures is more evident now than ever. Using the prevailing methodology of self-stabilization, we propose a fault-tolerant framework for distributed control planes that enables fog services to cope and recover from a very broad fault model. Specifically, our model considers network uncertainties, packet drops, node fail-stop failures, and violations of the assumptions according to which the system was designed to operate, such as an arbitrary corruption of the system state. Our self-stabilizing algorithms guarantee automatic recovery within a constant number of communication rounds without the need for external (human) intervention. To showcase the framework's effectiveness, the correctness proof of the proposed self-stabilizing algorithmic process is accompanied by a comprehensive evaluation featuring an open and reproducible testbed utilizing real-world data from the intelligent transportation domain. Results show that our framework ensures a fog ecosystem recovery from faults in constant time, analytics are computed correctly, while the overhead to the system's control plane scales linearly towards the IoT load.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源