论文标题
边缘和云的有条件深层混合神经网络
Conditionally Deep Hybrid Neural Networks Across Edge and Cloud
论文作者
论文摘要
我们日常生活中“文字”的普遍性导致最近的雾计算中激增,涵盖了云计算和边缘智能的协作。为此,深度学习一直是促进这种智能系统的主要驱动力。但是,深度学习中的增长模型大小对在资源受限的边缘设备中的部署方面构成了重大挑战。此外,在分布式智能环境中,边缘和云系统之间必须有效的工作负载分布。为了应对这些挑战,我们提出了一个有条件的深层混合神经网络,以实现基于AI的雾计算。所提出的网络可以以分布式方式部署,由量化的层和云上的Edge和完整精确层组成。在推断期间,如果提前出口对分类结果具有很高的信心,则可以使样品在边缘退出,并且有条件地激活云上的较深层,这可以提高能源效率和推理潜伏期。我们进行了广泛的设计空间探索,目的是在边缘的能源消耗量最小化,同时在图像分类任务上实现最新的分类精度。我们表明,在边缘的二元层中,提出的条件混合网络可以处理边缘的65%的推论,从而导致5.5倍的计算能量减少,而CIFAR-10数据集上的精度降低最小。对于更复杂的数据集CIFAR-100,我们观察到,在边缘处具有4位量化的拟议网络以4.8倍的能量减少在边缘上获得52%的早期分类。该分析为我们提供了设计有效的混合网络的见解,这些混合网络的能源效率明显高于基于边缘云的分布式智能系统的全精度网络。
The pervasiveness of "Internet-of-Things" in our daily life has led to a recent surge in fog computing, encompassing a collaboration of cloud computing and edge intelligence. To that effect, deep learning has been a major driving force towards enabling such intelligent systems. However, growing model sizes in deep learning pose a significant challenge towards deployment in resource-constrained edge devices. Moreover, in a distributed intelligence environment, efficient workload distribution is necessary between edge and cloud systems. To address these challenges, we propose a conditionally deep hybrid neural network for enabling AI-based fog computing. The proposed network can be deployed in a distributed manner, consisting of quantized layers and early exits at the edge and full-precision layers on the cloud. During inference, if an early exit has high confidence in the classification results, it would allow samples to exit at the edge, and the deeper layers on the cloud are activated conditionally, which can lead to improved energy efficiency and inference latency. We perform an extensive design space exploration with the goal of minimizing energy consumption at the edge while achieving state-of-the-art classification accuracies on image classification tasks. We show that with binarized layers at the edge, the proposed conditional hybrid network can process 65% of inferences at the edge, leading to 5.5x computational energy reduction with minimal accuracy degradation on CIFAR-10 dataset. For the more complex dataset CIFAR-100, we observe that the proposed network with 4-bit quantization at the edge achieves 52% early classification at the edge with 4.8x energy reduction. The analysis gives us insights on designing efficient hybrid networks which achieve significantly higher energy efficiency than full-precision networks for edge-cloud based distributed intelligence systems.