i-Split：用于拆分计算的深网络解释性

论文标题

i-Split：用于拆分计算的深网络解释性

I-SPLIT: Deep Network Interpretability for Split Computing

论文作者

Cunico, Federico, Capogrosso, Luigi, Setti, Francesco, Carra, Damiano, Fummi, Franco, Cristani, Marco

论文摘要

这项工作在拆分计算领域迈出了重大步骤，即如何拆分深神经网络在嵌入式设备上托管其早期部分，而其余则在服务器上。到目前为止，已经确定了潜在的分裂位置，以利用独特的体系结构方面，即基于层尺寸。在此范式下，只有在执行分裂并重新训练整个管道后才能评估拆分的疗效，从而对所有合理的分裂点在时间上进行详尽的评估。在这里，我们表明，不仅层的结构确实很重要，而且其中包含的神经元的重要性也很重要。如果神经元相对于正确的班级决策，神经元很重要。因此，应在具有高密度的重要神经元的层后立即应用一个拆分，以保留流动的信息。有了这个想法，我们提出了可解释的拆分（i-split）：通过提供可靠的预测，以确定最合适的分裂点的过程，以预测该分裂在分类准确性方面的表现，并事先对其有效实现。作为I-Split的另一个重大贡献，我们表明，多类分类问题的分裂点的最佳选择还取决于网络必须处理的特定类别。在两个网络VGG16和Resnet-50以及三个数据集，Tiny-Imagenet-200，Notmnist和胸部X射线肺炎上进行了详尽的实验。源代码可在https://github.com/vips4/i-split上找到。

This work makes a substantial step in the field of split computing, i.e., how to split a deep neural network to host its early part on an embedded device and the rest on a server. So far, potential split locations have been identified exploiting uniquely architectural aspects, i.e., based on the layer sizes. Under this paradigm, the efficacy of the split in terms of accuracy can be evaluated only after having performed the split and retrained the entire pipeline, making an exhaustive evaluation of all the plausible splitting points prohibitive in terms of time. Here we show that not only the architecture of the layers does matter, but the importance of the neurons contained therein too. A neuron is important if its gradient with respect to the correct class decision is high. It follows that a split should be applied right after a layer with a high density of important neurons, in order to preserve the information flowing until then. Upon this idea, we propose Interpretable Split (I-SPLIT): a procedure that identifies the most suitable splitting points by providing a reliable prediction on how well this split will perform in terms of classification accuracy, beforehand of its effective implementation. As a further major contribution of I-SPLIT, we show that the best choice for the splitting point on a multiclass categorization problem depends also on which specific classes the network has to deal with. Exhaustive experiments have been carried out on two networks, VGG16 and ResNet-50, and three datasets, Tiny-Imagenet-200, notMNIST, and Chest X-Ray Pneumonia. The source code is available at https://github.com/vips4/I-Split.

下载PDF全文

下载文献需遵守相关版权规定

论文标题