论文标题
分裂卷积神经网络结构以有效推断
Splitting Convolutional Neural Network Structures for Efficient Inference
论文作者
论文摘要
对于具有大量输入数据的卷积神经网络(CNN),内存管理成为主要问题。降低记忆成本可以是解决这些问题的有效方法,可以通过不同的技术(例如特征图修剪,输入数据拆分等)实现这些问题。在这项研究中,解决了使用网络结构拆分减少内存利用的问题。提出了一种新技术,将网络结构分成比原始网络低的小部分。分零件几乎可以单独处理,这为更好的内存管理提供了重要的作用。该方法已在VGG16和RESNET18的两个众所周知的网络结构上进行了测试,以分类CIFAR10图像。仿真结果表明,分裂方法均减少了计算操作的数量以及内存消耗量。
For convolutional neural networks (CNNs) that have a large volume of input data, memory management becomes a major concern. Memory cost reduction can be an effective way to deal with these problems that can be realized through different techniques such as feature map pruning, input data splitting, etc. Among various methods existing in this area of research, splitting the network structure is an interesting research field, and there are a few works done in this area. In this study, the problem of reducing memory utilization using network structure splitting is addressed. A new technique is proposed to split the network structure into small parts that consume lower memory than the original network. The split parts can be processed almost separately, which provides an essential role for better memory management. The split approach has been tested on two well-known network structures of VGG16 and ResNet18 for the classification of CIFAR10 images. Simulation results show that the splitting method reduces both the number of computational operations as well as the amount of memory consumption.