论文标题
实时节能对象跟踪的暹罗神经网络的优化
Optimisation of a Siamese Neural Network for Real-Time Energy Efficient Object Tracking
论文作者
论文摘要
在本文中,提出了有关使用暹罗神经网络进行嵌入式视觉系统优化视觉对象跟踪的研究。假定该解决方案应实时运行,最好是用于高分辨率视频流,最低的能源消耗。为了满足这些要求,考虑了诸如降低计算精度和修剪之类的技术。 Brevitas是一种用于优化和定量神经网络进行FPGA实现的工具。测试了许多培训方案,其优化水平各不相同 - 从具有16位的整数均匀定量到三元和二进制网络。接下来,评估了这些优化对跟踪性能的影响。相对于原始网络,可以将卷积过滤器的大小降低10倍。获得的结果表明,使用定量可以显着降低所提出的网络的内存和计算复杂性,同时仍可以进行精确的跟踪,从而允许在嵌入式视觉系统中使用它。此外,权重的定量通过减少过度拟合会对网络培训产生积极影响。
In this paper the research on optimisation of visual object tracking using a Siamese neural network for embedded vision systems is presented. It was assumed that the solution shall operate in real-time, preferably for a high resolution video stream, with the lowest possible energy consumption. To meet these requirements, techniques such as the reduction of computational precision and pruning were considered. Brevitas, a tool dedicated for optimisation and quantisation of neural networks for FPGA implementation, was used. A number of training scenarios were tested with varying levels of optimisations - from integer uniform quantisation with 16 bits to ternary and binary networks. Next, the influence of these optimisations on the tracking performance was evaluated. It was possible to reduce the size of the convolutional filters up to 10 times in relation to the original network. The obtained results indicate that using quantisation can significantly reduce the memory and computational complexity of the proposed network while still enabling precise tracking, thus allow to use it in embedded vision systems. Moreover, quantisation of weights positively affects the network training by decreasing overfitting.