论文标题
朝着可解释的位误差耐受性
Towards Explainable Bit Error Tolerance of Resistive RAM-Based Binarized Neural Networks
论文作者
论文摘要
非挥发性内存,例如电阻RAM(RRAM),是一种新兴的节能存储,尤其是对于边缘的低功率机学习模型。但是,据报道,在超低功率设置中,RRAM的位错误率最高3.3%,这对于许多用例而言可能至关重要。二进制神经网络(BNNS)是神经网络(NNS)的资源有效变体,可以忍受一定百分比的错误,而不会损失准确性,并且需要较低的计算和存储资源。正如Hirtzlin等人提出的那样,可以通过翻转训练期间的重量标志来实现BNN中的位误差(BET),但是它们的方法具有很大的缺点,尤其是对于完全连接的神经网络(FCNN):FCNN:fcnns过于适合训练中的错误率,导致较低的误差率在较低的误差率下导致了较低的误差率。此外,未研究BET的基本原则。在这项工作中,我们改善了BNN下注的培训,并旨在解释这一属性。我们提出了直通梯度近似,以改善重量 - 登录训练,而BNNS则更少适应位错误率。为了解释达到的鲁棒性,我们定义了一个旨在测量赌注而无需断层注入的度量。我们评估了度量标准,发现它与所有测试的FCNN的准确性相关。最后,我们探讨了一个新颖的正规化程序的影响,该新规则化器优化了该指标,目的是提供准确性和下注的可配置权衡。
Non-volatile memory, such as resistive RAM (RRAM), is an emerging energy-efficient storage, especially for low-power machine learning models on the edge. It is reported, however, that the bit error rate of RRAMs can be up to 3.3% in the ultra low-power setting, which might be crucial for many use cases. Binary neural networks (BNNs), a resource efficient variant of neural networks (NNs), can tolerate a certain percentage of errors without a loss in accuracy and demand lower resources in computation and storage. The bit error tolerance (BET) in BNNs can be achieved by flipping the weight signs during training, as proposed by Hirtzlin et al., but their method has a significant drawback, especially for fully connected neural networks (FCNN): The FCNNs overfit to the error rate used in training, which leads to low accuracy under lower error rates. In addition, the underlying principles of BET are not investigated. In this work, we improve the training for BET of BNNs and aim to explain this property. We propose straight-through gradient approximation to improve the weight-sign-flip training, by which BNNs adapt less to the bit error rates. To explain the achieved robustness, we define a metric that aims to measure BET without fault injection. We evaluate the metric and find that it correlates with accuracy over error rate for all FCNNs tested. Finally, we explore the influence of a novel regularizer that optimizes with respect to this metric, with the aim of providing a configurable trade-off in accuracy and BET.