论文标题

DeepGoplus推断的数值稳定性

Numerical Stability of DeepGOPlus Inference

论文作者

Pepe, Inés Gonzalez, Chatelain, Yohan, Kiar, Gregory, Glatard, Tristan

论文摘要

卷积神经网络(CNN)目前是可用的最广泛使用的深神经网络(DNN)体系结构之一,并在许多问题上实现了最先进的性能。 CNN最初适用于计算机视觉任务,可与具有空间关系的任何数据(图像之外)配合使用,并已应用于不同的字段。但是,最近的工作突出了DNN中的数值稳定性挑战,这也与它们对噪声注入的已知敏感性有关。这些挑战会危害其性能和可靠性。本文研究了预测蛋白质功能的CNN DeepGoplus。 DeepGoplus已达到最先进的性能,并可以成功利用并注释蛋白质组学中出现的蛋白质序列。我们通过量化基础浮点数据的扰动而导致的数值不确定性来确定模型推理阶段的数值稳定性。此外,我们探索了使用降低精确的浮点格式进行DeepGoplus推断的机会,以减少记忆消耗和潜伏期。这是通过使用Monte Carlo Arithmetic仪器执行的仪器来实现的,该技术可以在实验上量化浮点操作错误和VPREC,该工具以可自定义的浮点精度格式模拟结果。将重点放在推理阶段,因为它是DeepGoplus模型的主要可交付量,该模型广泛适用于不同的环境。总而言之,我们的结果表明,尽管DeepGoplus CNN在数值上非常稳定,但只能通过较低精确的浮点格式选择性地实现。我们得出的结论是,从预训练的DeepGoplus模型获得的预测在数值上非常可靠,并有效地使用现有的浮点格式。

Convolutional neural networks (CNNs) are currently among the most widely-used deep neural network (DNN) architectures available and achieve state-of-the-art performance for many problems. Originally applied to computer vision tasks, CNNs work well with any data with a spatial relationship, besides images, and have been applied to different fields. However, recent works have highlighted numerical stability challenges in DNNs, which also relates to their known sensitivity to noise injection. These challenges can jeopardise their performance and reliability. This paper investigates DeepGOPlus, a CNN that predicts protein function. DeepGOPlus has achieved state-of-the-art performance and can successfully take advantage and annotate the abounding protein sequences emerging in proteomics. We determine the numerical stability of the model's inference stage by quantifying the numerical uncertainty resulting from perturbations of the underlying floating-point data. In addition, we explore the opportunity to use reduced-precision floating point formats for DeepGOPlus inference, to reduce memory consumption and latency. This is achieved by instrumenting DeepGOPlus' execution using Monte Carlo Arithmetic, a technique that experimentally quantifies floating point operation errors and VPREC, a tool that emulates results with customizable floating point precision formats. Focus is placed on the inference stage as it is the primary deliverable of the DeepGOPlus model, widely applicable across different environments. All in all, our results show that although the DeepGOPlus CNN is very stable numerically, it can only be selectively implemented with lower-precision floating-point formats. We conclude that predictions obtained from the pre-trained DeepGOPlus model are very reliable numerically, and use existing floating-point formats efficiently.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源