论文标题

混合深神经网络推断黑盒系统的状态模型

Hybrid Deep Neural Networks to Infer State Models of Black-Box Systems

论文作者

Mashhadi, Mohammad Jafar, Hemmati, Hadi

论文摘要

推断运行软件系统的行为模型对于多个自动软件工程任务,例如程序理解,异常检测和测试非常有用。大多数现有的动态模型推理技术是白色框,即,它们需要仪器代码以获取运行时迹线。但是,在许多系统中,无法进行整个源代码的仪器(例如,使用Black-Box第三方库时)或可能非常昂贵。不幸的是,随着时间的流逝,大多数检测状态的黑盒技术要么是单变量,要么对数据分布进行假设,或者在过去的长期行为中学习能力有限。为了克服上述问题,在本文中,我们提出了一个混合深神经网络,该网络接受输入一组时间序列,一个每个输入/输出信号,并应用一组卷积和经常性层来学习信号和模式之间的非线性相关性。我们已经将我们的行业合作伙伴的真正无人机自动驾驶仪解决方案应用于50万行C代码。随着时间的流逝,我们运行了888个随机的系统级测试案例和推断状态。我们与几种传统的时间序列更改点检测技术的比较表明,在找到以F1分数衡量的状态变更点方面,我们的方法提高了其性能高达102%。我们还表明,我们的州分类算法平均提供90.45%的F1分数,这将传统分类算法提高了17%。

Inferring behavior model of a running software system is quite useful for several automated software engineering tasks, such as program comprehension, anomaly detection, and testing. Most existing dynamic model inference techniques are white-box, i.e., they require source code to be instrumented to get run-time traces. However, in many systems, instrumenting the entire source code is not possible (e.g., when using black-box third-party libraries) or might be very costly. Unfortunately, most black-box techniques that detect states over time are either univariate, or make assumptions on the data distribution, or have limited power for learning over a long period of past behavior. To overcome the above issues, in this paper, we propose a hybrid deep neural network that accepts as input a set of time series, one per input/output signal of the system, and applies a set of convolutional and recurrent layers to learn the non-linear correlations between signals and the patterns, over time. We have applied our approach on a real UAV auto-pilot solution from our industry partner with half a million lines of C code. We ran 888 random recent system-level test cases and inferred states, over time. Our comparison with several traditional time series change point detection techniques showed that our approach improves their performance by up to 102%, in terms of finding state change points, measured by F1 score. We also showed that our state classification algorithm provides on average 90.45% F1 score, which improves traditional classification algorithms by up to 17%.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源