论文标题
工业SCADA网络的入侵检测和识别系统设计和绩效评估
Intrusion Detection and identification System Design and Performance Evaluation for Industrial SCADA Networks
论文作者
论文摘要
在本文中,我们提出了一项研究,该研究提出了一个三阶段分类器模型,该模型采用机器学习算法来开发数十种针对工业SCADA网络的攻击的入侵检测和识别系统。使用Gas Pipeline SCADA网络的实验室原型生成的数据,对机器学习分类器进行了训练和测试。数据集由三个攻击组和七个不同的攻击类别或类别组成。同一数据集进一步提供了与这七个攻击类有关的35种不同类型的子攻击的签名。这项研究需要设计三阶段的机器学习分类器作为滥用入侵检测系统,以检测和识别35个攻击子类中的每一个。分类器的第一阶段决定记录是否与正常操作或攻击签名相关联。如果发现该记录属于攻击签名,则在第二阶段,将其分为七个攻击类之一。基于由第二阶段分类器的输出确定的确定攻击类,为第三阶段的子攻击分类提供了攻击记录,其中使用了七个不同的分类器。来自第三阶段分类器的输出标识了记录所属的子攻击类型。仿真结果表明,设计探索域的专业化或在多个阶段执行分类的设计与单级设计相对于有数十种类的问题是有希望的。与文献研究的比较还表明,多阶段分类器的性能明显更好。
In this paper, we present a study that proposes a three-stage classifier model which employs a machine learning algorithm to develop an intrusion detection and identification system for tens of different types of attacks against industrial SCADA networks. The machine learning classifier is trained and tested on the data generated using the laboratory prototype of a gas pipeline SCADA network. The dataset consists of three attack groups and seven different attack classes or categories. The same dataset further provides signatures of 35 different types of sub-attacks which are related to those seven attack classes. The study entailed the design of three-stage machine learning classifier as a misuse intrusion detection system to detect and identify specifically each of the 35 attack subclasses. The first stage of the classifier decides if a record is associated with normal operation or an attack signature. If the record is found to belong to an attack signature, then in the second stage, it is classified into one of seven attack classes. Based on the identified attack class as determined by the output from the second stage classifier, the attack record is provided for a third stage sub-attack classification, where seven different classifiers are employed. The output from the third stage classifier identifies the sub-attack type to which the record belongs. Simulation results indicate that designs exploring specialization to domains or executing the classification in multiple stages versus single-stage designs are promising for problems where there are tens of classes. Comparison with studies in the literature also indicated that the multi-stage classifier performed markedly better.