论文标题

标签有效的交互式时间序列异常检测

Label-Efficient Interactive Time-Series Anomaly Detection

论文作者

Guo, Hong, Wang, Yujing, Zhang, Jieyu, Lin, Zhengjie, Tong, Yunhai, Yang, Lei, Xiong, Luoxing, Huang, Congrui

论文摘要

时间序列异常检测是一项重要的任务,并且已广泛应用于行业。由于手动数据注释昂贵且效率低下,因此大多数应用程序采用了无监督的异常检测方法,但是结果通常是最佳的且不令人满意的,对于最终客户而言。弱监督是以低成本方式获得大量标签的有希望的范式,这使客户能够通过编写启发式规则而不是单独注释每个实例来标记数据。但是,在时间序列域中,人们很难编写合理的标签函数,因为时间序列数据在数字上是连续的,难以理解的。在本文中,我们提出了一个标签有效的交互式时间序列异常检测(LEIAD)系统,该系统使用户能够通过仅与系统进行少量交互来改善无监督异常检测的结果。为了实现这一目标,系统会集成弱的监督和主动学习,同时仅使用几个标记数据自动生成标签功能。所有这些技术都是互补的,可以以加强的方式相互促进。我们对三个时间序列异常检测数据集进行了实验,表明所提出的系统在弱监督和主动学习领域都优于现有解决方案。此外,该系统已经在行业的实际情况下进行了测试,以表明其实用性。

Time-series anomaly detection is an important task and has been widely applied in the industry. Since manual data annotation is expensive and inefficient, most applications adopt unsupervised anomaly detection methods, but the results are usually sub-optimal and unsatisfactory to end customers. Weak supervision is a promising paradigm for obtaining considerable labels in a low-cost way, which enables the customers to label data by writing heuristic rules rather than annotating each instance individually. However, in the time-series domain, it is hard for people to write reasonable labeling functions as the time-series data is numerically continuous and difficult to be understood. In this paper, we propose a Label-Efficient Interactive Time-Series Anomaly Detection (LEIAD) system, which enables a user to improve the results of unsupervised anomaly detection by performing only a small amount of interactions with the system. To achieve this goal, the system integrates weak supervision and active learning collaboratively while generating labeling functions automatically using only a few labeled data. All of these techniques are complementary and can promote each other in a reinforced manner. We conduct experiments on three time-series anomaly detection datasets, demonstrating that the proposed system is superior to existing solutions in both weak supervision and active learning areas. Also, the system has been tested in a real scenario in industry to show its practicality.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源