论文标题
签名方法在销售异常检测上的应用
Applications of Signature Methods to Market Anomaly Detection
论文作者
论文摘要
异常检测是识别显着偏离规范的数据集中异常实例或事件的过程。在这项研究中,我们提出了一种基于签名的机器学习算法,以检测给定时间序列类型的给定数据集中的稀有或意外项目。我们将签名或随机签名作为异常检测算法的特征提取器的应用;此外,我们为构建随机签名提供了简单,表示的理论理由。我们的第一个应用程序是基于合成数据,旨在区分股票价格的真实和假轨迹,而这些股票价格是无法通过视觉检查而区分的。我们还通过使用来自加密货币市场的交易数据来显示现实生活中的应用。在这种情况下,我们能够通过我们的无监督学习算法在社交网络上进行高达88%的社交网络中组织的泵和倾倒尝试,从而获得基于监督学习的领域的最先进的结果。
Anomaly detection is the process of identifying abnormal instances or events in data sets which deviate from the norm significantly. In this study, we propose a signatures based machine learning algorithm to detect rare or unexpected items in a given data set of time series type. We present applications of signature or randomized signature as feature extractors for anomaly detection algorithms; additionally we provide an easy, representation theoretic justification for the construction of randomized signatures. Our first application is based on synthetic data and aims at distinguishing between real and fake trajectories of stock prices, which are indistinguishable by visual inspection. We also show a real life application by using transaction data from the cryptocurrency market. In this case, we are able to identify pump and dump attempts organized on social networks with F1 scores up to 88% by means of our unsupervised learning algorithm, thus achieving results that are close to the state-of-the-art in the field based on supervised learning.