论文标题
使用图形机学习学习嵌入式库存相关矩阵的表示
Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning
论文作者
论文摘要
了解金融工具之间的非线性关系在投资流程中具有各种应用,包括风险管理,投资组合建设和交易策略。在这里,我们专注于股票之间的相互联系,基于它们的相关矩阵,我们将其表示为网络,其节点代表单个库存,以及代表相应成对相关系数的节点对之间的加权链接。在金融文献中广泛使用的传统网络科学技术需要手工制作的特征,例如中心度措施来了解这种相关网络。但是,手动招募所有这些手工制作的功能可能很快成为一项艰巨的任务。取而代之的是,我们提出了一种新的方法,用于使用称为node2vec的图机学习算法以算法方式研究相关网络中的细微差别和关系。特别是,该算法将网络压缩到一个较低的维度连续空间,称为嵌入式,其中由算法识别为相似的节点对彼此靠近。通过使用标准普尔500库存数据的日志返回,我们表明我们提出的算法可以从其相关网络中学习这种嵌入。我们定义了受自然语言处理领域(NLP)用于评估嵌入以识别最佳嵌入的指标的启发的各种领域特定的定量(和客观)和定性指标。此外,我们讨论了嵌入在投资管理中的各种应用。
Understanding non-linear relationships among financial instruments has various applications in investment processes ranging from risk management, portfolio construction and trading strategies. Here, we focus on interconnectedness among stocks based on their correlation matrix which we represent as a network with the nodes representing individual stocks and the weighted links between pairs of nodes representing the corresponding pair-wise correlation coefficients. The traditional network science techniques, which are extensively utilized in financial literature, require handcrafted features such as centrality measures to understand such correlation networks. However, manually enlisting all such handcrafted features may quickly turn out to be a daunting task. Instead, we propose a new approach for studying nuances and relationships within the correlation network in an algorithmic way using a graph machine learning algorithm called Node2Vec. In particular, the algorithm compresses the network into a lower dimensional continuous space, called an embedding, where pairs of nodes that are identified as similar by the algorithm are placed closer to each other. By using log returns of S&P 500 stock data, we show that our proposed algorithm can learn such an embedding from its correlation network. We define various domain specific quantitative (and objective) and qualitative metrics that are inspired by metrics used in the field of Natural Language Processing (NLP) to evaluate the embeddings in order to identify the optimal one. Further, we discuss various applications of the embeddings in investment management.