论文标题
使用卷积神经网络从患者多元时间序列数据中学习隐藏模式:医疗费用预测的案例研究
Learning Hidden Patterns from Patient Multivariate Time Series Data Using Convolutional Neural Networks: A Case Study of Healthcare Cost Prediction
论文作者
论文摘要
目的:通过使用卷积神经网络(CNN)体系结构自动从患者保险索赔中的多元时间序列数据中自动学习隐藏的时间模式,以开发有效且可扩展的个人级别的患者成本预测方法。 方法:从2013年到2016年,我们使用了三年的医疗和药房索赔数据,从医疗保险公司那里使用,在该保险公司中,最初两年的数据被用来建立模型以预测第三年的成本。数据由成本,访问和医疗功能的多元时间序列组成,这些成本是患者健康状况的图像(即带有时间窗口的矩阵和一个维度的矩阵,以及医疗,另一个维度上的访问和成本功能)。将患者的多元时间序列图像用于具有建议的结构的CNN方法。在高参数调整后,提出的体系结构由三个具有LRELU激活函数的卷积和汇总层组成的组成,每一层的定制内核大小用于医疗保健数据。提出的CNN学习的时间模式成为了完全连接层的输入。 结论:通过拟议的CNN配置进行特征学习,可显着改善个人级医疗保健成本预测。提出的CNN能够超越寻找预定义图案形状的时间模式检测方法,因为它能够提取具有各种形状的可变数量的模式。从医疗,访问和成本数据中学到的时间模式为预测性能做出了重大贡献。高参数调整表明,考虑三个月的数据模式的预测准确性最高。我们的结果表明,从多元时间序列数据中提取的患者图像与常规图像不同,因此需要CNN架构的独特设计。
Objective: To develop an effective and scalable individual-level patient cost prediction method by automatically learning hidden temporal patterns from multivariate time series data in patient insurance claims using a convolutional neural network (CNN) architecture. Methods: We used three years of medical and pharmacy claims data from 2013 to 2016 from a healthcare insurer, where data from the first two years were used to build the model to predict costs in the third year. The data consisted of the multivariate time series of cost, visit and medical features that were shaped as images of patients' health status (i.e., matrices with time windows on one dimension and the medical, visit and cost features on the other dimension). Patients' multivariate time series images were given to a CNN method with a proposed architecture. After hyper-parameter tuning, the proposed architecture consisted of three building blocks of convolution and pooling layers with an LReLU activation function and a customized kernel size at each layer for healthcare data. The proposed CNN learned temporal patterns became inputs to a fully connected layer. Conclusions: Feature learning through the proposed CNN configuration significantly improved individual-level healthcare cost prediction. The proposed CNN was able to outperform temporal pattern detection methods that look for a pre-defined set of pattern shapes, since it is capable of extracting a variable number of patterns with various shapes. Temporal patterns learned from medical, visit and cost data made significant contributions to the prediction performance. Hyper-parameter tuning showed that considering three-month data patterns has the highest prediction accuracy. Our results showed that patients' images extracted from multivariate time series data are different from regular images, and hence require unique designs of CNN architectures.