论文标题
一台指数分解机器,零售额最小化的零售销售预测
An Exponential Factorization Machine with Percentage Error Minimization to Retail Sales Forecasting
论文作者
论文摘要
本文提出了一种新产品的新方法,以延长交货时间,但产品生命周期较短。这些SKU通常仅出售一个季节,而没有任何补充。开发了指数分解机(EFM)销售预测模型来解决此问题,不仅考虑了SKU属性,还考虑了成对的交互。 EFM模型与两倍的原始分解机(FM)显着不同:(1)解释变量的属性级公式以及(2)正响应变量的指数公式。属性级的形成不包括不可行的内部属性内部相互作用,并导致更有效的功能工程与传统的一式式单热编码进行了比较,而指数式的公式比正面的表述更有效,而对数转换的表现更有效,对于正面的分布式响应而言,指数式的表述更有效。为了估计参数,通过训练集的提议的自适应批处理梯度下降方法将百分比误差平方(PE)和误差平方(ES)最小化。新加坡鞋类零售商提供的现实世界数据用于测试提出的方法。根据平均绝对百分比误差(MAPE)和平均绝对误差(MAE)的预测性能不仅与现成模型相比,而且还通过现有的销售和需求预测研究报告的结果有利。两个外部公共数据集也证明了拟议方法的有效性。此外,我们证明了PES与ES最小化之间的理论关系,并为回归模型提供了PES最小化的重要特性。它训练模型以低估数据。该物业符合销售预测的状况,在这种情况下,单位持有成本远大于单位短缺成本。
This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significantly different from the original Factorization Machines (FM) from two-fold: (1) the attribute-level formulation for explanatory variables and (2) exponential formulation for the positive response variable. The attribute-level formation excludes infeasible intra-attribute interactions and results in more efficient feature engineering comparing with the conventional one-hot encoding, while the exponential formulation is demonstrated more effective than the log-transformation for the positive but not skewed distributed responses. In order to estimate the parameters, percentage error squares (PES) and error squares (ES) are minimized by a proposed adaptive batch gradient descent method over the training set. Real-world data provided by a footwear retailer in Singapore is used for testing the proposed approach. The forecasting performance in terms of both mean absolute percentage error (MAPE) and mean absolute error (MAE) compares favourably with not only off-the-shelf models but also results reported by extant sales and demand forecasting studies. The effectiveness of the proposed approach is also demonstrated by two external public datasets. Moreover, we prove the theoretical relationships between PES and ES minimization, and present an important property of the PES minimization for regression models; that it trains models to underestimate data. This property fits the situation of sales forecasting where unit-holding cost is much greater than the unit-shortage cost.