论文标题
信号图的统一预测框架
A Unified Prediction Framework for Signal Maps
论文作者
论文摘要
信号图对于蜂窝网络的计划和运行至关重要。但是,创建此类地图所需的测量值是昂贵的,通常是有偏见的,并不总是反映出感兴趣的指标和构成隐私风险。在本文中,我们开发了一个统一的框架,用于预测有限测量的细胞信号图。我们的框架建立在最新的随机预测指标或任何其他基本预测因子上。我们提出并结合了三种机制,这些机制涉及并非所有测量值对特定预测任务同样重要的事实。首先,我们设计服务质量功能($ Q $),包括信号强度(RSRP),以及对操作员感兴趣的其他指标,即覆盖范围和呼叫下降概率。通过隐式更改学习中使用的损失函数,质量功能还可以改善RSRP重要的预测(例如,在低信号强度状态下,MSE减少高达27%,而错误是至关重要的)。其次,我们介绍了权重功能($ W $),以指定在特征空间的不同位置和其他部分的预测相对重要性。我们提出基于重要性抽样的重新加权,以在采样和目标分布不同时获得公正的估计器。基于空间均匀的损失或基于用户群体密度的损失,目标可以提高20%的改善。第三,我们在此上下文中首次应用数据Shapley框架:将值($ ϕ $)分配给单个测量点,从而捕获其贡献对预测任务的重要性。通过删除具有负值的点,这将预测提高了预测(例如,从64%到94%的召回率),还可以启用数据最小化。我们使用几个现实世界数据集评估了我们的方法,并证明了预测性能的显着改善。
Signal maps are essential for the planning and operation of cellular networks. However, the measurements needed to create such maps are expensive, often biased, not always reflecting the metrics of interest, and posing privacy risks. In this paper, we develop a unified framework for predicting cellular signal maps from limited measurements. Our framework builds on a state-of-the-art random-forest predictor, or any other base predictor. We propose and combine three mechanisms that deal with the fact that not all measurements are equally important for a particular prediction task. First, we design quality-of-service functions ($Q$), including signal strength (RSRP) but also other metrics of interest to operators, i.e., coverage and call drop probability. By implicitly altering the loss function employed in learning, quality functions can also improve prediction for RSRP itself where it matters (e.g., MSE reduction up to 27% in the low signal strength regime, where errors are critical). Second, we introduce weight functions ($W$) to specify the relative importance of prediction at different locations and other parts of the feature space. We propose re-weighting based on importance sampling to obtain unbiased estimators when the sampling and target distributions are different. This yields improvements up to 20% for targets based on spatially uniform loss or losses based on user population density. Third, we apply the Data Shapley framework for the first time in this context: to assign values ($ϕ$) to individual measurement points, which capture the importance of their contribution to the prediction task. This improves prediction (e.g., from 64% to 94% in recall for coverage loss) by removing points with negative values, and can also enable data minimization. We evaluate our methods and demonstrate significant improvement in prediction performance, using several real-world datasets.