论文标题
不平衡数据流的漂移感知的多内存模型
Drift-Aware Multi-Memory Model for Imbalanced Data Streams
论文作者
论文摘要
在线类不平衡学习涉及受概念漂移和阶级失衡影响的数据流。在线学习试图在利用先前学习的信息与将新信息纳入模型之间找到权衡。这既需要模型的增量更新,又需要学习过时的信息的能力。但是,不正确的学习可能会导致追溯性干扰问题,这一现象是在新学习的信息干扰旧信息并阻碍先前学习的信息的召回时发生的。当班级没有平等表示时,问题就会变得更加严重,从而导致从模型中删除少数群体信息。在这项工作中,我们提出了漂移感知的多内存模型(DAM3),该模型解决了基于内存的模型的在线学习中的类不平衡问题。 DAM3通过合并对不平衡的漂移探测器,保留模型中类平衡表示,并使用使用的工作记忆来防止忘记旧信息的工作记忆来解决追溯性干扰,从而减轻类失衡。我们通过对现实世界和合成数据集进行的实验表明,所提出的方法可以减轻类不平衡,并胜过最新方法。
Online class imbalance learning deals with data streams that are affected by both concept drift and class imbalance. Online learning tries to find a trade-off between exploiting previously learned information and incorporating new information into the model. This requires both the incremental update of the model and the ability to unlearn outdated information. The improper use of unlearning, however, can lead to the retroactive interference problem, a phenomenon that occurs when newly learned information interferes with the old information and impedes the recall of previously learned information. The problem becomes more severe when the classes are not equally represented, resulting in the removal of minority information from the model. In this work, we propose the Drift-Aware Multi-Memory Model (DAM3), which addresses the class imbalance problem in online learning for memory-based models. DAM3 mitigates class imbalance by incorporating an imbalance-sensitive drift detector, preserving a balanced representation of classes in the model, and resolving retroactive interference using a working memory that prevents the forgetting of old information. We show through experiments on real-world and synthetic datasets that the proposed method mitigates class imbalance and outperforms the state-of-the-art methods.