新颖的班级发现而无需忘记

论文标题

新颖的班级发现而无需忘记

Novel Class Discovery without Forgetting

论文作者

Joseph, K J, Paul, Sujoy, Aggarwal, Gaurav, Biswas, Soma, Rai, Piyush, Han, Kai, Balasubramanian, Vineeth N

论文摘要

人类通过利用和适应到目前为止获得的知识来识别和区分他们不熟悉的实例的天生能力。重要的是，他们实现了这一目标，而不会在早期学习中恶化表现。受此启发的启发，我们识别并制定了NCDWF的新的，务实的问题设置：新颖的班级发现而无需忘记，哪个任务是机器学习模型从未标记的数据中逐步发现实例的新颖类别，同时在先前看到的类别上保持其性能。我们提出1）一种生成伪层表示的方法，该方法充当（不再可用）标记数据的代理，从而减轻忘记的遗忘，2）基于共同信息的正常化程序，增强了无聊的新颖类发现，而3）一个简单的类别，可以在测试数据中有助于既可以看到既可以看到了既可以识别了，又可以看到一个简单的类别。我们介绍了基于CIFAR-10，CIFAR-100和IMAGENET-1000的实验协议，以衡量知识保留和新型类发现之间的权衡。我们广泛的评估表明，现有的模型在确定新类别的同时灾难性地忘记了以前看到的类别，而我们的方法能够在竞争目标之间有效平衡。我们希望我们的工作能够吸引对这个新确定的务实问题设定的进一步研究。

Humans possess an innate ability to identify and differentiate instances that they are not familiar with, by leveraging and adapting the knowledge that they have acquired so far. Importantly, they achieve this without deteriorating the performance on their earlier learning. Inspired by this, we identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetting, which tasks a machine learning model to incrementally discover novel categories of instances from unlabeled data, while maintaining its performance on the previously seen categories. We propose 1) a method to generate pseudo-latent representations which act as a proxy for (no longer available) labeled data, thereby alleviating forgetting, 2) a mutual-information based regularizer which enhances unsupervised discovery of novel classes, and 3) a simple Known Class Identifier which aids generalized inference when the testing data contains instances form both seen and unseen categories. We introduce experimental protocols based on CIFAR-10, CIFAR-100 and ImageNet-1000 to measure the trade-off between knowledge retention and novel class discovery. Our extensive evaluations reveal that existing models catastrophically forget previously seen categories while identifying novel categories, while our method is able to effectively balance between the competing objectives. We hope our work will attract further research into this newly identified pragmatic problem setting.

下载PDF全文

下载文献需遵守相关版权规定

论文标题