论文标题
不断发展的度量学习以增量和减少特征
Evolving Metric Learning for Incremental and Decremental Features
论文作者
论文摘要
由于计算成本较低,在线度量学习已被广泛利用大规模数据分类。但是,在在线实践场景中,这些功能正在发展(例如,某些功能消失并增强了一些新功能),大多数公制学习模型不能成功地应用于这些场景,尽管它们可以有效地解决演变的实例。为了应对挑战,我们为增量和减少特征开发了一种新的在线发展度量学习(EML)模型,可以通过与平滑的Wasserstein度量距离合并来同时处理实例和特征进化。具体而言,我们的模型包含两个基本阶段:转换阶段(T阶段)和继承阶段(I阶段)。对于T阶段,我们建议从消失的特征中提取重要信息,同时忽略非信息知识,并通过将其转换为低级别的判别度量空间来将其转发为幸存的特征。它进一步探讨了异质样品的内在低级结构,以减少计算和记忆负担,尤其是对于高度维持大规模数据的数据。对于I阶段,我们继承了T阶段幸存功能的度量表现,然后扩展以包括新的增强功能。此外,使用平滑的瓦斯汀距离来表征异质和复杂样本之间的相似性关系,因为不断发展的特征在不同阶段不严格对齐。除了在一声案例中应对挑战之外,我们还将模型扩展到多命中场景。在为T级和I级的有效优化策略得出了多个数据集的广泛实验后,验证了我们的EML模型的出色性能。
Online metric learning has been widely exploited for large-scale data classification due to the low computational cost. However, amongst online practical scenarios where the features are evolving (e.g., some features are vanished and some new features are augmented), most metric learning models cannot be successfully applied to these scenarios, although they can tackle the evolving instances efficiently. To address the challenge, we develop a new online Evolving Metric Learning (EML) model for incremental and decremental features, which can handle the instance and feature evolutions simultaneously by incorporating with a smoothed Wasserstein metric distance. Specifically, our model contains two essential stages: a Transforming stage (T-stage) and a Inheriting stage (I-stage). For the T-stage, we propose to extract important information from vanished features while neglecting non-informative knowledge, and forward it into survived features by transforming them into a low-rank discriminative metric space. It further explores the intrinsic low-rank structure of heterogeneous samples to reduce the computation and memory burden especially for highly-dimensional large-scale data. For the I-stage, we inherit the metric performance of survived features from the T-stage and then expand to include the new augmented features. Moreover, a smoothed Wasserstein distance is utilized to characterize the similarity relationships among the heterogeneous and complex samples, since the evolving features are not strictly aligned in the different stages. In addition to tackling the challenges in one-shot case, we also extend our model into multishot scenario. After deriving an efficient optimization strategy for both T-stage and I-stage, extensive experiments on several datasets verify the superior performance of our EML model.