论文标题
构建视觉语义知识对象层次结构
Building a visual semantics aware object hierarchy
论文作者
论文摘要
语义差距定义为同一概念的语言表示之间的区别,这通常会导致具有不同知识背景的个体之间的误解。由于语言注释的图像广泛用于训练机器学习模型,因此语义差距问题(SGP)还导致不可避免的偏见图像注释,进一步导致当前计算机视觉任务的性能不佳。为了解决这个问题,我们提出了一种新颖的无监督方法来构建视觉语义意识到对象层次结构,旨在通过从纯粹的视觉信息中学习并消除由SGP引起的语言表示的偏见来获得分类模型。本文我们的直觉来自现实世界的知识表示,概念是层次组织的,每个概念都可以通过一组特征来描述,而不是语言注释,即视觉语义。评估由两个部分组成,首先我们将构造的层次结构应用于对象识别任务上,然后比较视觉层次结构和现有的词汇层次结构以显示我们方法的有效性。初步结果揭示了我们提出的方法的效率和潜力。
The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated images are extensively used for training machine learning models, semantic gap problem (SGP) also results in inevitable bias on image annotations and further leads to poor performance on current computer vision tasks. To address this problem, we propose a novel unsupervised method to build visual semantics aware object hierarchy, aiming to get a classification model by learning from pure-visual information and to dissipate the bias of linguistic representations caused by SGP. Our intuition in this paper comes from real-world knowledge representation where concepts are hierarchically organized, and each concept can be described by a set of features rather than a linguistic annotation, namely visual semantic. The evaluation consists of two parts, firstly we apply the constructed hierarchy on the object recognition task and then we compare our visual hierarchy and existing lexical hierarchies to show the validity of our method. The preliminary results reveal the efficiency and potential of our proposed method.