论文标题

关于灵活预测中属性无关紧要

On Irrelevance of Attributes in Flexible Prediction

论文作者

Klopotek, Mieczyslaw A., Matuszewski, Andrzej

论文摘要

本文分析了通过增量概念形成方法获得的概念层次结构的属性,称为“灵活预测”,以确定可以要求参与属性的哪种“相关性”以进行有意义的概念层次结构。研究了简单和组合属性的选择,缩放和各个属性的分布以及相关强度的影响。矛盾的是,这两者既:属性与其他属性较弱且密切相关,对整体分类的影响恶化。适当构建派生属性以及各个属性的缩放量表会强烈影响所获得的概念层次结构。分布的属性密度似乎对分类产生了薄弱 看来,概念层次结构(分类法)反映了数据与我们对数据的客观真实的兴趣之间的折衷。为了获得更适合自己目的的分类,建议将对称性分为属性(通过将其分为依赖和独立,并应用不同的评估公式以贡献其贡献)。考虑连续变量和离散变量。考虑到前者的一些方法。

This paper analyses properties of conceptual hierarchy obtained via incremental concept formation method called "flexible prediction" in order to determine what kind of "relevance" of participating attributes may be requested for meaningful conceptual hierarchy. The impact of selection of simple and combined attributes, of scaling and of distribution of individual attributes and of correlation strengths among them is investigated. Paradoxically, both: attributes weakly and strongly related with other attributes have deteriorating impact onto the overall classification. Proper construction of derived attributes as well as selection of scaling of individual attributes strongly influences the obtained concept hierarchy. Attribute density of distribution seems to influence the classification weakly It seems also, that concept hierarchies (taxonomies) reflect a compromise between the data and our interests in some objective truth about the data. To obtain classifications more suitable for one's purposes, breaking the symmetry among attributes (by dividing them into dependent and independent and applying differing evaluation formulas for their contribution) is suggested. Both continuous and discrete variables are considered. Some methodologies for the former are considered.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源