Paper Title
Data Management Challenges for Internet-scale 3D Search Engines
Paper Authors
Paper Abstract
This paper describes the most significant data-related challenges involved in building internet-scale 3D search engines. The discussion centers on the most pressing data management issues in this domain, including model acquisition, support for multiple file formats, asset versioning, data integrity errors, the data lifecycle, intellectual property, and the legality of web crawling. The paper also discusses numerous issues that fall under the rubric of trustworthy computing, including privacy, security, inappropriate content, and copying/remixing of assets. The goal of the paper is to provide an overview of these general issues, illustrated by empirical data drawn from the internet's largest operational search engine. While numerous works have been published on 3D information retrieval, this paper is the first to discuss the real-world challenges that arise in building practical search engines at scale.