Paper Title
Data Science: A Comprehensive Overview
Paper Authors
Paper Abstract
The twenty-first century has ushered in the age of big data and data economy, in which data DNA, which carries important knowledge, insights and potential, has become an intrinsic constituent of all data-based organisms. An appropriate understanding of data DNA and its organisms relies on the new field of data science and its keystone, analytics. Although it is widely debated whether big data is only hype and buzz, and data science is still in a very early phase, significant challenges and opportunities are emerging or have been inspired by the research, innovation, business, profession, and education of data science. This paper provides a comprehensive survey and tutorial of the fundamental aspects of data science: the evolution from data analysis to data science, the data science concepts, a big picture of the era of data science, the major challenges and directions in data innovation, the nature of data analytics, new industrialization and service opportunities in the data economy, the profession and competency of data education, and the future of data science. This article is the first in the field to draw a comprehensive big picture, in addition to offering rich observations, lessons and thinking about data science and analytics.