论文标题

协作数据分析的集成平台

An Integrated Platform for Collaborative Data Analytics

论文作者

Oesch, Sean, Gillen, Rob, Karnowski, Tom

论文摘要

尽管数据科学家之间的协作是组织生产力的关键,但数据分析师面临实现这一目标的重大障碍,包括数据共享,访问和配置所需的计算环境以及一种统一的共享知识方法。这些协作的每个障碍都与知识管理的基本问题有关,“组织如何更有效地使用知识?”。在本文中,我们将知识管理的问题考虑在协作数据分析中的问题,并将综合知识管理平台Shareal作为解决方案。 Shareal平台由三个核心组件组成:完整的堆栈Web应用程序,用于分析流数据的仪表板和用于执行实时分析的高性能计算(HPC)群集。先前的研究没有将知识管理应用于协作分析或开发了与Shareal具有相同功能的平台。 Shareal通过Web应用程序提供数据和分析的直观共享,通过HPC群集通过HPC群集提供了数据和分析,并通过实时消息传递应用程序来克服障碍。

While collaboration among data scientists is a key to organizational productivity, data analysts face significant barriers to achieving this end, including data sharing, accessing and configuring the required computational environment, and a unified method of sharing knowledge. Each of these barriers to collaboration is related to the fundamental question of knowledge management "how can organizations use knowledge more effectively?". In this paper, we consider the problem of knowledge management in collaborative data analytics and present ShareAL, an integrated knowledge management platform, as a solution to that problem. The ShareAL platform consists of three core components: a full stack web application, a dashboard for analyzing streaming data and a High Performance Computing (HPC) cluster for performing real time analysis. Prior research has not applied knowledge management to collaborative analytics or developed a platform with the same capabilities as ShareAL. ShareAL overcomes the barriers data scientists face to collaboration by providing intuitive sharing of data and analytics via the web application, a shared computing environment via the HPC cluster and knowledge sharing and collaboration via a real time messaging application.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源