Paper Title
Dynamo -- Handling Scientific Data Across Sites and Storage Media
Paper Authors
Paper Abstract
Dynamo is a full-stack software solution for scientific data management. Dynamo's architecture is modular, extensible, and customizable, making the software suitable for managing data across a wide range of installation scales, from a few terabytes stored at a single location to hundreds of petabytes distributed across a worldwide computing grid. This article documents the core system design of Dynamo and describes the applications that implement various data management tasks. A brief report is also given on the operational experience with the system at the CMS experiment at the CERN Large Hadron Collider and at a small-scale analysis facility.