Paper title

Smart caching in a Data Lake for High Energy Physics analysis

Authors

Tommaso Tedeschi, Diego Ciangottini, Marco Baioletti, Valentina Poggioni, Daniele Spiga, Loriano Storchi, Mirco Tracolli

Abstract

The continuous growth of data production in almost all scientific areas raises new problems in data access and management, especially when both the end users and the resources they can access are distributed worldwide. This work focuses on data cache management in a Data Lake infrastructure in the context of High Energy Physics. We propose an autonomous method, based on Reinforcement Learning techniques, to improve the user experience and to contain the maintenance costs of the infrastructure.
