PuspDownDB：使用S3计算加速DBM

论文标题

PuspDownDB：使用S3计算加速DBM

PushdownDB: Accelerating a DBMS using S3 Computation

论文作者

Yu, Xiangyao, Youill, Matt, Woicik, Matthew, Ghanem, Abdurrahman, Serafini, Marco, Aboulnaga, Ashraf, Stonebraker, Michael

论文摘要

本文使用最近发布的称为S3 Select的功能，研究了将DBMS Analytics查询部分推向简单存储服务（S3）引擎（S3）引擎的有效性。我们表明，某些DBMS原语（过滤器，投影，聚合）始终可以成本效益地移动到S3中。其他更复杂的操作（JOIN，TOP-K，组合）需要重新实现以利用S3 SELECT，并且通常是候选人进行下调。我们通过使用我们开发的新DBM进行实验来证明这些功能，PushDownDB。通过包括TPC-H查询在内的查询集合进行的实验表明，PuspDownDB平均比不使用S3选择的基线快30％和6.7倍。

This paper studies the effectiveness of pushing parts of DBMS analytics queries into the Simple Storage Service (S3) engine of Amazon Web Services (AWS), using a recently released capability called S3 Select. We show that some DBMS primitives (filter, projection, aggregation) can always be cost-effectively moved into S3. Other more complex operations (join, top-K, group-by) require reimplementation to take advantage of S3 Select and are often candidates for pushdown. We demonstrate these capabilities through experimentation using a new DBMS that we developed, PushdownDB. Experimentation with a collection of queries including TPC-H queries shows that PushdownDB is on average 30% cheaper and 6.7X faster than a baseline that does not use S3 Select.

下载PDF全文

下载文献需遵守相关版权规定

论文标题