论文标题
PuspDownDB:使用S3计算加速DBM
PushdownDB: Accelerating a DBMS using S3 Computation
论文作者
论文摘要
本文使用最近发布的称为S3 Select的功能,研究了将DBMS Analytics查询部分推向简单存储服务(S3)引擎(S3)引擎的有效性。我们表明,某些DBMS原语(过滤器,投影,聚合)始终可以成本效益地移动到S3中。其他更复杂的操作(JOIN,TOP-K,组合)需要重新实现以利用S3 SELECT,并且通常是候选人进行下调。我们通过使用我们开发的新DBM进行实验来证明这些功能,PushDownDB。通过包括TPC-H查询在内的查询集合进行的实验表明,PuspDownDB平均比不使用S3选择的基线快30%和6.7倍。
This paper studies the effectiveness of pushing parts of DBMS analytics queries into the Simple Storage Service (S3) engine of Amazon Web Services (AWS), using a recently released capability called S3 Select. We show that some DBMS primitives (filter, projection, aggregation) can always be cost-effectively moved into S3. Other more complex operations (join, top-K, group-by) require reimplementation to take advantage of S3 Select and are often candidates for pushdown. We demonstrate these capabilities through experimentation using a new DBMS that we developed, PushdownDB. Experimentation with a collection of queries including TPC-H queries shows that PushdownDB is on average 30% cheaper and 6.7X faster than a baseline that does not use S3 Select.