论文标题
在记忆中处理的现代入门
A Modern Primer on Processing in Memory
论文作者
论文摘要
本文讨论了旨在使计算接近数据的最新研究,我们广泛地调用了内存处理(PIM)的方法。 PIM将计算机制放置在数据存储的位置或附近(即内存芯片或模块内部,在3D堆叠内存的逻辑层中,存储器控制器,存储设备或芯片中),以便减少或消除计算单元和存储器单位之间的数据移动。尽管PIM的总体想法并不是什么新鲜事物,但我们讨论了应用程序的激励趋势以及记忆电路和技术,这些趋势极大地加剧了在现代计算系统中启用它的需求。 We examine at least two promising new approaches to designing PIM systems to accelerate important data-intensive applications: (1) processing-using-memory, which exploits fundamental analog operational principles of memory chips to perform massively-parallel operations in-situ in memory, (2) processing-near-memory, which exploits different logic and memory integration technologies (e.g., 3D-stacked memory technology) to place computation logic close to内存电路,从而实现对数据的高带宽,低能和低延迟访问权限。在两种方法中,我们都描述和解决了相关的跨层研究,设计和采用挑战,这些挑战在设备,体系结构,系统,编译器,编程模型和应用程序中。我们的重点是以低成本在实际计算平台中采用的PIM设计的开发。我们通过讨论解决PIM实际采用的关键挑战的工作来结束。我们认为,从以过程为中心的心态(和基础架构)的转变仍然是PIM的最大采用挑战,一旦克服,它就可以释放一种从根本上启动节能,高性能和可持续的新设计,使用和编程计算系统的新方法。
This paper discusses recent research that aims to enable computation close to data, an approach we broadly call processing-in-memory (PIM). PIM places computation mechanisms in or near where the data is stored (i.e., inside memory chips or modules, in the logic layer of 3D-stacked memory, in the memory controllers, in storage devices or chips), so that data movement between the computation units and memory/storage units is reduced or eliminated. While the general idea of PIM is not new, we discuss motivating trends in applications as well as memory circuits and technology that greatly exacerbate the need for enabling it in modern computing systems. We examine at least two promising new approaches to designing PIM systems to accelerate important data-intensive applications: (1) processing-using-memory, which exploits fundamental analog operational principles of memory chips to perform massively-parallel operations in-situ in memory, (2) processing-near-memory, which exploits different logic and memory integration technologies (e.g., 3D-stacked memory technology) to place computation logic close to memory circuitry, and thereby enable high-bandwidth, low-energy, and low-latency access to data. In both approaches, we describe and tackle relevant cross-layer research, design, and adoption challenges in devices, architecture, systems, compilers, programming models, and applications. Our focus is on the development of PIM designs that can be adopted in real computing platforms at low cost. We conclude by discussing work on solving key challenges to the practical adoption of PIM. We believe that the shift from a processor-centric to a memory-centric mindset (and infrastructure) remains the largest adoption challenge for PIM, which, once overcome, can unleash a fundamentally energy-efficient, high-performance, and sustainable new way of designing, using, and programming computing systems.