论文标题
使用DSP切片作为内容添加的更新队列
Using DSP Slices as Content-Addressable Update Queues
论文作者
论文摘要
内容 - 可调的内存(CAM)是用于构建内存库,路由表和危险检测逻辑的强大抽象。如果没有FPGA设备上可用的天然CAM结构,则必须使用手头的结构原始素来模拟其功能。这种仿真在消费基础资源(通常是通用织物和片上块RAM(BRAM))中引起了显着的开销。这通常会激发减轻权衡的措施,例如减少记忆缓存的关联性。本文介绍了一种在内存更新队列中实现危险分辨率的技术,该技术掩盖了读取模型 - 循环循环的芯片内存读数延迟,同时保证了整个内存带宽的交付。 DSP切片的创新使用使他们可以假设并组合(a)标签和数据存储的功能,(b)标签匹配,以及(c)此键值值存储方案中的数据更新。提出的方法通过将此资源类型添加为实现CAM的另一个选项,为设计师提供了额外的灵活性。
Content-Addressable Memory (CAM) is a powerful abstraction for building memory caches, routing tables and hazard detection logic. Without a native CAM structure available on FPGA devices, their functionality must be emulated using the structural primitives at hand. Such an emulation causes significant overhead in the consumption of the underlying resources, typically general-purpose fabric and on-chip block RAM (BRAM). This often motivates mitigating trade-offs, such as the reduction of the associativity of memory caches. This paper describes a technique to implement the hazard resolution in a memory update queue that hides the off-chip memory readout latency of read-modify-write cycles while guaranteeing the delivery of the full memory bandwidth. The innovative use of DSP slices allows them to assume and combine the functions of (a) the tag and data storage, (b) the tag matching, and (c) the data update in this key-value storage scenario. The proposed approach provides designers with extra flexibility by adding this resource type as another option to implement CAM.