论文标题

部分可观测时空混沌系统的无模型预测

A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN

论文作者

Bai, Lin, Lyu, Yecheng, Huang, Xinming

论文摘要

In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum.例如,通过针对Xilinx ZC706 FPGA来实现SEGNET-BASIC,它分别实现了Xilinx ZC706 FPGA,分别实现了151.5 GOPS和94.3 GOPS的性能,用于卷积和反向卷积。 This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.

In this paper, a scalable neural network hardware architecture for image segmentation is proposed. By sharing the same computing resources, both convolution and deconvolution operations are handled by the same process element array. In addition, access to on-chip and off-chip memories is optimized to alleviate the burden introduced by partial sum. As an example, SegNet-Basic has been implemented using the proposed unified architecture by targeting on Xilinx ZC706 FPGA, which achieves the performance of 151.5 GOPS and 94.3 GOPS for convolution and deconvolution respectively. This unified convolution/deconvolution design is applicable to other CNNs with deconvolution.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源