论文标题
FP-STEREO:嵌入式应用程序的硬件有效的立体声愿景
FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications
论文作者
论文摘要
快速准确的深度估计或立体声匹配在嵌入式立体声视觉系统中至关重要,需要大量的设计工作才能在准确性,速度和硬件成本之间达到适当的平衡。为了减少设计工作并实现适当的平衡,我们提出了FP-STEREO,用于自动在FPGA上构建高性能立体声匹配管道。 FP-STEREO由一个开源硬件有效的库组成,允许设计人员立即获得所需的实现。在立体声匹配管道的每个阶段,我们的库中支持了各种方法,并开发了一系列技术来利用并行性并减少资源开销。为了提高可用性,FP-STEREO可以通过我们优化的HLS模板自动生成FPGA加速器的合成C代码。为了指导用户正确的设计选择满足特定的应用程序要求,对我们库的各种配置进行了详细的比较,以调查准确性/速度/成本权衡。实验结果还表明,FP-STEREO的表现优于各个方面的最新FPGA设计,包括降低6.08%的误差,速度更快2倍,资源使用率降低30%,能源消耗降低40%。与GPU设计相比,FP-STEREO以竞争速度达到相同的精度,而消耗能量却少得多。
Fast and accurate depth estimation, or stereo matching, is essential in embedded stereo vision systems, requiring substantial design effort to achieve an appropriate balance among accuracy, speed and hardware cost. To reduce the design effort and achieve the right balance, we propose FP-Stereo for building high-performance stereo matching pipelines on FPGAs automatically. FP-Stereo consists of an open-source hardware-efficient library, allowing designers to obtain the desired implementation instantly. Diverse methods are supported in our library for each stage of the stereo matching pipeline and a series of techniques are developed to exploit the parallelism and reduce the resource overhead. To improve the usability, FP-Stereo can generate synthesizable C code of the FPGA accelerator with our optimized HLS templates automatically. To guide users for the right design choice meeting specific application requirements, detailed comparisons are performed on various configurations of our library to investigate the accuracy/speed/cost trade-off. Experimental results also show that FP-Stereo outperforms the state-of-the-art FPGA design from all aspects, including 6.08% lower error, 2x faster speed, 30% less resource usage and 40% less energy consumption. Compared to GPU designs, FP-Stereo achieves the same accuracy at a competitive speed while consuming much less energy.