BI3D：通过二元分类进行立体声深度估计

论文标题

BI3D：通过二元分类进行立体声深度估计

Bi3D: Stereo Depth Estimation via Binary Classifications

论文作者

Badki, Abhishek, Troccoli, Alejandro, Kim, Kihwan, Kautz, Jan, Sen, Pradeep, Gallo, Orazio

论文摘要

基于立体声的深度估计是计算机视觉的基石，最先进的方法可实时提供准确的结果。但是，对于几种自主导航等应用程序，对于较低的延迟而言，交易准确性可能很有用。我们提出BI3D，该方法通过一系列二元分类来估算深度。它没有像现有的立体声方法那样测试对象是否处于特定的深度$ d $，而是将它们归类为远比$ d $更近或更远。该属性提供了平衡准确性和延迟的强大机制。给定严格的时间预算，BI3D可以在几毫秒内检测到比给定距离更接近的对象，或者以任意粗略的量化估算深度，并具有量化水平数的复杂性线性。 BI3D还可以使用分配的量化水平来获得连续的深度，但在特定的深度范围内。对于标准立体声（即在整个范围内连续深度），我们的方法靠近或与最新的，精心调整的立体声方法相提并论。

Stereo-based depth estimation is a cornerstone of computer vision, with state-of-the-art methods delivering accurate results in real time. For several applications such as autonomous navigation, however, it may be useful to trade accuracy for lower latency. We present Bi3D, a method that estimates depth via a series of binary classifications. Rather than testing if objects are at a particular depth $D$, as existing stereo methods do, it classifies them as being closer or farther than $D$. This property offers a powerful mechanism to balance accuracy and latency. Given a strict time budget, Bi3D can detect objects closer than a given distance in as little as a few milliseconds, or estimate depth with arbitrarily coarse quantization, with complexity linear with the number of quantization levels. Bi3D can also use the allotted quantization levels to get continuous depth, but in a specific depth range. For standard stereo (i.e., continuous depth on the whole range), our method is close to or on par with state-of-the-art, finely tuned stereo methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题