论文标题
双向图推理网络,用于泛型分割
Bidirectional Graph Reasoning Network for Panoptic Segmentation
论文作者
论文摘要
最新的关于综合分段的研究诉诸于单个端到端网络,以结合实例分割和语义分割的任务。但是,先前的模型仅通过多分支方案在架构层面上统一了两个相关任务,或者通过单向特征融合揭示了它们之间的潜在相关性,这无视对象和背景之间的显式语义和共发生关系。受到上下文信息对于识别和本地化对象至关重要的事实的启发,包容性对象细节对于解析背景场景至关重要,因此我们研究了对对象和背景之间的相关性进行明确建模,以在泛型段任务中对图像进行整体了解。我们介绍了一个双向图推理网络(BGRNET),该网络将图形结构纳入常规的全景分割网络中,以挖掘前景事物和背景物质类别内和之间的模块化内和模型关系。特别是,BGRNET首先在实例和语义分割分支中构建特定于图像的图,分别在建议级别和班级级别上启用灵活的推理。为了建立单独的分支之间的相关性并充分利用事物和事物之间的互补关系,我们提出了一个双向图连接模块,以可学习的方式跨分支扩散信息。实验结果证明了我们的BGRNET的优势,该BGRNET在具有挑战性的可可和ADE20K泛型分割基准上实现了新的最新性能。
Recent researches on panoptic segmentation resort to a single end-to-end network to combine the tasks of instance segmentation and semantic segmentation. However, prior models only unified the two related tasks at the architectural level via a multi-branch scheme or revealed the underlying correlation between them by unidirectional feature fusion, which disregards the explicit semantic and co-occurrence relations among objects and background. Inspired by the fact that context information is critical to recognize and localize the objects, and inclusive object details are significant to parse the background scene, we thus investigate on explicitly modeling the correlations between object and background to achieve a holistic understanding of an image in the panoptic segmentation task. We introduce a Bidirectional Graph Reasoning Network (BGRNet), which incorporates graph structure into the conventional panoptic segmentation network to mine the intra-modular and intermodular relations within and between foreground things and background stuff classes. In particular, BGRNet first constructs image-specific graphs in both instance and semantic segmentation branches that enable flexible reasoning at the proposal level and class level, respectively. To establish the correlations between separate branches and fully leverage the complementary relations between things and stuff, we propose a Bidirectional Graph Connection Module to diffuse information across branches in a learnable fashion. Experimental results demonstrate the superiority of our BGRNet that achieves the new state-of-the-art performance on challenging COCO and ADE20K panoptic segmentation benchmarks.