论文标题
RankSeg:一个基于排名的一致框架
RankSEG: A Consistent Ranking-based Framework for Segmentation
论文作者
论文摘要
细分已成为计算机视觉和自然语言处理的基本领域,该领域将标签分配给每个像素/功能,以从图像/文本中提取感兴趣的区域。为了评估分割的性能,骰子和IOU指标用于衡量地面真理与预测的分割之间的重叠程度。在本文中,我们建立了关于骰子/IOU指标的理论基础,包括贝叶斯规则和骰子 - /iou-calibration,类似于分类 - 校准或分类中的Fisher一致性。我们证明,具有大多数操作损失的现有基于阈值的框架在骰子/IOU指标方面不一致,因此可能导致次优的解决方案。为了解决这个陷阱,我们提出了一个基于排名的框架,即rankdice/starkiou,灵感来自贝叶斯细分规则的启发。开发了具有GPU并行执行的三种数值算法,以在大规模和高维分段中实现所提出的框架。我们研究所提出的框架的统计特性。我们表明它是骰子 - 校准的,其多余的风险范围和收敛速度也提供了。 Rankdice/Mrankdice的数值有效性在各种模拟示例和精细注销的CityScapes,Pascal VOC和具有最先进的深度学习架构的Kvasir-Seg数据集中得到了证明。
Segmentation has emerged as a fundamental field of computer vision and natural language processing, which assigns a label to every pixel/feature to extract regions of interest from an image/text. To evaluate the performance of segmentation, the Dice and IoU metrics are used to measure the degree of overlap between the ground truth and the predicted segmentation. In this paper, we establish a theoretical foundation of segmentation with respect to the Dice/IoU metrics, including the Bayes rule and Dice-/IoU-calibration, analogous to classification-calibration or Fisher consistency in classification. We prove that the existing thresholding-based framework with most operating losses are not consistent with respect to the Dice/IoU metrics, and thus may lead to a suboptimal solution. To address this pitfall, we propose a novel consistent ranking-based framework, namely RankDice/RankIoU, inspired by plug-in rules of the Bayes segmentation rule. Three numerical algorithms with GPU parallel execution are developed to implement the proposed framework in large-scale and high-dimensional segmentation. We study statistical properties of the proposed framework. We show it is Dice-/IoU-calibrated, and its excess risk bounds and the rate of convergence are also provided. The numerical effectiveness of RankDice/mRankDice is demonstrated in various simulated examples and Fine-annotated CityScapes, Pascal VOC and Kvasir-SEG datasets with state-of-the-art deep learning architectures.