无监督的时间视频分割是预测剩余手术持续时间的辅助任务

论文标题

无监督的时间视频分割是预测剩余手术持续时间的辅助任务

Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration

论文作者

Rivoir, Dominik, Bodenstedt, Sebastian, von Bechtolsheim, Felix, Distler, Marius, Weitz, Jürgen, Speidel, Stefanie

论文摘要

估计手术过程中剩余的手术持续时间（RSD）可能对或计划和麻醉剂量估计有用。随着在计算机视觉中基于深度学习的方法的最新成功，仅基于内镜摄像头的视觉数据，已经提出了几种全自动RSD预测的神经网络方法。我们调查使用无监督的时间视频细分作为辅助学习任务，是否可以改善RSD预测。与以前的工作相反，该工作将监督的手术阶段识别作为辅助任务，我们通过提出一个类似但无监督的学习目标来避免需要手动注释，从而将视频序列插入暂时的连贯段。在多个实验设置中，通过学习辅助任务获得的结果通过特征提取，预处理或正则化纳入了深层RSD模型。此外，我们为RSD训练提出了一种新颖的损失功能，该功能试图抵消RSD地面真相的不利特征。使用我们的无监督方法作为RSD培训的辅助任务，我们的表现优于其他自我监督的方法，并且可以与受监督的最先进的方法相媲美。结合新颖的RSD损失，我们的表现略高于监督方法。

Estimating the remaining surgery duration (RSD) during surgical procedures can be useful for OR planning and anesthesia dose estimation. With the recent success of deep learning-based methods in computer vision, several neural network approaches have been proposed for fully automatic RSD prediction based solely on visual data from the endoscopic camera. We investigate whether RSD prediction can be improved using unsupervised temporal video segmentation as an auxiliary learning task. As opposed to previous work, which presented supervised surgical phase recognition as auxiliary task, we avoid the need for manual annotations by proposing a similar but unsupervised learning objective which clusters video sequences into temporally coherent segments. In multiple experimental setups, results obtained by learning the auxiliary task are incorporated into a deep RSD model through feature extraction, pretraining or regularization. Further, we propose a novel loss function for RSD training which attempts to counteract unfavorable characteristics of the RSD ground truth. Using our unsupervised method as an auxiliary task for RSD training, we outperform other self-supervised methods and are comparable to the supervised state-of-the-art. Combined with the novel RSD loss, we slightly outperform the supervised approach.

下载PDF全文

下载文献需遵守相关版权规定

论文标题