论文标题
通过自我培训对替代任务进行的半监督人群进行计数
Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks
论文作者
论文摘要
大多数现有的人群计数系统都取决于对象位置注释的可用性,这可能是昂贵的。为了降低注释成本,一种有吸引力的解决方案是利用大量未标记的图像以半监督的方式构建人群计数模型。本文从特征学习的角度解决了半监督的人群计数问题。我们的关键想法是利用未标记的图像来训练通用功能提取器,而不是人群计数器的整个网络。该设计的理由是,学习特征提取器可以更可靠,更健壮,以对未标记数据产生的不可避免的嘈杂监督。另外,除了良好的功能提取器之外,还可以构建具有更少密度映射注释的密度映射回归器。具体而言,我们提出了一种新颖的半监督人群计数方法,该方法建立在两个创新组件上:(1)一组相关的二进制分割任务源自原始密度映射映射任务作为替代预测目标; (2)通过利用提出的自我训练方案从标记和未标记的数据中学到替代目标预测因子,该方案完全利用了这些二进制分割任务的基本约束。通过实验,我们表明所提出的方法优于现有的半监视人群计数方法和其他代表性基准。
Most existing crowd counting systems rely on the availability of the object location annotation which can be expensive to obtain. To reduce the annotation cost, one attractive solution is to leverage a large number of unlabeled images to build a crowd counting model in semi-supervised fashion. This paper tackles the semi-supervised crowd counting problem from the perspective of feature learning. Our key idea is to leverage the unlabeled images to train a generic feature extractor rather than the entire network of a crowd counter. The rationale of this design is that learning the feature extractor can be more reliable and robust towards the inevitable noisy supervision generated from the unlabeled data. Also, on top of a good feature extractor, it is possible to build a density map regressor with much fewer density map annotations. Specifically, we proposed a novel semi-supervised crowd counting method which is built upon two innovative components: (1) a set of inter-related binary segmentation tasks are derived from the original density map regression task as the surrogate prediction target; (2) the surrogate target predictors are learned from both labeled and unlabeled data by utilizing a proposed self-training scheme which fully exploits the underlying constraints of these binary segmentation tasks. Through experiments, we show that the proposed method is superior over the existing semisupervised crowd counting method and other representative baselines.