论文标题
人群与人群注意力卷积神经网络计数
Crowd counting with crowd attention convolutional neural network
论文作者
论文摘要
由于场景的复杂性和规模变化,人群计数是一个具有挑战性的问题。尽管深度学习在人群计数上取得了很大的改善,但场景的复杂性会影响这些方法的判断,并且通常将某些物体视为错误的人。在人群计数结果中导致潜在的巨大错误。为了解决这个问题,我们提出了一种新颖的端到端模型,称为人群注意力卷积神经网络(CAT-CNN)。我们的CAT-CNN可以通过自动编码置信图来适应地评估每个像素位置的人头的重要性。在置信图的指导下,人头在估计密度图中的位置更加关注以编码最终密度图,这可以有效地避免巨大的错误判断。人群数可以通过集成最终密度图来获得。为了编码高度精制的密度图,每个图像的总人群数量都在设计的分类任务中分类,我们首先明确映射了总体级别类别的先验以配备地图。为了验证我们提出的方法的效率,在三个高度挑战的数据集上进行了广泛的实验。结果确定了我们方法比许多最新方法的优越性。
Crowd counting is a challenging problem due to the scene complexity and scale variation. Although deep learning has achieved great improvement in crowd counting, scene complexity affects the judgement of these methods and they usually regard some objects as people mistakenly; causing potentially enormous errors in the crowd counting result. To address the problem, we propose a novel end-to-end model called Crowd Attention Convolutional Neural Network (CAT-CNN). Our CAT-CNN can adaptively assess the importance of a human head at each pixel location by automatically encoding a confidence map. With the guidance of the confidence map, the position of human head in estimated density map gets more attention to encode the final density map, which can avoid enormous misjudgements effectively. The crowd count can be obtained by integrating the final density map. To encode a highly refined density map, the total crowd count of each image is classified in a designed classification task and we first explicitly map the prior of the population-level category to feature maps. To verify the efficiency of our proposed method, extensive experiments are conducted on three highly challenging datasets. Results establish the superiority of our method over many state-of-the-art methods.