论文标题
有效的人群通过结构化知识转移计数
Efficient Crowd Counting via Structured Knowledge Transfer
论文作者
论文摘要
人群计数是一项面向应用程序的任务,其推论效率对于现实世界应用至关重要。但是,大多数以前的作品都依赖于重型骨干网络,并且需要过度的运行时间消耗,这将严重限制其部署范围并导致较差的可扩展性。为了解放这些人群计数模型,我们提出了一个新颖的结构化知识转移(SKT)框架,该框架完全利用了训练有素的教师网络的结构化知识,以产生轻巧但仍然有效的学生网络。 Specifically, it is integrated with two complementary transfer modules, including an Intra-Layer Pattern Transfer which sequentially distills the knowledge embedded in layer-wise features of the teacher network to guide feature learning of the student network and an Inter-Layer Relation Transfer which densely distills the cross-layer correlation knowledge of the teacher to regularize the student's feature evolutio Consequently, our student network can derive the layer-wise and cross-layer knowledge from the teacher network学习紧凑而有效的功能。对三个基准测试的广泛评估很好地证明了我们的SKT对广泛的人群计数模型的有效性。特别是,仅使用原始型号的参数和计算成本的$ 6 \%$,我们基于VGG的型号至少在NVIDIA 1080 GPU上获得至少6.5 $ \ times $加速,甚至达到了最新的性能。我们的代码和模型可在{\ url {https://github.com/hcplab-sysu/skt}}上获得。
Crowd counting is an application-oriented task and its inference efficiency is crucial for real-world applications. However, most previous works relied on heavy backbone networks and required prohibitive run-time consumption, which would seriously restrict their deployment scopes and cause poor scalability. To liberate these crowd counting models, we propose a novel Structured Knowledge Transfer (SKT) framework, which fully exploits the structured knowledge of a well-trained teacher network to generate a lightweight but still highly effective student network. Specifically, it is integrated with two complementary transfer modules, including an Intra-Layer Pattern Transfer which sequentially distills the knowledge embedded in layer-wise features of the teacher network to guide feature learning of the student network and an Inter-Layer Relation Transfer which densely distills the cross-layer correlation knowledge of the teacher to regularize the student's feature evolutio Consequently, our student network can derive the layer-wise and cross-layer knowledge from the teacher network to learn compact yet effective features. Extensive evaluations on three benchmarks well demonstrate the effectiveness of our SKT for extensive crowd counting models. In particular, only using around $6\%$ of the parameters and computation cost of original models, our distilled VGG-based models obtain at least 6.5$\times$ speed-up on an Nvidia 1080 GPU and even achieve state-of-the-art performance. Our code and models are available at {\url{https://github.com/HCPLab-SYSU/SKT}}.