论文标题
跨区域油棕榈树通过多层注意域适应网络进行计数和检测
Cross-regional oil palm tree counting and detection via multi-level attention domain adaptation network
论文作者
论文摘要
在大区域中对棕榈树种植园进行准确的评估可以在经济和生态方面产生有意义的影响。但是,基于手动人工监测工作的有限解决方案有限的解决方案,巨大的空间规模和各个地区的地质特征的种类繁多。尽管近年来基于深度学习的算法表现出在形成自动化方法方面的潜力,但是覆盖不同地区的不同特征所需的标签工作在很大程度上限制了其在大规模问题中的有效性。在本文中,我们提出了一种新型的域自适应油棕榈树检测方法,即多级注意域自适应网络(MADAN)以收获跨区域油棕榈树的计数和检测。 MADAN由4个步骤组成:首先,我们采用了基于批处理标准化网络(BIN)的特征提取器来提高模型的概括能力,集成了批处理归一化和实例归一化。其次,我们将一个多级注意机制(MLA)嵌入到我们的体系结构中,以增强可传递性,包括功能级别的注意力和熵级别的关注。然后,我们设计了一个最小熵正则化(MER),以通过将熵级别的注意值分配给熵惩罚来提高分类器预测的置信度。最后,我们采用了基于滑动窗口的预测和基于IOU的后处理方法来获得最终检测结果。我们使用大规模油棕种植园区的三个不同的卫星图像进行了全面的消融实验,并进行了六项转移任务。与基线方法(无DA)相比,MADAN的平均F1评分方面将检测准确性提高了14.98%,并且比现有域适应方法高3.55%-14.49%。
Providing an accurate evaluation of palm tree plantation in a large region can bring meaningful impacts in both economic and ecological aspects. However, the enormous spatial scale and the variety of geological features across regions has made it a grand challenge with limited solutions based on manual human monitoring efforts. Although deep learning based algorithms have demonstrated potential in forming an automated approach in recent years, the labelling efforts needed for covering different features in different regions largely constrain its effectiveness in large-scale problems. In this paper, we propose a novel domain adaptive oil palm tree detection method, i.e., a Multi-level Attention Domain Adaptation Network (MADAN) to reap cross-regional oil palm tree counting and detection. MADAN consists of 4 procedures: First, we adopted a batch-instance normalization network (BIN) based feature extractor for improving the generalization ability of the model, integrating batch normalization and instance normalization. Second, we embedded a multi-level attention mechanism (MLA) into our architecture for enhancing the transferability, including a feature level attention and an entropy level attention. Then we designed a minimum entropy regularization (MER) to increase the confidence of the classifier predictions through assigning the entropy level attention value to the entropy penalty. Finally, we employed a sliding window-based prediction and an IOU based post-processing approach to attain the final detection results. We conducted comprehensive ablation experiments using three different satellite images of large-scale oil palm plantation area with six transfer tasks. MADAN improves the detection accuracy by 14.98% in terms of average F1-score compared with the Baseline method (without DA), and performs 3.55%-14.49% better than existing domain adaptation methods.