深入学习，再加上新型分类方法来对发展中国家的城市环境进行分类

论文标题

深入学习，再加上新型分类方法来对发展中国家的城市环境进行分类

Deep-learning coupled with novel classification method to classify the urban environment of the developing world

论文作者

Cheng, Qianwei, Rahman, AKM Mahbubur, Sarker, Anis, Nayem, Abu Bakar Siddik, Paul, Ovi, Ali, Amin Ahsan, Amin, M Ashraful, Shibasaki, Ryosuke, Zaber, Moinul

论文摘要

快速的全球化和人类的相互依存关系，使人类向城市空间移民的巨大流动。随着高清卫星图像的出现，高分辨率数据，诸如深神经网络之类的计算方法，功能强大的硬件；城市规划正在发生范式转变。现在，关于城市环境的旧数据数据已得到大量高频数据的补充。在本文中，我们提出了一种新颖的分类方法，该方法很容易用于机器分析，并显示了该方法在发展中国家环境中的适用性。最先进的人主要由建筑结构，建筑类型等的分类来控制，并且在很大程度上代表了发达国家，这对于孟加拉国等发展中国家不足以使周围的环境对分类至关重要。此外，传统方法提出了小规模的分类，这些分类为有限的信息提供了较差的可扩展性且计算缓慢。我们根据非正式和正式空间对城市地区进行分类。 50公里x 50公里的50公里，孟加拉国达卡的Google Earth Image在视觉上注释和分类。该分类广泛基于两个维度：城市化和城市环境的建筑形式。因此，城市空间分为四个类：1）高度非正式； 2）适度的非正式； 3）适度正式； 4）高度正式的领域。总共确定了16个子类。对于语义细分，使用了Google的DeepLabv3+模型，从而增加了过滤器的视野以结合更大的上下文。包括70％城市空间的图像用于训练，其余30％用于测试和验证。该模型能够以75％的精度分割，而60％的平均值IOU。

Rapid globalization and the interdependence of humanity that engender tremendous in-flow of human migration towards the urban spaces. With advent of high definition satellite images, high resolution data, computational methods such as deep neural network, capable hardware; urban planning is seeing a paradigm shift. Legacy data on urban environments are now being complemented with high-volume, high-frequency data. In this paper we propose a novel classification method that is readily usable for machine analysis and show applicability of the methodology on a developing world setting. The state-of-the-art is mostly dominated by classification of building structures, building types etc. and largely represents the developed world which are insufficient for developing countries such as Bangladesh where the surrounding is crucial for the classification. Moreover, the traditional methods propose small-scale classifications, which give limited information with poor scalability and are slow to compute. We categorize the urban area in terms of informal and formal spaces taking the surroundings into account. 50 km x 50 km Google Earth image of Dhaka, Bangladesh was visually annotated and categorized by an expert. The classification is based broadly on two dimensions: urbanization and the architectural form of urban environment. Consequently, the urban space is divided into four classes: 1) highly informal; 2) moderately informal; 3) moderately formal; and 4) highly formal areas. In total 16 sub-classes were identified. For semantic segmentation, Google's DeeplabV3+ model was used which increases the field of view of the filters to incorporate larger context. Image encompassing 70% of the urban space was used for training and the remaining 30% was used for testing and validation. The model is able to segment with 75% accuracy and 60% Mean IoU.

下载PDF全文

下载文献需遵守相关版权规定

论文标题