论文标题
使用社交媒体图像进行构建功能分类
Using Social Media Images for Building Function Classification
论文作者
论文摘要
建筑物实例水平上的城市土地利用对于许多应用来说是至关重要的地理信息,但很难获得。缩小这一差距的直观方法是从地面图像预测建筑物的功能。社交媒体图像平台包含数十亿张图像,其中包含各种各样的图像,包括但不限于街头视角。为了应付此问题,本研究提出了一条过滤管道,以从大型社交媒体图像数据集中产生高质量的地面图像。管道确保所有结果图像都具有完整且有效的地理标签,并带有指南针方向,以将图像内容和空间对象与地图相关联。 我们分析了来自Flickr的文化多样性社交媒体数据集的方法,其中包含来自全球42个城市的2800万张图像。然后在3类构建功能分类任务的上下文中评估所获得的数据集。这项研究中考虑的三个建筑类别是:商业,住宅和其他建筑物。微调的最新体系结构在过滤的图像上产生高达0.51的F1得分。我们的分析表明,该性能受到从OpenStreetMap获得的标签的质量的限制,如果仅考虑人类验证的标签,指标会增加0.2。因此,我们认为这些标签是薄弱的,并将管道中的所得图像以及它们显示为弱标记的数据集的建筑物。
Urban land use on a building instance level is crucial geo-information for many applications, yet difficult to obtain. An intuitive approach to close this gap is predicting building functions from ground level imagery. Social media image platforms contain billions of images, with a large variety of motifs including but not limited to street perspectives. To cope with this issue this study proposes a filtering pipeline to yield high quality, ground level imagery from large social media image datasets. The pipeline ensures that all resulting images have full and valid geotags with a compass direction to relate image content and spatial objects from maps. We analyze our method on a culturally diverse social media dataset from Flickr with more than 28 million images from 42 cities around the world. The obtained dataset is then evaluated in a context of 3-classes building function classification task. The three building classes that are considered in this study are: commercial, residential, and other. Fine-tuned state-of-the-art architectures yield F1-scores of up to 0.51 on the filtered images. Our analysis shows that the performance is highly limited by the quality of the labels obtained from OpenStreetMap, as the metrics increase by 0.2 if only human validated labels are considered. Therefore, we consider these labels to be weak and publish the resulting images from our pipeline together with the buildings they are showing as a weakly labeled dataset.