论文标题
BON:扩展人类活动识别的扩展公共领域数据集
BON: An extended public domain dataset for human activity recognition
论文作者
论文摘要
人体戴的第一人称视觉(FPV)摄像头使从受试者的角度提取有关环境的丰富信息来源。但是,与其他活动环境(例如厨房和室外卧床)相比,基于可穿戴摄像头的eg中心办公室活动的研究进展速度很慢,这主要是由于缺乏足够的数据集来培训在办公室环境中人类活动识别的更复杂的(例如,深度学习)模型。本文提供了使用胸部安装的GoPro Hero摄像头,提供了三个地理位置:巴塞罗那(西班牙),牛津(英国)和内罗毕(肯尼亚)的不同办公室设置中收集的大型公开办公活动数据集(BON)的详细信息。 BON数据集包含十八个常见的办公活动,可以将其分为人与人之间的互动(例如,与同事聊天),人对象(例如,在白板上写作)和本体感受(例如,步行)。为5秒钟的视频段提供注释。通常,BON包含25个受试者和2639个分段。为了促进子域中的进一步研究,我们还提供了可以用作将来研究的基准的结果。
Body-worn first-person vision (FPV) camera enables to extract a rich source of information on the environment from the subject's viewpoint. However, the research progress in wearable camera-based egocentric office activity understanding is slow compared to other activity environments (e.g., kitchen and outdoor ambulatory), mainly due to the lack of adequate datasets to train more sophisticated (e.g., deep learning) models for human activity recognition in office environments. This paper provides details of a large and publicly available office activity dataset (BON) collected in different office settings across three geographical locations: Barcelona (Spain), Oxford (UK) and Nairobi (Kenya), using a chest-mounted GoPro Hero camera. The BON dataset contains eighteen common office activities that can be categorised into person-to-person interactions (e.g., Chat with colleagues), person-to-object (e.g., Writing on a whiteboard), and proprioceptive (e.g., Walking). Annotation is provided for each segment of video with 5-seconds duration. Generally, BON contains 25 subjects and 2639 total segments. In order to facilitate further research in the sub-domain, we have also provided results that could be used as baselines for future studies.