论文标题

与机器人的语言互动:研究问题和建议,从NSF未来方向报告

Spoken Language Interaction with Robots: Research Issues and Recommendations, Report from the NSF Future Directions Workshop

论文作者

Marge, Matthew, Espy-Wilson, Carol, Ward, Nigel

论文摘要

随着机器人技术的迅速发展,越来越需要更有效的人类机器人互动来实现机器人对社会的全部潜力。尽管口语必须是解决方案的一部分,但我们提供口语互动功能的能力仍然非常有限。因此,国家科学基金会召集了一个研讨会,将言语,语言和机器人研究人员汇总在一起,讨论需要做什么。结果是该报告,在其中确定所需的关键科学和工程进步。 我们的建议与八个一般主题广泛有关。首先,满足人类需求需要应对语音技术和用户体验设计方面的新挑战。其次,这需要更好地模型的语言使用的社会和互动方面。第三,为了鲁棒性,机器人需要与用户进行更高的宽宽沟通以及更好地处理不确定性,包括同时考虑多个假设和目标。需要第四,更强大的适应方法,以使机器人能够在新环境,新任务以及不同的用户群体中进行通信,而无需大量重新设计或收集大量的培训数据。第五,由于机器人是体现的,因此语音应与其他交流方式一起发挥作用,例如凝视,手势,姿势和运动。第六,由于机器人在复杂的环境中运行,因此语音组件需要访问机器人对对象,位置,噪声源,用户和其他人的了解的丰富而有效的表示。第七,由于机器人是实时运行的,因此他们的语音和语言处理组件也必须。除了更多的研究之外,我们还需要在基础架构和资源方面进行更多工作,包括可共享的软件模块和内部接口,廉价的硬件,基线系统和多样化的语料库。

With robotics rapidly advancing, more effective human-robot interaction is increasingly needed to realize the full potential of robots for society. While spoken language must be part of the solution, our ability to provide spoken language interaction capabilities is still very limited. The National Science Foundation accordingly convened a workshop, bringing together speech, language, and robotics researchers to discuss what needs to be done. The result is this report, in which we identify key scientific and engineering advances needed. Our recommendations broadly relate to eight general themes. First, meeting human needs requires addressing new challenges in speech technology and user experience design. Second, this requires better models of the social and interactive aspects of language use. Third, for robustness, robots need higher-bandwidth communication with users and better handling of uncertainty, including simultaneous consideration of multiple hypotheses and goals. Fourth, more powerful adaptation methods are needed, to enable robots to communicate in new environments, for new tasks, and with diverse user populations, without extensive re-engineering or the collection of massive training data. Fifth, since robots are embodied, speech should function together with other communication modalities, such as gaze, gesture, posture, and motion. Sixth, since robots operate in complex environments, speech components need access to rich yet efficient representations of what the robot knows about objects, locations, noise sources, the user, and other humans. Seventh, since robots operate in real time, their speech and language processing components must also. Eighth, in addition to more research, we need more work on infrastructure and resources, including shareable software modules and internal interfaces, inexpensive hardware, baseline systems, and diverse corpora.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源