论文标题

在赛车运动中实现AI驱动的语义角色识别

Implementing AI-powered semantic character recognition in motor racing sports

论文作者

Rodríguez, Jose David Fernández, Molina, David Daniel Albarracín, Cebolla, Jesús Hormigo

论文摘要

电视节目程序的电视制作人通常会覆盖视觉和文本媒体,以提供有关驾驶员的屏幕上下文,例如驾驶员的名称,位置或照片。通常,这是由人类制作人完成的,他在屏幕上可以视觉识别驱动程序,手动切换与每个人关联的上下文媒体,并与摄影师和其他电视制作人协调,以使赛车手保持射击,而上下文媒体则在屏幕上。这个劳动密集型和高度专用的过程主要适合静态叠加层,并且很难在短镜头中同时覆盖许多驱动程序的上下文信息。本文介绍了一个系统,该系统在很大程度上可以自动化这些任务,并可以使用深度学习使动态叠加层在屏幕上移动时跟踪驱动程序,而无需人工干预。该系统不仅是理论上的,而且在现场比赛中,电视制作公司在Formula E比赛中已经部署了实施。我们提出了实施过程中面临的挑战,并讨论了含义。此外,我们涵盖了这一新技术发展的未来应用和路线图。

Oftentimes TV producers of motor-racing programs overlay visual and textual media to provide on-screen context about drivers, such as a driver's name, position or photo. Typically this is accomplished by a human producer who visually identifies the drivers on screen, manually toggling the contextual media associated to each one and coordinating with cameramen and other TV producers to keep the racer in the shot while the contextual media is on screen. This labor-intensive and highly dedicated process is mostly suited to static overlays and makes it difficult to overlay contextual information about many drivers at the same time in short shots. This paper presents a system that largely automates these tasks and enables dynamic overlays using deep learning to track the drivers as they move on screen, without human intervention. This system is not merely theoretical, but an implementation has already been deployed during live races by a TV production company at Formula E races. We present the challenges faced during the implementation and discuss the implications. Additionally, we cover future applications and roadmap of this new technological development.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源