论文标题

视觉变压器和基于Yolov5的驱动器嗜睡检测框架

Vision Transformers and YoloV5 based Driver Drowsiness Detection Framework

论文作者

Krishna, Ghanta Sai, Supriya, Kundrapu, Vardhan, Jai, K, Mallikharjuna Rao

论文摘要

由于独特的驾驶特征,人类驾驶员具有独特的驾驶技术,知识和情感。驾驶员昏昏欲睡一直是一个严重的问题,危害道路安全。因此,必须设计有效的嗜睡检测算法以绕过道路事故,这是至关重要的。杂项研究工作已经解决了检测异常的人类驾驶员行为的问题,以通过计算机视觉技术检查驾驶员的正面和汽车动力学。尽管如此,常规方法仍无法捕获复杂的驾驶员行为特征。但是,以深度学习体系结构的起源,还进行了大量的研究来分析和使用神经网络算法识别驾驶员的嗜睡。本文介绍了一个基于视觉变形金刚和Yolov5架构的新型框架,以实现驱动器嗜睡识别。提出了定制的Yolov5预训练的建筑,以提取面部提取,目的是提取感兴趣的区域(ROI)。由于以前的体系结构的局限性,本文引入了视觉变压器进行二进制图像分类,该二进制图像分类在公共数据集UTA-RLDD上经过培训和验证。该模型分别达到了96.2 \%和97.4 \%的培训和验证精度。对于进一步的评估,在各种光明情况下的39名参与者的自定义数据集上测试了拟议的框架,并达到了95.5 \%的准确性。进行的实验揭示了我们在智能运输系统中实用应用框架的重要潜力。

Human drivers have distinct driving techniques, knowledge, and sentiments due to unique driving traits. Driver drowsiness has been a serious issue endangering road safety; therefore, it is essential to design an effective drowsiness detection algorithm to bypass road accidents. Miscellaneous research efforts have been approached the problem of detecting anomalous human driver behaviour to examine the frontal face of the driver and automobile dynamics via computer vision techniques. Still, the conventional methods cannot capture complicated driver behaviour features. However, with the origin of deep learning architectures, a substantial amount of research has also been executed to analyze and recognize driver's drowsiness using neural network algorithms. This paper introduces a novel framework based on vision transformers and YoloV5 architectures for driver drowsiness recognition. A custom YoloV5 pre-trained architecture is proposed for face extraction with the aim of extracting Region of Interest (ROI). Owing to the limitations of previous architectures, this paper introduces vision transformers for binary image classification which is trained and validated on a public dataset UTA-RLDD. The model had achieved 96.2\% and 97.4\% as it's training and validation accuracies respectively. For the further evaluation, proposed framework is tested on a custom dataset of 39 participants in various light circumstances and achieved 95.5\% accuracy. The conducted experimentations revealed the significant potential of our framework for practical applications in smart transportation systems.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源