论文标题
BWCNN:眨眼到Word,一种实时卷积神经网络方法
BWCNN: Blink to Word, a Real-Time Convolutional Neural Network Approach
论文作者
论文摘要
肌萎缩性外侧硬化症(ALS)是大脑和脊髓的进行性神经退行性疾病,导致运动功能麻痹。患者保留眨眼的能力,可用于交流。在这里,我们提出了一个人工智能(AI)系统,该系统使用眼光闪烁与外界进行通信,并在实时互联网(IoT)设备上运行。该系统使用卷积神经网络(CNN)找到闪烁的模式,该模式被定义为一系列开放和封闭状态。每个模式都映射到表现出患者意图的单词集合。为了调查准确性和延迟之间的最佳权衡,我们研究了几种卷积网络架构,例如Resnet,Squeezenet,Densenet和IntectionV3,并评估了它们的性能。我们发现,在对特定任务进行了高参数进行微调后,InceptionV3体系结构以99.20%和94ms延迟的准确性导致了最佳性能。这项工作表明,如何适应深度学习体系结构的最新进展,以适应能够改善患者生活质量的临床系统。
Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disease of the brain and the spinal cord, which leads to paralysis of motor functions. Patients retain their ability to blink, which can be used for communication. Here, We present an Artificial Intelligence (AI) system that uses eye-blinks to communicate with the outside world, running on real-time Internet-of-Things (IoT) devices. The system uses a Convolutional Neural Network (CNN) to find the blinking pattern, which is defined as a series of Open and Closed states. Each pattern is mapped to a collection of words that manifest the patient's intent. To investigate the best trade-off between accuracy and latency, we investigated several Convolutional Network architectures, such as ResNet, SqueezeNet, DenseNet, and InceptionV3, and evaluated their performance. We found that the InceptionV3 architecture, after hyper-parameter fine-tuning on the specific task led to the best performance with an accuracy of 99.20% and 94ms latency. This work demonstrates how the latest advances in deep learning architectures can be adapted for clinical systems that ameliorate the patient's quality of life regardless of the point-of-care.