说话者的最先进

论文标题

说话者的最先进

State-of-the-art in speaker recognition

论文作者

Faundez-Zanuy, Marcos, Monte-Moreno, Enric

论文摘要

语音技术的最新进展产生了新工具，可用于提高扬声器识别的性能和灵活性，而在使用指纹或IRIS识别技术时，几乎没有自由度或替代方法的程度，语音提供了更大的灵活性和不同的级别来执行识别：该系统可以迫使用户以每种特定的方式说话，以每种尝试来输入。同样，使用语音输入，系统具有其他自由度，例如只有用户知道的知识/代码或难以伪造的辩证/语义特征。本文以演讲者的认可提供并概述艺术的状态，并特别强调了专业人士和反对派以及当前的研究行。当前的研究线包括改进的分类系统，以及通过概率语法使用高水平信息。总之，说话者的认可与已经探索了所有可能性的技术相去甚远。

Recent advances in speech technologies have produced new tools that can be used to improve the performance and flexibility of speaker recognition While there are few degrees of freedom or alternative methods when using fingerprint or iris identification techniques, speech offers much more flexibility and different levels for performing recognition: the system can force the user to speak in a particular manner, different for each attempt to enter. Also with voice input the system has other degrees of freedom, such as the use of knowledge/codes that only the user knows, or dialectical/semantical traits that are difficult to forge. This paper offers and overview of the state of the art in speaker recognition, with special emphasis on the pros and contras, and the current research lines. The current research lines include improved classification systems, and the use of high level information by means of probabilistic grammars. In conclusion, speaker recognition is far away from being a technology where all the possibilities have already been explored.

下载PDF全文

下载文献需遵守相关版权规定

论文标题