论文标题
在土著社区中启用互动转录
Enabling Interactive Transcription in an Indigenous Community
论文作者
论文摘要
我们提出了一种新颖的转录工作流程,将口头术语检测和人类在循环中结合在一起,以及试点实验。这项工作基于几乎为零的资源场景,到目前为止,只有几个术语涉及两种濒临灭绝的语言。我们表明,在转录的早期阶段,当可用数据不足以训练强大的ASR系统时,可以利用少量孤立单词的转录来引导语音集合的转录。
We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, it is possible to take advantage of the transcription of a small number of isolated words in order to bootstrap the transcription of a speech collection.