论文标题

Poetictts-文学研究的可控诗阅读

PoeticTTS -- Controllable Poetry Reading for Literary Studies

论文作者

Koch, Julia, Lux, Florian, Schauffler, Nadja, Bernhart, Toni, Dieterle, Felix, Kuhn, Jonas, Richter, Sandra, Viehhauser, Gabriel, Vu, Ngoc Thang

论文摘要

由于诗意语音固有的特定语调模式,诗歌的语音综合是具有挑战性的。在这项工作中,我们提出了一种将诗歌与几乎像人类一样自然的综合诗的方法,以使文学学者能够系统地检查有关文本,口语实现和听众对诗歌的相互作用的假设。为了满足文学研究的这些特殊要求,我们通过从人类参考朗诵中克隆韵律价值来重新合成诗,然后利用细粒度的韵律控制来操纵在人类的环境中的合成语音来改变朗诵W.R.T. W.R.T.具体现象。我们发现,对诗歌的TTS模型进行鉴定会在很大程度上捕捉诗歌语调模式,这对韵律克隆和操纵非常有益,并在客观评估和人类研究中都验证了我们方法的成功。

Speech synthesis for poetry is challenging due to specific intonation patterns inherent to poetic speech. In this work, we propose an approach to synthesise poems with almost human like naturalness in order to enable literary scholars to systematically examine hypotheses on the interplay between text, spoken realisation, and the listener's perception of poems. To meet these special requirements for literary studies, we resynthesise poems by cloning prosodic values from a human reference recitation, and afterwards make use of fine-grained prosody control to manipulate the synthetic speech in a human-in-the-loop setting to alter the recitation w.r.t. specific phenomena. We find that finetuning our TTS model on poetry captures poetic intonation patterns to a large extent which is beneficial for prosody cloning and manipulation and verify the success of our approach both in an objective evaluation as well as in human studies.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源