Paper Title
Evaluating Features and Metrics for High-Quality Simulation of Early Vocal Learning of Vowels
Paper Authors
Paper Abstract
The way infants use auditory cues to learn to speak despite the acoustic mismatch of their vocal apparatus is a hot topic of scientific debate. Simulating early vocal learning with articulatory speech synthesis offers a way to gain a deeper understanding of this process. One of the crucial parameters in these simulations is the choice of features and of a metric to evaluate the acoustic error between the synthesised sound and the reference target. We contribute an evaluation of the performance of 40 feature-metric combinations for the task of optimising the production of static vowels with a high-quality articulatory synthesiser. To this end, we assess the usability of the formant error and of the projection of the feature-metric error surface in the normalised F1-F2 formant space. We show that this approach can be used to evaluate the impact of features and metrics and also to offer insight into perceptual results.
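As a rough illustration of the two quantities the abstract mentions, the sketch below computes a formant error in a normalised F1-F2 space and tabulates errors over a small grid of feature-metric combinations. Everything specific in it is an assumption made for illustration and is not taken from the paper: the formant ranges used for min-max normalisation, the two toy features (`linear`, `log`), the two metrics (`euclidean`, `manhattan`), and the function names.

```python
import math
from itertools import product

# Assumed formant ranges in Hz for min-max normalisation (illustrative values,
# not taken from the paper).
F1_RANGE = (200.0, 1000.0)
F2_RANGE = (500.0, 2500.0)


def normalise(value, lo, hi):
    """Map a value from [lo, hi] onto [0, 1]."""
    return (value - lo) / (hi - lo)


def formant_error(synth, target):
    """Euclidean distance between two (F1, F2) pairs in the normalised space."""
    d1 = normalise(synth[0], *F1_RANGE) - normalise(target[0], *F1_RANGE)
    d2 = normalise(synth[1], *F2_RANGE) - normalise(target[1], *F2_RANGE)
    return math.hypot(d1, d2)


# Hypothetical features: each maps a magnitude spectrum to a feature vector.
FEATURES = {
    "linear": lambda spec: list(spec),
    "log": lambda spec: [math.log(x + 1e-12) for x in spec],
}

# Hypothetical metrics: each maps two feature vectors to a scalar error.
METRICS = {
    "euclidean": lambda a, b: math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b))),
    "manhattan": lambda a, b: sum(abs(x - y) for x, y in zip(a, b)),
}


def error_table(synth_spec, target_spec):
    """Acoustic error for every feature-metric combination."""
    return {
        (f_name, m_name): METRICS[m_name](
            FEATURES[f_name](synth_spec), FEATURES[f_name](target_spec)
        )
        for f_name, m_name in product(FEATURES, METRICS)
    }
```

In a simulation of this kind, one would presumably evaluate `error_table` for each candidate articulation against the target vowel and compare how each combination's minimum relates to the point of lowest `formant_error`; the grid here has 2 x 2 entries purely to keep the example short.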