Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques

Paper title

Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques

Authors

Thomas Drugman, Thomas Dubuisson, Alexis Moinet, Nicolas D'Alessandro, Thierry Dutoit

Abstract

This paper addresses the problem of estimating the voice source directly from speech waveforms. A novel principle based on Anticausality Dominated Regions (ACDR) is used to estimate the glottal open phase. This technique is compared to two other well-known state-of-the-art methods, namely the Zeros of the Z-Transform (ZZT) and the Iterative Adaptive Inverse Filtering (IAIF) algorithms. Decomposition quality is assessed on synthetic signals through two objective measures: the spectral distortion and a glottal formant determination rate. Technique robustness is tested by analyzing the influence of noise and of Glottal Closure Instant (GCI) location errors. In addition, the impacts of the fundamental frequency and the first formant on performance are evaluated. The proposed approach shows a significant improvement in robustness, which could be of great interest when decomposing real speech.
