Time selected multiple algorithms for reliable Fo tracking in difficult recording conditions
DOI: https://doi.org/10.36505/ExLing-2011/04/0023/000192
Abstract
Prosodic data mining in large spontaneous speech corpora often requires acoustic analysis of recordings made in poor conditions. The most detrimental conditions are the absence of the first harmonic in voiced segments and the presence of echo. In such cases it is very difficult to separate intervocalic consonantal voicing from echo, or to recover Fo from the remaining harmonics. To address these problems, 10 different pitch tracking algorithms were implemented in the software program WinPitch, allowing an expert user, guided by an underlying narrow-band spectrogram display, to manually select the appropriate algorithm to deliver a satisfactory Fo curve for a selected time segment.
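The idea of choosing a tracking algorithm per time segment can be illustrated with a minimal sketch. This is not WinPitch's implementation: the function names, the two estimators used (a plain autocorrelation peak picker and an average magnitude difference function, both simplified), the frame sizes, and the synthetic test signal are all assumptions made for illustration. The signal contains only the 2nd and 3rd harmonics of 100 Hz, imitating a recording in which the first harmonic is absent.

```python
import math

def _lag_range(n, sample_rate, fmin, fmax):
    # Candidate pitch periods, in samples, for the Fo search range.
    return int(sample_rate / fmax), min(int(sample_rate / fmin), n - 1)

def autocorr_f0(frame, sample_rate, fmin=75.0, fmax=500.0):
    """Estimate Fo (Hz) from the lag of the largest autocorrelation value."""
    n = len(frame)
    min_lag, max_lag = _lag_range(n, sample_rate, fmin, fmax)
    best_lag, best_val = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        val = sum(frame[i] * frame[i + lag] for i in range(n - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    # 0.0 is returned when no positive peak was found.
    return sample_rate / best_lag if best_lag else 0.0

def amdf_f0(frame, sample_rate, fmin=75.0, fmax=500.0):
    """Estimate Fo (Hz) from the deepest valley of the AMDF."""
    n = len(frame)
    min_lag, max_lag = _lag_range(n, sample_rate, fmin, fmax)
    best_lag, best_val = 0, math.inf
    for lag in range(min_lag, max_lag + 1):
        val = sum(abs(frame[i] - frame[i + lag]) for i in range(n - lag)) / (n - lag)
        if val < best_val - 1e-9:  # on near-ties, keep the shorter lag
            best_lag, best_val = lag, val
    return sample_rate / best_lag if best_lag else 0.0

# Hypothetical registry of available trackers (the paper mentions 10).
ESTIMATORS = {"autocorrelation": autocorr_f0, "amdf": amdf_f0}

def track_f0(signal, sample_rate, selections, frame_len=400, hop=200):
    """Return (time_s, estimator_name, f0_hz) for each frame whose centre
    falls inside a user-selected segment (start_s, end_s, estimator_name)."""
    curve = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        t = (start + frame_len / 2) / sample_rate
        for seg_start, seg_end, name in selections:
            if seg_start <= t < seg_end:
                frame = signal[start:start + frame_len]
                curve.append((t, name, ESTIMATORS[name](frame, sample_rate)))
                break
    return curve

# Synthetic signal: harmonics 2 and 3 of a 100 Hz voice, no first harmonic.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 200 * i / sample_rate)
          + math.sin(2 * math.pi * 300 * i / sample_rate)
          for i in range(1600)]

# The "expert user" picks one algorithm per time segment.
selections = [(0.0, 0.1, "autocorrelation"), (0.1, 0.2, "amdf")]
curve = track_f0(signal, sample_rate, selections)
for t, name, f0 in curve:
    print(f"{t:.3f} s  {name:<15}  {f0:.1f} Hz")
```

On this clean synthetic signal both estimators recover the common period of the remaining harmonics. With echo or noise they can disagree, which is the situation in which a per-segment choice guided by the spectrogram becomes useful.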
References
Boersma, P. 1993. Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. Proc. of Institute of Phonetic Sciences of the University of Amsterdam, vol. 17, 97-110, Amsterdam, The Netherlands.
de Cheveigné, A., Kawahara, H. 2002. YIN, a fundamental frequency estimator for speech and music. JASA, 111:1917-1930.
Hess, W. 1983. Pitch Determination of Speech Signals: Algorithms and Devices. Heidelberg, Germany, Springer-Verlag.
Martin, Ph. 1981. Mesure de la fréquence fondamentale par intercorrélation avec une fonction peigne. Proc. 12th JEP, GALF, Montréal.
Martin, Ph. 2000. Peigne et brosse pour Fo : Mesure de la fréquence fondamentale par alignement de spectres séquentiels. Proc. of XXIIIèmes JEP Aussois, France.
Noll, A.M. 1964. Short-time spectrum and cepstrum techniques for vocal-pitch detection. JASA, vol. 36, 2, 296-302.
Rabiner, L.R. 1977. On the use of autocorrelation analysis for pitch detection. IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-25, no. 1.
Ross, M.J., Shaffer, H.L., Cohen, A., Freudberg, R., Manley, H.J. 1974. Average magnitude difference function pitch extractor. IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-22, 353-362.
Praat: www.praat.org
Rhapsodie: Corpus de référence de français parlé, http://rhapsodie.risc.cnrs.fr/fr/
WinPitch: www.winpitch.com
License
Articles are published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.