Time selected multiple algorithms for reliable Fo tracking in difficult recording conditions

Authors

  • Philippe Martin LLF, UFR Linguistique, Université Paris Diderot, France Author

DOI:

https://doi.org/10.36505/ExLing-2011/04/0023/000192

Abstract

Prosodic data mining in large spontaneous speech corpora often requires acoustic analysis of recordings done in poor conditions. The most detrimental pertain to the absence first harmonic in voiced segments and the presence of echo. In such cases it is very difficult to separate inter vocalic consonantal voicing from echo, or recover Fo from remaining harmonics. To address these problems, 10 different pitch tracking algorithms were implemented in the software program WinPitch, in order to allow an expert user, guided by an underlying displayed narrow band spectrogram, to select manually the appropriate process to deliver a satisfactory Fo curve for a selected time segment.

 

References

Boersma, P. 1993. Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. Proc. of Institute of Phonetic Sciences of the University of Amsterdam, vol. 17, 97-110, Amsterdam, The Netherlands.

de Cheveigné, A., Kawahara, H. 2002. YIN, a fundamental frequency estimator for speech and music. JASA, 111:1917-1930.

Hess, W. 1983. Pitch Determination of Speech Signals: Algorithms and Devices. Heidelberg, Germany, Springer-Verlag.

Martin, Ph. 1981. Mesure de la fréquence fondamentale par intercorrélation avec une fonction peigne. Proc. 12th JEP, GALF, Montréal.

Martin, Ph. 2000. Peigne et brosse pour Fo : Mesure de la fréquence fondamentale par alignement de spectres séquentiels. Proc. of XXIIIèmes JEP Aussois, France.

Noll, A.M. 1964. Short-time spectrum and cepstrum techniques for vocal-pitch detection. JASA, vol. 36, 2, 296-302.

Rabiner, L.R. 1977. On the use of autocorrelation analysis for pitch detection. IEEE Trans. Acoust, Speech, Signal Processing, VOL. ASSP-25, NO. 1, 1977

Ross, M.J., Shaffer, H.L., Cohen, A., Freudberg, R., Manley, H. J. 1974. Average magnitude difference function pitch extractor. IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-22, 353-362, Oct. 1974.

Praat: www.praat.org

Rhapsodie: Corpus de référence de français parlé, http://rhapsodie.risc.cnrs.fr/fr/

WinPitch: www.winpitch.com

Downloads

Published

01-01-2011

How to Cite

Time selected multiple algorithms for reliable Fo tracking in difficult recording conditions. (2011). Linguistic Proceedings Series, 4(1), 95-98. https://doi.org/10.36505/ExLing-2011/04/0023/000192

Share