In this paper we propose a new algorithm to detect vowels in a speech utterance and infer the rate at which speech was produced. To achieve this we determine a smooth trajectory that corresponds to a high frequency energy envelope, modulated by the low frequency energy content. Peak picking performed on this trajectory gives an estimate of the number of vowels in the utterance. To dispose of falsely detected vowels, a peak pruning post-processing step is incorporated. Experimental results show that the proposed algorithm is more accurate than the two speech rate determination algorithms on which it was inspired.
Dekens, T, Martens, H, Van Nuffelen, G, De Bodt, M & Verhelst, W 2014, Speech Rate Determination by Vowel Detection on the Modulated Energy Envelope. in 2014 Proceedings of the 22nd European Signal Processing Conference (EUSIPCO). European Signal Processing Conference Proceedings, IEEE, pp. 1252-1256, 22nd European Signal Processing Conference, EUSIPCO 2014, Lisbon, Portugal, 1/09/14.
Dekens, T., Martens, H., Van Nuffelen, G., De Bodt, M., & Verhelst, W. (2014). Speech Rate Determination by Vowel Detection on the Modulated Energy Envelope. In 2014 Proceedings of the 22nd European Signal Processing Conference (EUSIPCO) (pp. 1252-1256). (European Signal Processing Conference Proceedings). IEEE.
@inproceedings{c71113e82c954c9199e5c7e830f62322,
title = "Speech Rate Determination by Vowel Detection on the Modulated Energy Envelope",
abstract = "In this paper we propose a new algorithm to detect vowels in a speech utterance and infer the rate at which speech was produced. To achieve this we determine a smooth trajectory that corresponds to a high frequency energy envelope, modulated by the low frequency energy content. Peak picking performed on this trajectory gives an estimate of the number of vowels in the utterance. To dispose of falsely detected vowels, a peak pruning post-processing step is incorporated. Experimental results show that the proposed algorithm is more accurate than the two speech rate determination algorithms on which it was inspired.",
keywords = "speech rate, vowel detection",
author = "Tomas Dekens and Heidi Martens and {Van Nuffelen}, Gwen and {De Bodt}, Marc and Werner Verhelst",
year = "2014",
month = sep,
language = "English",
isbn = "978-0-9928626-1-9",
series = "European Signal Processing Conference Proceedings",
publisher = "IEEE",
pages = "1252--1256",
booktitle = "2014 Proceedings of the 22nd European Signal Processing Conference (EUSIPCO)",
note = "22nd European Signal Processing Conference, EUSIPCO 2014 ; Conference date: 01-09-2014 Through 05-09-2014",
}