In this paper, we present a database with speech in different types of background noises. The speech and noise were recorded with a set of different microphones and including some sensors that pick up the speech vibrations by making contact with the skull, the throat and the ear canal, respectively. As these sensors should be less sensitive to noise sources, our database can be especially useful for investigating the properties of these special microphones and comparing them to those of conventional microphones for applications requiring noise robust speech capturing and processing. In this paper we describe some experiments that were carried out using this database in the field of Voice Activity Detection (VAD). It is shown that the signals of a special microphone such as the throat microphone exhibit a high signal to noise ratio and that this property can be exploited to significantly improve the accuracy of a VAD algorithm.
Dekens, T, Patsis, G, Verhelst, W, Beaugendre, F & Capman, F 2008, A Multi-Sensor Speech Database with Applications towards Robust Speech Processing in Hostile Environments. in The sixth international conference on Language Resources and Evaluation (LREC 2008). ELRA, Finds and Results from the Swedish Cyprus Expedition: A Gender Perspective at the Medelhavsmuseet, Stockholm, Sweden, 21/09/09. <http://www.etro.vub.ac.be/Research/DSSP/PUB_FILES/int_conf/LREC-2008-Dekens.pdf>
Dekens, T., Patsis, G., Verhelst, W., Beaugendre, F., & Capman, F. (2008). A Multi-Sensor Speech Database with Applications towards Robust Speech Processing in Hostile Environments. In The sixth international conference on Language Resources and Evaluation (LREC 2008) ELRA. http://www.etro.vub.ac.be/Research/DSSP/PUB_FILES/int_conf/LREC-2008-Dekens.pdf
@inproceedings{2cad56055f784bd4ab2e46198321cd95,
title = "A Multi-Sensor Speech Database with Applications towards Robust Speech Processing in Hostile Environments",
abstract = "In this paper, we present a database with speech in different types of background noises. The speech and noise were recorded with a set of different microphones and including some sensors that pick up the speech vibrations by making contact with the skull, the throat and the ear canal, respectively. As these sensors should be less sensitive to noise sources, our database can be especially useful for investigating the properties of these special microphones and comparing them to those of conventional microphones for applications requiring noise robust speech capturing and processing. In this paper we describe some experiments that were carried out using this database in the field of Voice Activity Detection (VAD). It is shown that the signals of a special microphone such as the throat microphone exhibit a high signal to noise ratio and that this property can be exploited to significantly improve the accuracy of a VAD algorithm.",
keywords = "multi-sensor speech database, voice activity detector, bone contucted microphone",
author = "Tomas Dekens and Georgios Patsis and Werner Verhelst and Fr{\'e}d{\'e}ric Beaugendre and Fran{\c c}ois Capman",
year = "2008",
month = may,
day = "30",
language = "English",
booktitle = "The sixth international conference on Language Resources and Evaluation (LREC 2008)",
publisher = "ELRA",
note = "Finds and Results from the Swedish Cyprus Expedition: A Gender Perspective at the Medelhavsmuseet ; Conference date: 21-09-2009 Through 25-09-2009",
}