In soundtrack post-production for film, video or television series, it is often necessary or desirable to replace the original dialogues recorded live on the set by re-recorded studio dialogues because the original location recordings are often unsuitable for use in the final soundtrack since they may be corrupted by some kind of background noise or simply because of an unacceptable quality of performance. This {"}dialogue replacement{"} is known to introduce a lot of mismatches between the words the audience perceives and the actual lip and mouth movements in the picture. To resolve this problem, synchronization systems have been developed that allow for automatically replacing the original location recordings with the studio dialogues. However, these systems lack robustness and often deliver time-scaled dialogue that is either insufficiently synchronized with the reference dialogue, of poor quality, or both. In this presentation, we propose both modifications to the basic system for automatic time synchronization as known from the state-of-the-art as well as techniques that improve the robustness of such a system.
Soens, P & Verhelst, W 2008, Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR). in TTCSO Language & S Technology (eds), Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR). Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR). <http://www.ccl.kuleuven.be/CLIF08/CLIFSymposium.html#Split_time_warping_of_speech_for_noise>
Soens, P., & Verhelst, W. (2008). Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR). In T. T. C. S. O. Language, & S. Technology (Eds.), Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR) (Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR)). http://www.ccl.kuleuven.be/CLIF08/CLIFSymposium.html#Split_time_warping_of_speech_for_noise
@inbook{2ff38b1416c74636b6022650259f6a95,
title = "Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR)",
abstract = "In soundtrack post-production for film, video or television series, it is often necessary or desirable to replace the original dialogues recorded live on the set by re-recorded studio dialogues because the original location recordings are often unsuitable for use in the final soundtrack since they may be corrupted by some kind of background noise or simply because of an unacceptable quality of performance. This {"}dialogue replacement{"} is known to introduce a lot of mismatches between the words the audience perceives and the actual lip and mouth movements in the picture. To resolve this problem, synchronization systems have been developed that allow for automatically replacing the original location recordings with the studio dialogues. However, these systems lack robustness and often deliver time-scaled dialogue that is either insufficiently synchronized with the reference dialogue, of poor quality, or both. In this presentation, we propose both modifications to the basic system for automatic time synchronization as known from the state-of-the-art as well as techniques that improve the robustness of such a system.",
keywords = "Automatic Dialogue Replacement, automatic post-synchronization",
author = "Pieter Soens and Werner Verhelst",
note = "The Third CLIF Symposium on Language and Speech Technology",
year = "2008",
month = feb,
day = "5",
language = "English",
series = "Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR)",
editor = "Language, {The Third Clif Symposium On} and Speech Technology",
booktitle = "Split time warping of speech for noise robust and speaker independent Automatic Dialogue Replacement (ADR)",
}