In this paper, we describe an approach to segmenting news video based on the perceived shift in content using features spanning multiple modalities. We investigate a number of multimedia features, which serve as potential indicators of a change in story, in order to determine which are the most effective. The efficacy of our approach is demonstrated by the performance of our prototype, where a number of feature combinations demonstrate an up to 18% improvement in WindowDiff score compared to other state of the art story segmenters. In our investigation, there is no, one, clearly superior feature, rather the best segmentation occurs when there is synergy between multiple features. A further investigation into the effect on segmentation performance, while varying the number of training examples versus the number of features used, reveal that having better feature combinations is more important than having more training examples. Our work suggests that it is possible to train robust story segmenters for news video using only a handful of broadcasts, provided a good initial feature selection is made.
Poulisse, G-J, Moens, M-F, Dekens, T & Deschacht, K 2010, 'News story segmentation in multiple modalities', Multimedia Tools and Applications, vol. 48, no. May 2010, pp. 3-22. <http://www.springerlink.com/content/9jq571u23504g631/>
Poulisse, G.-J., Moens, M.-F., Dekens, T., & Deschacht, K. (2010). News story segmentation in multiple modalities. Multimedia Tools and Applications, 48(May 2010), 3-22. http://www.springerlink.com/content/9jq571u23504g631/
@article{9cec32445c7f48a19b45a883de8fcb16,
title = "News story segmentation in multiple modalities",
abstract = "In this paper, we describe an approach to segmenting news video based on the perceived shift in content using features spanning multiple modalities. We investigate a number of multimedia features, which serve as potential indicators of a change in story, in order to determine which are the most effective. The efficacy of our approach is demonstrated by the performance of our prototype, where a number of feature combinations demonstrate an up to 18% improvement in WindowDiff score compared to other state of the art story segmenters. In our investigation, there is no, one, clearly superior feature, rather the best segmentation occurs when there is synergy between multiple features. A further investigation into the effect on segmentation performance, while varying the number of training examples versus the number of features used, reveal that having better feature combinations is more important than having more training examples. Our work suggests that it is possible to train robust story segmenters for news video using only a handful of broadcasts, provided a good initial feature selection is made.",
keywords = "News video segmentation, Story detection, Feature extraction",
author = "Gert-Jan Poulisse and Marie-Francine Moens and Tomas Dekens and Koen Deschacht",
year = "2010",
month = may,
language = "English",
volume = "48",
pages = "3--22",
journal = "Multimedia Tools and Applications",
issn = "1380-7501",
publisher = "Springer Netherlands",
number = "May 2010",
}