This paper targets the Bipolar Disorder Challenge (BDC) task of Audio Visual Emotion Challenge (AVEC) 2018. Firstly, two novel features are proposed: 1) a histogram based arousal feature, in which the continuous arousal values are estimated from the audio cues by a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) model; 2) a Histogram of Displacement (HDR) based upper body posture feature, which characterizes the displacement and velocity of the key body points in the video segment. In addition, we propose a multi-stream bipolar disorder classification framework with Deep Neural Networks (DNNs) and a Random Forest, and adopt the ensemble learning strategy to alleviate the possible over-fitting problem due to the limited training data. Experimental results show that the proposed arousal feature and upper body posture feature are discriminative for different bipolar episodes, and our proposed framework achieves promising classification results on the development set, with the unweighted average recall (UAR) of 0.714, which is higher than the baseline result 0.635. On test set evaluation, our system obtains the same UAR (0.574) as the challenge baseline.
Yang, L, Li, Y, Chen, H, Jiang, D, Oveneke, MC & Sahli, H 2018, Bipolar Disorder Recognition with Histogram Features of Arousal and Body Gestures. in AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018. AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018, ACM, New York, NY, USA, pp. 15-21. https://doi.org/10.1145/3266302.3266308
Yang, L., Li, Y., Chen, H., Jiang, D., Oveneke, M. C., & Sahli, H. (2018). Bipolar Disorder Recognition with Histogram Features of Arousal and Body Gestures. In AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018 (pp. 15-21). (AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018). ACM. https://doi.org/10.1145/3266302.3266308
@inproceedings{072b21c9cc6c475ab08340026941c2fa,
title = "Bipolar Disorder Recognition with Histogram Features of Arousal and Body Gestures",
abstract = "This paper targets the Bipolar Disorder Challenge (BDC) task of Audio Visual Emotion Challenge (AVEC) 2018. Firstly, two novel features are proposed: 1) a histogram based arousal feature, in which the continuous arousal values are estimated from the audio cues by a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) model; 2) a Histogram of Displacement (HDR) based upper body posture feature, which characterizes the displacement and velocity of the key body points in the video segment. In addition, we propose a multi-stream bipolar disorder classification framework with Deep Neural Networks (DNNs) and a Random Forest, and adopt the ensemble learning strategy to alleviate the possible over-fitting problem due to the limited training data. Experimental results show that the proposed arousal feature and upper body posture feature are discriminative for different bipolar episodes, and our proposed framework achieves promising classification results on the development set, with the unweighted average recall (UAR) of 0.714, which is higher than the baseline result 0.635. On test set evaluation, our system obtains the same UAR (0.574) as the challenge baseline.",
keywords = "Arousal, Bipolar disorder, Ensemble learning, Histogram of displacement range, Model fusion",
author = "Le Yang and Yan Li and Haifeng Chen and Dongmei Jiang and Oveneke, {Meshia C{\'e}dric} and Hichem Sahli",
year = "2018",
month = oct,
day = "15",
doi = "10.1145/3266302.3266308",
language = "English",
isbn = "978-1-4503-5983-2",
series = "AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018",
publisher = "ACM",
pages = "15--21",
booktitle = "AVEC 2018 - Proceedings of the 2018 Audio/Visual Emotion Challenge and Workshop, co-located with MM 2018",
}