Real-world data processing problems often involve multiple image modalities associated with the same scene, such as RGB, infrared, or multi-spectral images. The fact that different image modalities often share certain attributes, such as edges, textures, and other structural primitives, represents an opportunity to enhance various image processing tasks. This paper proposes a new approach to constructing a high-resolution (HR) version of a low-resolution (LR) image given another HR image modality as guidance, based on joint sparse representations induced by coupled dictionaries. The proposed approach captures complex dependencies between different image modalities, including similarities and disparities, in a learned sparse feature domain in lieu of the original image domain. It consists of two phases: a coupled dictionary learning phase and a coupled super-resolution phase. The learning phase learns a set of dictionaries from a training dataset that couple the different image modalities in the sparse feature domain. In turn, the super-resolution phase leverages these dictionaries to construct an HR version of the LR target image, using the other related image modality as guidance. In the advanced version of the approach, a multi-stage strategy and a neighbourhood regression scheme are introduced to further improve the model capacity and performance. Extensive guided image super-resolution experiments on real multimodal images demonstrate that the proposed approach offers distinct advantages over state-of-the-art approaches, for example, overcoming the texture-copying artifacts that commonly result from inconsistencies between the guidance and target images. Of particular relevance, the proposed model exhibits much better robustness than competing deep models in a range of noisy scenarios.
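To make the two-phase pipeline concrete, the sketch below illustrates the coupled super-resolution phase in Python. It is a hypothetical simplification, not the authors' implementation: it assumes three pre-learned dictionaries (D_lr for LR target patches, D_g for HR guidance patches, D_hr for HR target patches) whose atoms are coupled index-wise through a single shared sparse code, whereas the paper's model additionally separates common from modality-specific components; scikit-learn's orthogonal matching pursuit stands in for the paper's sparse solver.

    # Minimal sketch of the coupled super-resolution phase (assumed model:
    # one sparse code shared across modalities; NOT the authors' code).
    import numpy as np
    from sklearn.linear_model import orthogonal_mp

    def guided_sr_patch(x_lr, y_guide, D_lr, D_g, D_hr, sparsity=8):
        """Recover one HR target patch from an LR target patch and an
        HR guidance patch.

        x_lr    : (m,)   vectorised LR target patch
        y_guide : (n,)   vectorised HR guidance patch
        D_lr    : (m, K) dictionary coupled to the LR target modality
        D_g     : (n, K) dictionary coupled to the guidance modality
        D_hr    : (p, K) dictionary coupled to the HR target modality
        The K atoms are aligned across the dictionaries, so a code
        estimated from the observed pair transfers to the HR target domain.
        """
        # Jointly sparse-code the observed pair over the stacked dictionaries.
        D_joint = np.vstack([D_lr, D_g])               # (m + n, K)
        observation = np.concatenate([x_lr, y_guide])  # (m + n,)
        z = orthogonal_mp(D_joint, observation, n_nonzero_coefs=sparsity)
        # Applying the shared code to the HR dictionary yields the HR patch.
        return D_hr @ z

A full reconstruction would apply this per overlapping patch and average the overlaps; the learning phase would fit the coupled dictionaries jointly over paired multimodal training patches, e.g. by alternating sparse coding and dictionary updates.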
Song, P, Deng, X, Mota, J, Deligiannis, N, Dragotti, PL & Rodrigues, M 2019, 'Multimodal Image Super-resolution via Joint Sparse Representations induced by Coupled Dictionaries', IEEE Transactions on Computational Imaging, vol. 6, pp. 57-72. https://doi.org/10.1109/TCI.2019.2916502
Song, P., Deng, X., Mota, J., Deligiannis, N., Dragotti, P. L., & Rodrigues, M. (2019). Multimodal Image Super-resolution via Joint Sparse Representations induced by Coupled Dictionaries. IEEE Transactions on Computational Imaging, 6, 57-72. https://doi.org/10.1109/TCI.2019.2916502
@article{song2019multimodal,
title = "Multimodal Image Super-resolution via Joint Sparse Representations induced by Coupled Dictionaries",
keywords = "Multimodal image super-resolution, coupled dictionary learning, joint sparse representation, side information",
author = "Pingfan Song and Xin Deng and Jo{\~a}o Mota and Nikolaos Deligiannis and Pier Luigi Dragotti and Miguel Rodrigues",
year = "2019",
doi = "10.1109/TCI.2019.2916502",
language = "English",
volume = "6",
pages = "57--72",
journal = "IEEE Transactions on Computational Imaging",
issn = "2333-9403",
publisher = "IEEE",
}