The introduction of diverse text-to-image generation models has sparked significant interest across various sectors. While these models provide the groundbreaking capability to convert textual descriptions into visual data, their widespread usage has ignited concerns over misusing realistic synthesized images. Despite the pressing need, research on detecting such synthetic images remains limited. This paper aims to bridge this gap by evaluating the ability of several existing detectors to detect synthesized images produced by text-to-image generation models. Our research includes testing four popular text-to-image generation models: Stable Diffusion (SD), Latent Diffusion (LD), GLIDE, and DALLâ‹…E-MINI (DM), and leverages two benchmark prompt-image datasets as real images. Additionally, our research focuses on identifying robust, efficient, lightweight detectors to minimize computational resource usage. Recognizing the limitations of current detection approaches, we propose a novel detector grounded in latent space analysis tailored for recognizing text-to-image synthesized visuals. Experimental results demonstrate that the proposed detector not only achieves high prediction accuracy but also exhibits enhanced robustness against image perturbations while maintaining lower computational complexity compared to existing models in detecting text-to-image generated synthetic images.
Yang, Y, Moremada, MWGC & Deligiannis, N 2024, On the detection of images generated from text. in 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings. Proceedings - International Conference on Image Processing, ICIP, IEEE, pp. 3792-3798. https://doi.org/10.1109/ICIP51287.2024.10648177
Yang, Y., Moremada, M. W. G. C., & Deligiannis, N. (2024). On the detection of images generated from text. In 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings (pp. 3792-3798). (Proceedings - International Conference on Image Processing, ICIP). IEEE. https://doi.org/10.1109/ICIP51287.2024.10648177
@inproceedings{a86e705f8f8640bc830c29803de0a4cb,
title = "On the detection of images generated from text",
abstract = "The introduction of diverse text-to-image generation models has sparked significant interest across various sectors. While these models provide the groundbreaking capability to convert textual descriptions into visual data, their widespread usage has ignited concerns over misusing realistic synthesized images. Despite the pressing need, research on detecting such synthetic images remains limited. This paper aims to bridge this gap by evaluating the ability of several existing detectors to detect synthesized images produced by text-to-image generation models. Our research includes testing four popular text-to-image generation models: Stable Diffusion (SD), Latent Diffusion (LD), GLIDE, and DALLâ‹…E-MINI (DM), and leverages two benchmark prompt-image datasets as real images. Additionally, our research focuses on identifying robust, efficient, lightweight detectors to minimize computational resource usage. Recognizing the limitations of current detection approaches, we propose a novel detector grounded in latent space analysis tailored for recognizing text-to-image synthesized visuals. Experimental results demonstrate that the proposed detector not only achieves high prediction accuracy but also exhibits enhanced robustness against image perturbations while maintaining lower computational complexity compared to existing models in detecting text-to-image generated synthetic images.",
author = "Yuqing Yang and Moremada, {Moremada Watte Gedara Charuka} and Nikos Deligiannis",
note = "Funding Information: This research received funding from the FWO (Grant G0A4720N) and the Flemish Government under the\u201DOnderzoeksprogramma Artifici\u00EBle Intel-ligentie (AI) Vlaanderen\u201D programme. The research was also conducted in the context of the CHIST-ERA CON-NET project. Publisher Copyright: {\textcopyright} 2024 IEEE.",
year = "2024",
doi = "10.1109/ICIP51287.2024.10648177",
language = "English",
isbn = "9798350349405",
series = "Proceedings - International Conference on Image Processing, ICIP",
publisher = "IEEE",
pages = "3792--3798",
booktitle = "2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings",
}