Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
Vol 9 No 3 (2025): Volume 9, Issue 3, March 2025 | Pages: 67-77
International Research Journal of Innovations in Engineering and Technology
OPEN ACCESS | Research Article | Published Date: 11-03-2025
Text-to-image synthesis is an intriguing field of study that seeks to create visuals from textual descriptions. The primary objective of this domain is to provide visuals that align with the provided written description for both semantic coherence and visual reality. Despite significant advancements in text-to-image synthesis in recent years, it continues to encounter numerous hurdles, primarily concerning picture realism and semantic coherence. To address these challenges, selecting diverse datasets with comprehensive annotations will markedly improve model performance in addressing these difficulties. Datasets with varied visual material and comprehensive textual descriptions aid models in understanding intricate links between text and images, enhancing both semantic coherence and image authenticity. This review paper examines 20 datasets available for text-to-image synthesis, categorizing them by scope, variety, and application domains. The meticulous selection and curation of datasets are crucial for enhancing text-to-image synthesis technology. Ultimately, the careful selection and curation of datasets play a pivotal role in advancing the state-of-the-art in text-to-image synthesis.
Text-to-Image Datasets, Dataset Diversity, Dataset Limitations, Scene Complexity, Generative AI Datasets
Haitham ALHAJI, & Alaa Yaseen Taqa. (2025). Text-to-Image Datasets: Characteristics, Challenges, and Opportunities. International Research Journal of Innovations in Engineering and Technology - IRJIET, 9(3), 67-77. Article DOI https://doi.org/10.47001/IRJIET/2025.903009
This work is licensed under Creative common Attribution Non Commercial 4.0 Internation Licence