Original scientific paper
https://doi.org/10.2478/crdj-2023-0004
Trends and Challenges of Text-to-Image Generation: Sustainability Perspective
Dora Ivezić
orcid.org/0009-0002-5761-6677
; University of Zagreb, Faculty of Electrical Engineering and Computing
*
Marina Bagić Babac
orcid.org/0000-0003-4979-2216
; University of Zagreb, Faculty of Electrical Engineering and Computing
* Corresponding author.
Abstract
Text-to-image generation is a rapidly growing field that aims to generate images from textual descriptions. This paper provides a comprehensive overview of the latest trends and developments, highlighting their importance and relevance in various domains, such as art, photography, marketing, and learning. The paper describes and compares various text-to-image models and discusses the challenges and limitations of this field. The findings of this paper demonstrate that recent advancements in deep learning and computer vision have led to significant progress in text-to-image models, enabling them to generate high-quality images from textual descriptions. However, challenges such as ensuring the legality and ethical implications of the final products generated by these models need to be addressed. This paper provides insights into these challenges and suggests future directions for this field. In addition, this study emphasises the need for a sustainability-oriented approach in the text-to-image domain. As text-to-image models advance, it is crucial to conscientiously assess their impact on ecological, cultural, and societal dimensions. Prioritising ethical model use while being mindful of their carbon footprint and potential effects on human creativity becomes crucial for sustainable progress.
Keywords
artificial intelligence; natural language processing; text-to-image generation; sustainability; ethical artificial intelligence
Hrčak ID:
310618
URI
Publication date:
30.6.2023.
Visits: 910 *