
Original scientific paper

https://doi.org/10.2478/crdj-2023-0004

Trends and Challenges of Text-to-Image Generation: Sustainability Perspective

Dora Ivezić (ORCID: orcid.org/0009-0002-5761-6677); University of Zagreb, Faculty of Electrical Engineering and Computing *
Marina Bagić Babac (ORCID: orcid.org/0000-0003-4979-2216); University of Zagreb, Faculty of Electrical Engineering and Computing

* Corresponding author.


Full text: English, PDF, 316 KB

pp. 56-77



Abstract

Text-to-image generation is a rapidly growing field that aims to generate images from textual descriptions. This paper provides a comprehensive overview of the latest trends and developments, highlighting their importance and relevance in various domains, such as art, photography, marketing, and learning. The paper describes and compares various text-to-image models and discusses the challenges and limitations of this field. The findings of this paper demonstrate that recent advancements in deep learning and computer vision have led to significant progress in text-to-image models, enabling them to generate high-quality images from textual descriptions. However, challenges remain, such as ensuring that the final products generated by these models are legal and ethically sound. This paper provides insights into these challenges and suggests future directions for this field. In addition, this study emphasises the need for a sustainability-oriented approach in the text-to-image domain. As text-to-image models advance, it is crucial to conscientiously assess their ecological, cultural, and societal impact. Prioritising ethical model use while being mindful of the models' carbon footprint and potential effects on human creativity is essential for sustainable progress.

Keywords

artificial intelligence; natural language processing; text-to-image generation; sustainability; ethical artificial intelligence

Hrčak ID:

310618

URI

https://hrcak.srce.hr/310618

Publication date:

30 June 2023
