Media creation with artificial intelligence: Difference between revisions

m
no edit summary
mNo edit summary
mNo edit summary
Line 112: Line 112:


===Workflows===
===Workflows===
====Beginner workflows====
[https://www.nvidia.com/en-us/glossary/multimodal-large-language-models/ Multimodal LLM]s and plugins-using LLMs such as ChatGPT, Copilot, Gemini, Grok and Meta AI provide image generation capabilities.
[https://www.nvidia.com/en-us/glossary/multimodal-large-language-models/ Multimodal LLM]s and plugins-using LLMs such as ChatGPT, Copilot, Gemini, Grok and Meta AI provide image generation capabilities.
* In addition, there are specialized tools (e.g., diffusion-based systems) that offer more control and customization. However, most users already have access to at least one of these platforms and can begin generating images immediately.
* In addition, there are specialized tools (e.g., diffusion-based systems) that offer more control and customization. However, most users already have access to at least one of '''ChatBot''' and can begin generating images immediately.
* For ''high-volume generation'', a paid subscription or plan is typically required.
* For '''high-volume generation''', a '''paid subscription''' or plan is typically '''required'''.


For image editing and post-processing, dedicated graphics software such as [https://www.gimp.org/downloads/ GIMP], [https://krita.org Krita], or [https://www.adobe.com/products/photoshop.html Photoshop] is recommended. These tools allow precise control (e.g., masking, compositing, color correction) and can complement GenAI workflows.
'''For image editing''' and post-processing, '''dedicated graphics software''' such as [https://www.gimp.org/downloads/ GIMP], [https://krita.org Krita], or [https://www.adobe.com/products/photoshop.html Photoshop] '''is recommended'''. These tools allow precise control (e.g., '''masking, compositing, color correction''') and can '''complement GenAI workflows'''.


In general you will always want to takes these steps: Generate, refine via text prompts, select final candidates, refine via graphic tools.
In general you will always want to takes these steps: Generate, refine via text prompts, select final candidates, refine via graphic tools.


====General notes on image upgrading====
In context of its limitations, chatbot-based workflows are most often nonetheless a big '''improvement over pure manual workflows''': They '''speed up prototyping''' and let you explore different creative directions for drafts.
For upgrading a very low quality yet important image you probably want to upgrade specific elements first so details are not hallucinated to an unacceptable level.


text prompt + low quality image + before updated element 1 for reference + before updated element 2 for reference = higher quality image
When you want to work with chatbots, you effectively have to learn [[wp:prompt_engineering|prompt engineering]]: You basically learn how to write ''good prompts''.


====Beginner workflows====
=====General notes on image upgrading=====
In context of its limitations, chatbot-based workflows are still improvements over a pure manual workflows: They speed up prototyping and let you explore different creative directions for drafts.
For upgrading a very low quality yet important image you probably want to '''upgrade specific elements first''' so details are not hallucinated to an unacceptable level.


When you want to work with chatbots, you effectively have to learn [[wp:prompt_engineering|prompt engineering]]: You basically learn how to write ''good prompts''.
text prompt + low quality image + before updated element 1 for reference + before updated element 2 for reference = higher quality image


=====ChatGPT=====
=====ChatGPT=====
8,844

edits