Media creation with artificial intelligence
Possibilities and limitations
Generative artificial intelligence (GenAI) can ease and accelerate content creation in all forms and sizes. The barriers to using it or building your own setup will keep dropping with practically every new release from the commercial front-runners.
You can use GenAI via websites, desktop clients or dedicated programs, some of which run entirely locally. To build your own programmatic solutions you need either downloaded AI models or API keys for cloud services that do the heavy computation remotely. (With enough expertise you can even build agentic AIs (such as Open Claw) that use your other tools and carry out tasks automatically. But caution: probabilistic AIs can hallucinate and pose a threat to your system. As a mitigation, MCPs (Model Context Protocol servers) should be used. Sandboxes are even better, but they can limit the agent's usefulness and add an extra layer of complexity, which then eats up the time you intended to save.)
GenAI works probabilistically. Don't expect identical results when you repeat a prompt with the same inputs: the same text prompt will usually generate similar but not identical content. Therefore, in some scenarios you may want to generate many outputs and choose the best candidate for your intermediate or end goal.
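The "generate many, keep the best" idea can be sketched in a few lines of Python. This is only an illustration: `generate_candidate` and `score` are hypothetical stand-ins for a real GenAI call and for whatever quality check you use (often simply a human looking at the results), and the random number merely simulates the variation between runs.

```python
import random

def generate_candidate(prompt, rng):
    """Hypothetical stand-in for one GenAI call: same prompt, varying output."""
    variation = rng.random()
    return {
        "prompt": prompt,
        "content": f"{prompt} (variant {variation:.3f})",
        "variation": variation,
    }

def score(candidate):
    """Hypothetical quality metric; here, closeness of the variation to 0.5."""
    return 1.0 - abs(candidate["variation"] - 0.5)

def best_of_n(prompt, n, seed=None):
    """Generate n candidates for one prompt and return (best, all_candidates)."""
    rng = random.Random(seed)
    candidates = [generate_candidate(prompt, rng) for _ in range(n)]
    return max(candidates, key=score), candidates

# Usage: eight attempts for the same prompt, then pick the highest-scoring one.
best, candidates = best_of_n("castle at sunset", n=8, seed=42)
for candidate in candidates:
    marker = "*" if candidate is best else " "
    print(marker, candidate["content"], f"score={score(candidate):.3f}")
```

The fixed seed only makes this simulation repeatable. Many real services do not expose a seed, so every run has to be treated as a new draw.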
Sounds, voice acting and music
- Voice cloning of existing voices or creation of new ones. For natural-sounding voices you may want to look for emotional text-to-speech.
- Music generation
Image generation
- Content generation based on:
  - Text prompts
  - Own drafts
  - Merging (main image and references)
- Changing existing content
  - Expanding
  - Inpainting (replacement of subsections)
  - Style transfers
3D content generation
There exist content generators that turn 2D data into 3D data by making plausible assumptions about the missing dimension.
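To make "assumptions about the missing dimension" concrete, here is a deliberately crude sketch: it treats the brightness of each pixel of a grayscale image as its height and emits one 3D point per pixel. The function name and the brightness-equals-height rule are assumptions chosen for illustration. Real generators estimate the third dimension with trained models, not with a fixed rule like this.

```python
def heightmap_to_points(gray, max_height=1.0):
    """Turn a 2D grayscale image (rows of 0..255 values) into 3D points.

    Illustrative assumption only: brighter pixels are raised further
    along the z axis, scaled so that 255 maps to max_height.
    """
    points = []
    for y, row in enumerate(gray):
        for x, value in enumerate(row):
            z = (value / 255.0) * max_height
            points.append((float(x), float(y), z))
    return points

# Usage: a tiny 2x2 "image" becomes four points in 3D space.
image = [
    [0, 255],
    [128, 64],
]
for point in heightmap_to_points(image):
    print(point)
```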
Copyright and fair use
Gaming corporations have an interest in not upsetting their gaming communities, as this means bad PR and therefore lower sales of their products. For this reason cosplay, fan art and modding are most often tolerated. The last two may even be actively supported by the companies. Mods can change games in many ways and increase their replay value. As a side effect they improve a player's opinion of the game company, making the player more likely to buy other games from the same company as well. That is why pretty much all companies have community managers on their payroll. Technically, the companies allow their copyrights to be ignored to a limited degree because they profit from it. Also, if you invest enough work of your own, your works can fall under fair use.
Modifications (mods) often fall short of being games in their own right in terms of new content or features added. GenAI has the potential to change that. Creating an endless stream of new content based on old content is not in the interest of the company. When you consider the artists and technicians who helped create the game, things get even more complicated. Mass-produced fan-made content that is created for "free" might endanger the livelihood of these people. Therefore, fan-made content should not surpass the original game, or at least not draw the attention of so many otherwise paying players that the companies suffer negative effects. Game communities should therefore work towards a situation that poses no "competition". As of 2026, that danger is still more hypothetical than real, but it will become relevant one day.
Tools
2D images
All big LLM applications such as ChatGPT, Claude, Gemini, Grok and Mistral support image generation in some form. Of course there are other tools, but you probably already use at least one of these. You therefore already have an account and can use it for image generation right away. For mass production you will probably need a paid subscription ("plan") that allows more images to be created within a given time frame.
For image editing you will also want a specialized graphics tool such as GIMP, Krita or Photoshop (which has GenAI functions itself).
Easy access: ChatGPT (The notes here may work the same way for other well-known LLMs.)
- Prompt exactly what you want (even if it is just "higher quality"). Either it works or it doesn't.
- Merging: When possible, drop both images into the prompt at the same time. Every re-edit of an image costs detail.
- If the context window has gained too much control over the expected output, start a new conversation with a prompt that includes all the accumulated changes you want to make.
- When your subscription allows it, output multiple final images. As every one will be different, you have to choose the best one. You can also merge ("photoshop") multiple final images together by using masks.
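Merging two outputs with a mask boils down to a per-pixel weighted mix. The following minimal sketch shows the arithmetic on tiny grayscale images stored as nested lists. The function name and the data are made up for illustration, and a graphics tool performs the same operation on full-color layers.

```python
def blend_with_mask(image_a, image_b, mask):
    """Merge two equally sized grayscale images (rows of 0..255 values).

    mask holds one weight per pixel: 1.0 keeps image_a, 0.0 keeps image_b,
    values in between mix both (useful for soft edges).
    """
    if not (len(image_a) == len(image_b) == len(mask)):
        raise ValueError("images and mask must have the same height")
    result = []
    for row_a, row_b, row_m in zip(image_a, image_b, mask):
        if not (len(row_a) == len(row_b) == len(row_m)):
            raise ValueError("images and mask must have the same width")
        result.append(
            [round(m * a + (1.0 - m) * b) for a, b, m in zip(row_a, row_b, row_m)]
        )
    return result

# Usage: keep the left column of A, the right column of B, and mix the middle.
image_a = [[200, 200, 200], [200, 200, 200]]
image_b = [[100, 100, 100], [100, 100, 100]]
mask = [[1.0, 0.5, 0.0], [1.0, 0.5, 0.0]]
print(blend_with_mask(image_a, image_b, mask))
# → [[200, 150, 100], [200, 150, 100]]
```

For real image files, the Pillow library offers the same operation as `Image.composite(image1, image2, mask)`.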
(Add some examples here.)
Specialized: Canva
Local solutions: AUTOMATIC1111 (Stable Diffusion web UI), ComfyUI
Videos
Grok (xAI)
Gemini (Google)
Sora (OpenAI)
3D objects
- ...
3D animations
- ...
World generators
- World generator inside Unreal Engine 5
- ...