This short was created almost entirely with AI tools, from concept and visuals to the soundtrack, then edited in DaVinci Resolve. It’s a complete example of a modern AI-powered video production workflow.
1. Prompts in ChatGPT – The Starting Point
The entire process begins in ChatGPT. This is where detailed prompts were created to define each scene: character appearance, camera framing, lighting, motion, and overall mood.
The key was precision — instead of generating random visuals, the goal was to design cinematic, intentional shots ready to be used in a short-form video.
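One way to keep every shot prompt equally precise is to fill the same fields each time. The sketch below is only an illustration: the `ShotPrompt` class, its field names, and the example shot are hypothetical and not the actual prompts used for this short. It simply turns the five elements listed above (character, framing, lighting, motion, mood) into one prompt string.

```python
from dataclasses import dataclass


@dataclass
class ShotPrompt:
    """Hypothetical container for the five elements of one shot."""
    character: str
    framing: str
    lighting: str
    motion: str
    mood: str

    def render(self) -> str:
        # Join the fields into a single prompt line for an image generator.
        return " ".join([
            f"{self.character}.",
            f"Camera: {self.framing}.",
            f"Lighting: {self.lighting}.",
            f"Motion: {self.motion}.",
            f"Mood: {self.mood}.",
        ])


# Example values are made up for illustration.
shot = ShotPrompt(
    character="Athlete in a dark gym, black training gear",
    framing="low-angle medium shot",
    lighting="hard side light with haze",
    motion="slow push-in while the athlete lifts",
    mood="intense, cinematic",
)
prompt = shot.render()
print(prompt)
```

Reusing one template across all shots also helps the later images match stylistically, since only the fields that should differ actually change.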
2. Image Generation in Grok
Based on those prompts, images were generated in Grok. Each image represents a separate shot — with different action, composition, and energy.
This step builds a consistent visual foundation, ensuring that all scenes match stylistically before moving into animation.
3. Video Creation in Grok
Next, the static images were transformed into short video clips — also using Grok.
Each clip includes:
- camera movement
- subtle character motion
- cinematic dynamics
This is where the visuals come to life and start to resemble real film footage.
4. Editing in DaVinci Resolve
All generated clips were assembled in DaVinci Resolve, where the actual video editing took place.
This stage included:
- precise cuts aligned with rhythm
- fast-paced transitions
- building overall flow and intensity
- optimizing timing for the YouTube Shorts format
Editing is critical — even strong AI visuals need proper pacing to become engaging content.
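To illustrate what "cuts aligned with rhythm" can mean in practice, here is a minimal sketch that snaps rough cut times to the nearest beat. It assumes a constant tempo, and the function name, the 120 BPM value, and the 30-second length cap are all hypothetical example values, not settings taken from this project or from DaVinci Resolve.

```python
def snap_cuts_to_beats(rough_cuts, bpm, max_duration):
    """Move each rough cut time (seconds) to the nearest beat.

    Assumes a constant tempo. Cuts that land at 0, past max_duration,
    or on a beat that is already used are dropped.
    """
    beat = 60.0 / bpm  # length of one beat in seconds
    snapped = []
    for t in rough_cuts:
        s = round(round(t / beat) * beat, 3)
        if 0 < s < max_duration and (not snapped or s > snapped[-1]):
            snapped.append(s)
    return snapped


# Hypothetical rough cut points picked by eye, in seconds.
cuts = snap_cuts_to_beats([1.9, 4.2, 6.1, 7.8], bpm=120, max_duration=30.0)
print(cuts)  # [2.0, 4.0, 6.0, 8.0]
```

The resulting times could then be used as marker positions when trimming clips on the timeline, so each cut falls on a beat of the soundtrack.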
5. Music Generated with Suno
The soundtrack was generated using Suno.
Instead of using stock music, the audio was tailored specifically to match the video’s tone:
- cinematic atmosphere
- matching tempo and energy
- seamless integration with visual rhythm
Music plays a major role in short-form content — it drives emotion and enhances viewer retention.
Final Workflow
The entire pipeline looks like this:
ChatGPT (prompts) → Grok (images) → Grok (video) → DaVinci Resolve (editing) → Suno (music)
This project demonstrates how AI can now handle nearly every stage of video production. Instead of traditional filming, the process becomes:
idea → prompt → image → animation → edit → final short
With the right prompts and workflow, it’s possible to create dynamic, high-quality video content without a camera, actors, or a physical set.

Really solid breakdown 🔥
What I like most is that you’re not just showing the final result, but the whole process behind it. People think AI does everything for you, but this clearly shows there’s a lot of skill in crafting prompts and putting it all together to get that cinematic feel 💪
And the training vibe is on point — dynamic, powerful, makes you want to watch more 😄
Definitely looking forward to more behind-the-scenes like this. Super valuable for anyone getting into AI video 🚀