Craig Does AI: JSON Prompts for Advanced ChatGPT Image 2.0 Control
Generated: 2026-04-26 · API: Gemini 2.5 Flash · Modes: Summary
Craig Does AI: JSON Prompts for Advanced ChatGPT Image 2.0 Control
Clip title: I Tested ChatGPT’s New Image 2.0 and Accidentally Stumbled Upon an Awesome Workflow Author / channel: Craig Does AI URL: https://www.youtube.com/watch?v=qXUww5tnLHs
Summary
The video introduces a significant discovery regarding image generation with GPT Image 2.0, which the presenter suggests could establish it as a leading tool. Having previously developed a custom “JSON Image Creator V.3” Gem for image prompting, the presenter describes how, during recent testing with GPT Image 2.0, he stumbled upon an unexpected and powerful “hack.” This breakthrough provides enhanced control over image generation and enables advanced capabilities not widely known. The presenter not only demonstrates this functionality but also offers all necessary resources—including the Gem itself, its source files, and a detailed Notion document—for viewers to replicate and experiment with his findings.
The core of the presenter’s method lies in utilizing JSON-structured prompts rather than simple text commands. He explains that JSON prompts offer a superior structure, leading to more precise and consistent image generation. In an initial demonstration, he uses his JSON Image Creator V.3 to generate a detailed prompt for “a group of NASA astronauts on the moon playing kickball, but the ball floats away due to no gravity.” This JSON code, when submitted to ChatGPT (which uses DALL-E 3 for image generation), produces a high-quality, realistic image. Furthermore, ChatGPT’s interface allows for seamless aspect ratio adjustments (e.g., from square to 16:9 landscape or 9:16 portrait) and in-image editing, such as changing the color of the ball or adding elements like a fish to an eagle’s talons, all while maintaining image consistency.
The most exciting revelation, however, is a trick for generating consistent, sequential images, perfect for storyboarding. This advanced feature is exclusively available to paid ChatGPT users and requires activating a “Thinking” mode. By leveraging the initial JSON prompt, users can instruct ChatGPT to create a series of images (ideally around five or six) that tell a continuous story, with each subsequent image drawing consistency from the previously generated one. The presenter showcases this by creating a humorous sequence of a chimpanzee and a miniature donkey-giraffe playing football, transitioning from a yard to a street scene, with the chimp eventually diving and being consoled by the donkey-giraffe.
While this storyboarding hack offers unprecedented narrative capabilities in AI image generation, the presenter notes a limitation: image quality can begin to degrade and introduce “artifacts” after about the fifth or sixth image in a sequence. He suggests a workaround of copying the most consistent image into a new chat to maintain quality for longer narratives. Beyond photographic styles, the system also supports generating illustrations, providing further creative flexibility. Overall, the discovery highlights a powerful, structured approach to AI image creation that significantly enhances control and enables complex narrative development, setting GPT Image 2.0 apart as a formidable tool for creators.
Video Description & Links
Description
GPT Image 2 dropped today and I was testing it with a Gemini Gem I built for structured image prompting. I was not expecting to find what I found.
I stumbled onto something I personally cannot stop thinking about. All the different ways I can use it. And I had to get it out to you as fast as I could.
I have been using Nano Banana 2 since day one and it is still a tool I reach for daily. But after spending this afternoon testing GPTImage 2 with a JSON prompting method, I think there is a new top dog for certain workflows.
In this video you will see the full JSON prompting workflow using the Gemini Gem and GPT Image 2, how the built-in aspect ratio tool instantly reformats any image for YouTube, TikTok, or short form, the trick I discovered that I was not looking for and cannot stop thinking about, and what to watch out for when editing images in the same chat session.
Drop a comment and let me know what you find when you try this. If this helped, a like goes a long way.
✅ Here is the Notion doc with the Gem, source files, and the full breakdown: https://www.notion.so/GPT-Image-2-JSON-Prompting-Workflow-and-Storyboard-Method-34a606421d128009acc7c617695ac68e?source=copy_link
GPTImage2 AIImageGeneration GeminiGem #NotebookLM ChatGPT AIImages ImagePrompting JSONPrompt AIWorkflow AITools CraigDoesAI AIContent GenerativeAI ImageAI Higgsfield Seedance AICreative PromptEngineering AITutorial AIForCreators ContentCreation AIHack GPT4o ArtificialIntelligence AIExperiment
Tags
ai prompts, ai, prompt chatgpt, claude, gemini, chatgpt, learn how to prompt ai, ai prompt engineering, prompt engineering, prompting for beginners, prompting techniques, prompt engineering for beginners, how to write good prompts for ai, how to prompt chatgpt, how to prompt better, write better prompts, ai for beginners, ai for beginners tutorial, ai tips and tricks, prompting tricks, prompting tips, how to use chatgpt, grok, chatgpt prompts, chatgpt tutorial, chat gpt
URLs
Related Concepts
- JSON prompting — Wikipedia
- Image generation control — Wikipedia
- Prompt engineering — Wikipedia
- Structured prompting workflows — Wikipedia
- Custom Gems — Wikipedia
- Sequential image generation — Wikipedia
- AI storyboarding — Wikipedia
- Image consistency — Wikipedia
- In-image editing — Wikipedia
- Aspect ratio adjustment — Wikipedia
- ChatGPT Thinking mode — Wikipedia
- Image artifacts — Wikipedia
- Narrative development — Wikipedia
- Prompt-driven automation — Wikipedia
- Generative AI workflows — Wikipedia
- Multi-image sequences — Wikipedia