Custom Gems

Custom Gems is a technique for controlling image generation in ChatGPT Image 2.0 through structured JSON-formatted prompts processed via the Gemini 2.5 Flash API. Rather than relying solely on natural language instructions, Custom Gems enables users to specify parameters and requirements in a machine-readable format. This approach provides more granular control over the output generation process by leveraging the capabilities of Gemini 2.5 Flash to interpret and execute structured requests.

How It Works

The technique operates by converting user intent into JSON-structured prompts that are processed through Gemini 2.5 Flash before being passed to ChatGPT Image 2.0. The JSON format allows for explicit specification of generation parameters, constraints, and desired characteristics that might be ambiguous or difficult to express in natural language alone. This intermediary processing step enables the system to parse requirements more precisely and translate them into commands that the image generation model can execute with greater accuracy.

Practical Applications

Custom Gems is particularly useful for workflows requiring consistent, reproducible image generation with specific technical or stylistic requirements. Users working on projects that demand precise control over visual output—such as batch generation with uniform parameters or complex multi-condition specifications—can benefit from the structured nature of the JSON approach. The technique bridges the gap between human-readable instructions and machine-executable directives, making it suitable for both automated pipelines and manual image generation tasks.

Source Notes