A reusable agent skill for using OpenAI GPT Image models in design and software-building workflows.
It helps coding agents turn a user objective into low-cost visual exploration, tracked iteration, and polished final image assets before moving into frontend implementation, direct publishing, or optional structured design tools.
- Generates design references and production image assets with
gpt-image-2 - Edits existing images and multi-image references
- Supports Responses API image iteration with
previous_response_idandimage_generation_call_id - Saves outputs into a topic-organized image memory palace
- Writes metadata manifests with prompts, model, size, quality, latency, request IDs, and estimated cost
- Provides prompting patterns for UI mockups, ads, product mockups, diagrams, and edits
- Encourages cheap first-pass exploration before upgrading only the strongest directions to higher quality
Use this skill when you want an agent to:
- create visual references before building UI
- generate launch graphics, model cards, social visuals, or product mockups
- make several image variations and keep the prompt history organized
- edit an image while preserving layout, brand, identity, or structure
- keep a reusable local archive of generated images and manifests
- hand off a visual direction into frontend code or an optional structured design tool
Install with the skills CLI:
npx skills add Ashish-Soni08/gpt-image-design-workflowsRestart your agent after installing so the skill metadata is rediscovered.
Clone or copy this repository into your Codex skills directory:
~/.codex/skills/gpt-image-design-workflows
Windows example:
git clone https://github.com/Ashish-Soni08/gpt-image-design-workflows.git `
C:/Users/<you>/.codex/skills/gpt-image-design-workflowsRestart Codex after installing.
Install with the skills CLI:
npx skills add Ashish-Soni08/gpt-image-design-workflowsOr clone/copy into:
~/.agents/skills/gpt-image-design-workflows
Restart Claude Code after installing.
- Node.js 22 or newer recommended
- An OpenAI API key with access to GPT Image models
Set your OpenAI API key in the environment. Do not hard-code it in files.
PowerShell, current session:
$env:OPENAI_API_KEY="your_api_key_here"PowerShell, persistent Windows user variable:
[Environment]::SetEnvironmentVariable("OPENAI_API_KEY", "your_api_key_here", "User")Restart your terminal or agent after setting a persistent variable.
macOS/Linux:
export OPENAI_API_KEY="your_api_key_here"By default, images are saved under:
./image-memory-palace
Override this with your preferred long-term archive. For example:
--memory-root D:/Github/design/image-memory-palaceor:
$env:IMAGE_MEMORY_PALACE_ROOT="D:/Github/design/image-memory-palace"Each topic receives a folder and manifest:
image-memory-palace/
ai-builder-dashboard/
20260501-114544_builder-blueprint_generate_low_1536x1024_v01-of-02.png
manifest.json
_index/
global-manifest.json
Output filenames use:
YYYYMMDD-HHMMSS_label_mode_quality_size_v01-of-02.png
Use --label to name a run or creative direction. If --label is omitted, the script falls back to the topic slug.
The core loop is:
objective -> brief -> low-cost variants -> evaluate -> selective refinement -> final asset or handoff
Use a budgeted approval model for smooth autonomy:
- If the user gives a broad objective, create a concise brief and propose a small exploration budget.
- Before the first paid API call, summarize model, mode, size, quality, variant count, memory topic, and estimated cost when available.
- Ask for approval unless the user already gave a specific generation/edit command.
- After approval, continue without asking again while staying inside the approved budget.
- Pause before exceeding budget, increasing variant count, using
quality high, using experimental sizes above2560x1440, editing sensitive/private images, publishing assets, or making uncertain-cost API calls.
The routing rule is:
Need a shareable visual artifact? Use GPT Image directly.
Need implementation-ready UI? Use GPT Image -> optional design-tool handoff -> code.
For direct artifacts, generate 2-8 candidates, pick a winner, then refine with edits or Responses API iteration.
For app screens, generate concepts first, then rebuild the best direction structurally in code or in whatever design tool is available.
Quality defaults:
low: fast exploration, batch variants, cheap previews.medium: default for strong design references and marketing concepts.high: dense text, diagrams, charts, final assets, high-resolution outputs, or identity-sensitive edits.
For gpt-image-2, size must use WIDTHxHEIGHT, both edges must be multiples of 16, max edge must be under 3840, aspect ratio must be at most 3:1, and total pixels must be between 655,360 and 8,294,400. Treat sizes above 2560x1440 as experimental.
node scripts/gpt_image_workflow.mjs `
--mode generate `
--topic ai-builder-dashboard `
--label builder-blueprint `
--prompt "Create a realistic production UI reference for an AI builder dashboard. Dense but elegant, practical controls, clear hierarchy, realistic app content, no marketing hero." `
--size 1536x1024 `
--quality medium `
--n 3node scripts/gpt_image_workflow.mjs `
--mode edit `
--topic ai-builder-dashboard `
--label premium-density-edit `
--image D:/path/to/reference.png `
--prompt "Preserve the layout and navigation. Make the visual design more premium, calmer, and more information dense." `
--size 1536x1024 `
--quality mediumUse this when you want to keep refining the same generated image.
node scripts/gpt_image_workflow.mjs `
--mode responses-generate `
--topic ai-builder-dashboard `
--prompt "Generate a polished dashboard reference for an AI app builder." `
--response-model gpt-5.5 `
--quality mediumThen use the saved response_id or image_generation_call_id from the manifest:
node scripts/gpt_image_workflow.mjs `
--mode responses-edit `
--topic ai-builder-dashboard `
--prompt "Preserve layout. Make the interface denser and more premium." `
--previous-response-id resp_... `
--quality mediumUse dry-run mode to verify manifests and folder creation without calling the API:
node scripts/gpt_image_workflow.mjs `
--dry-run `
--mode generate `
--topic skill-test `
--label dry-run-smoke `
--prompt "Create a polished UI reference." `
--size 1024x1024 `
--quality lowYou can also syntax-check the runtime script locally:
node --check scripts/gpt_image_workflow.mjsUse --n to generate multiple candidates in one request:
node scripts/gpt_image_workflow.mjs `
--mode generate `
--topic model-card-launch `
--label editorial-model-card `
--prompt "Create a premium editorial model card for a new AI model release." `
--size 1536x1024 `
--quality medium `
--n 4The script records every image in the topic manifest and the global index.
This skill does not require Paper MCP, Pencil MCP, or any design MCP server. Generated images can be used directly as implementation references or production assets.
If a compatible design tool is available, use generated images as references, not final UI. A good optional workflow is:
brief -> GPT Image reference -> optional structured design tool -> app implementation -> browser QA
Examples of optional design-tool handoffs:
- Paper MCP (
https://paper.design/docs/mcp): rebuild the reference structurally in Paper, inspect selected frames, use screenshots/JSX/styles, and support design-to-code handoff. Local images can be referenced aspaper-asset://D:/Github/design/image-memory-palace/<topic>/<image>.png. - Pencil (
https://www.pencil.dev/): use the reference as visual direction for canvas-to-code workflows, webpages, components, layouts,.pendesign files, and design/code iteration. - No design MCP: implement directly in the app from the saved image and manifest.
- Set
OPENAI_API_KEYin your environment before making real API calls. - Do not store API keys in the repository, skill files, manifests, examples, or prompts.
- Never commit generated private/customer images unless you intend to publish them.
- For
gpt-image-2, omitinput_fidelity; the model already processes image inputs at high fidelity. gpt-image-2does not support transparent backgrounds.- Cost is reported as an estimate unless API usage data is available.
- Start with
quality lowfor exploration, then upgrade only selected winners tomediumorhigh.