ai-image

99views

5installs

Generate AI images using OpenAI's gpt-image-1 model with customizable aspect ratios and artistic themes. Use when the user wants to create images, generate artwork, or mentions image generation with specific styles like Ghibli, futuristic, Pixar, oil painting, or Chinese painting.

Install

mkdir -p .claude/skills/ai-image && curl -L -o skill.zip "https://mcp.directory/api/skills/download/256" && unzip -o skill.zip -d .claude/skills/ai-image && rm skill.zip

Installs to .claude/skills/ai-image

About this skill

AI Image Generation Skill

Generate high-quality AI images using OpenAI's gpt-image-1 model with customizable styles and themes.

When to Use This Skill

Use this skill when the user wants to:

Generate images from text descriptions
Create artwork with specific artistic styles
Generate images with particular aspect ratios (vertical, horizontal, square)
Apply themed visual styles (Studio Ghibli, futuristic, Pixar, oil painting, Chinese painting)

Instructions

Check for API Key: Verify that the OPENAI_API_KEY environment variable is set
Gather Requirements: Ask the user for:
- Image prompt (required)
- Style/aspect ratio: vertical (1024x1536), horizontal (1536x1024), or square (1024x1024)
- Theme: ghibli, futuristic, pixar, oil-paint, or chinese-paint (optional)
- Output location (optional, defaults to ./generated_image.png)
Run the CLI: Execute the main.py script with the appropriate parameters
Report Results: Show the user where the image was saved and any relevant details

Available Options

Aspect Ratios (--style)

vertical: 1024x1536 pixels (portrait orientation)
horizontal: 1536x1024 pixels (landscape orientation)
square: 1024x1024 pixels (default)

Artistic Themes (--theme)

ghibli: Studio Ghibli animation style with whimsical, dreamlike aesthetics
futuristic: Sci-fi style with sleek designs and neon lights
pixar: Vibrant 3D animation style with expressive characters
oil-paint: Classical oil painting with rich textures and brushstrokes
chinese-paint: Traditional Chinese ink painting with delicate brushwork

Usage Examples

Basic Usage

uv run main.py --prompt "a cat sitting on a tree"

With Style and Theme

uv run main.py --prompt "a sunset over mountains" --style horizontal --theme oil-paint --output ./sunset.png

Futuristic Portrait

uv run main.py --prompt "a robot in a city" --style vertical --theme futuristic --output ./robot.png

Studio Ghibli Landscape

uv run main.py --prompt "a magical forest with spirits" --style horizontal --theme ghibli --output ./forest.png

Setup Requirements

This skill requires an OpenAI API key with access to the gpt-image-1 model:

export OPENAI_API_KEY='your-api-key-here'

Note: Using gpt-image-1 requires organization verification on the OpenAI platform.

Technical Details

Model: OpenAI gpt-image-1 (released April 2025)
Response Format: Base64 encoded images (b64_json)
Supported Sizes: 1024x1024, 1024x1536, 1536x1024
Maximum Resolution: Up to 4096x4096 pixels
Dependencies: openai>=2.7.1

Pricing Information

Usage is priced per token:

Text tokens: $5 per million
Image input tokens: $10 per million
Image output tokens: $40 per million

Approximate costs per generated image:

Low quality square: ~$0.02
Medium quality square: ~$0.07
High quality square: ~$0.19

Troubleshooting

API Key Not Set

If you see "Error: OPENAI_API_KEY environment variable not set", ensure your API key is exported in your shell session.

Organization Not Verified

gpt-image-1 requires organization verification on platform.openai.com. Visit your OpenAI account settings to complete verification.

Invalid Size Error

Ensure you're using one of the supported sizes: 1024x1024, 1024x1536, or 1536x1024.

More by tyrchen

View all skills by tyrchen →

codex-code-review

tyrchen

Perform comprehensive code reviews using OpenAI Codex CLI. This skill should be used when users request code reviews, want to analyze diffs/PRs, need security audits, performance analysis, or want automated code quality feedback. Supports reviewing staged changes, specific files, entire directories, or git diffs.

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

1,5731,370

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

1,1161,191

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

1,4181,109

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

1,197748

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

1,154684

pdf-to-markdown

aliceisjustplaying

Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.

1,318617

Related MCP Servers

Browse all servers

Stability AI

Generate and edit images with Stability AI's powerful ai image generator using advanced Stable Diffusion models for stun

800 tools

VChart

Create interactive visualizations and charts with VChart, a powerful data analysis tool and pie chart maker for flexible

4810 tools

Grok2 Image Generator

Grok2 Image Generator is an AI image generator enabling seamless artificial intelligence images generation via stdio tra

290 tools

GPT Image Generator

Generate and edit images instantly using GPT Image Generator, an advanced AI image generator for creative visual content

180 tools

Placid Image Generator

Placid Image Generator creates dynamic images from templates using Placid's API—ideal for social media posts and marketi

140 tools

Nano Banana (Gemini Image Generator)

Create images instantly with Nano Banana, a free online Gemini AI image generator. Share with public URLs—no downloads n

80 tools

Stay ahead of the MCP ecosystem

Get weekly updates on new skills and servers.

Install

mkdir -p .claude/skills/ai-image && curl -L -o skill.zip "https://mcp.directory/api/skills/download/256" && unzip -o skill.zip -d .claude/skills/ai-image && rm skill.zip

Installs to .claude/skills/ai-image

Stats

Views

Installs

Author

tyrchen

2 skills published

Links

Source Code

ai-image

Install

About this skill

AI Image Generation Skill

When to Use This Skill

Instructions

Available Options

Aspect Ratios (--style)

Artistic Themes (--theme)

Usage Examples

Basic Usage

With Style and Theme

Futuristic Portrait

Studio Ghibli Landscape

Setup Requirements

Technical Details

Pricing Information

Troubleshooting

API Key Not Set

Organization Not Verified

Invalid Size Error

More by tyrchen

codex-code-review

You might also like

flutter-development

ui-ux-pro-max

drawio-diagrams-enhanced

godot

nano-banana-pro

pdf-to-markdown

Related MCP Servers

Stay ahead of the MCP ecosystem