ai-image
Generate AI images using OpenAI's gpt-image-1 model with customizable aspect ratios and artistic themes. Use when the user wants to create images, generate artwork, or mentions image generation with specific styles like Ghibli, futuristic, Pixar, oil painting, or Chinese painting.
Install
mkdir -p .claude/skills/ai-image && curl -L -o skill.zip "https://mcp.directory/api/skills/download/256" && unzip -o skill.zip -d .claude/skills/ai-image && rm skill.zip
Installs to .claude/skills/ai-image
About this skill
AI Image Generation Skill
Generate high-quality AI images using OpenAI's gpt-image-1 model with customizable styles and themes.
When to Use This Skill
Use this skill when the user wants to:
- Generate images from text descriptions
- Create artwork with specific artistic styles
- Generate images with particular aspect ratios (vertical, horizontal, square)
- Apply themed visual styles (Studio Ghibli, futuristic, Pixar, oil painting, Chinese painting)
Instructions
- Check for API Key: Verify that the OPENAI_API_KEY environment variable is set
- Gather Requirements: Ask the user for:
  - Image prompt (required)
  - Style/aspect ratio: vertical (1024x1536), horizontal (1536x1024), or square (1024x1024) (optional, defaults to square)
  - Theme: ghibli, futuristic, pixar, oil-paint, or chinese-paint (optional)
  - Output location (optional, defaults to ./generated_image.png)
- Run the CLI: Execute the main.py script with the appropriate parameters
- Report Results: Show the user where the image was saved and any relevant details
Available Options
Aspect Ratios (--style)
- vertical: 1024x1536 pixels (portrait orientation)
- horizontal: 1536x1024 pixels (landscape orientation)
- square: 1024x1024 pixels (default)
Artistic Themes (--theme)
- ghibli: Studio Ghibli animation style with whimsical, dreamlike aesthetics
- futuristic: Sci-fi style with sleek designs and neon lights
- pixar: Vibrant 3D animation style with expressive characters
- oil-paint: Classical oil painting with rich textures and brushstrokes
- chinese-paint: Traditional Chinese ink painting with delicate brushwork
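One way to picture how these options might be resolved is a pair of lookup tables. This is only a sketch: the page does not show how main.py applies a theme, so appending the theme description to the prompt is an assumption, and `SIZES`, `THEMES`, and `resolve_options` are hypothetical names.

```python
# Size strings for each --style value, as listed above.
SIZES = {
    "vertical": "1024x1536",
    "horizontal": "1536x1024",
    "square": "1024x1024",
}

# Theme descriptions copied from the list above.
THEMES = {
    "ghibli": "Studio Ghibli animation style with whimsical, dreamlike aesthetics",
    "futuristic": "Sci-fi style with sleek designs and neon lights",
    "pixar": "Vibrant 3D animation style with expressive characters",
    "oil-paint": "Classical oil painting with rich textures and brushstrokes",
    "chinese-paint": "Traditional Chinese ink painting with delicate brushwork",
}


def resolve_options(prompt, style="square", theme=None):
    """Return (size, full_prompt); raise ValueError on unknown options."""
    if style not in SIZES:
        raise ValueError(f"unknown style {style!r}; choose from {sorted(SIZES)}")
    if theme is not None and theme not in THEMES:
        raise ValueError(f"unknown theme {theme!r}; choose from {sorted(THEMES)}")
    # Assumption: the theme is applied by appending its description to the prompt.
    full_prompt = prompt if theme is None else f"{prompt}, {THEMES[theme]}"
    return SIZES[style], full_prompt


size, full_prompt = resolve_options("a robot in a city", "vertical", "futuristic")
print(size)
print(full_prompt)
```

Validating the style before any API call means an unsupported size is reported locally instead of as an API error.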
Usage Examples
Basic Usage
uv run main.py --prompt "a cat sitting on a tree"
With Style and Theme
uv run main.py --prompt "a sunset over mountains" --style horizontal --theme oil-paint --output ./sunset.png
Futuristic Portrait
uv run main.py --prompt "a robot in a city" --style vertical --theme futuristic --output ./robot.png
Studio Ghibli Landscape
uv run main.py --prompt "a magical forest with spirits" --style horizontal --theme ghibli --output ./forest.png
Setup Requirements
This skill requires an OpenAI API key with access to the gpt-image-1 model:
export OPENAI_API_KEY='your-api-key-here'
Note: Using gpt-image-1 requires organization verification on the OpenAI platform.
Technical Details
- Model: OpenAI gpt-image-1 (released April 2025)
- Response Format: Base64 encoded images (b64_json)
- Supported Sizes: 1024x1024, 1024x1536, 1536x1024
- Maximum Resolution: 1536x1024 (landscape) or 1024x1536 (portrait), the largest of the supported sizes
- Dependencies: openai>=2.7.1
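Because the response carries the image as a Base64 string, it has to be decoded before it can be written to disk. The sketch below shows that step using only the standard library; `save_b64_image` is a hypothetical helper, not necessarily how main.py does it. The commented lines show where the string would typically come from when using the `openai` Python client.

```python
import base64
from pathlib import Path


def save_b64_image(b64_json, output_path="./generated_image.png"):
    """Decode a Base64 image payload and write it to output_path."""
    path = Path(output_path)
    path.parent.mkdir(parents=True, exist_ok=True)  # create missing folders
    path.write_bytes(base64.b64decode(b64_json))
    return path


# Typical source of the payload (requires the openai package and an API key):
#
#   from openai import OpenAI
#   client = OpenAI()  # reads OPENAI_API_KEY from the environment
#   result = client.images.generate(
#       model="gpt-image-1", prompt="a cat sitting on a tree", size="1024x1024"
#   )
#   save_b64_image(result.data[0].b64_json, "./cat.png")
```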
Pricing Information
Usage is priced per token:
- Text input tokens: $5 per million
- Image input tokens: $10 per million
- Image output tokens: $40 per million
Approximate costs per generated image:
- Low quality square: ~$0.02
- Medium quality square: ~$0.07
- High quality square: ~$0.19
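The per-token rates translate into a request cost with simple arithmetic: each token count is multiplied by its rate and divided by one million. The sketch below uses the rates listed above; `estimate_cost` is a hypothetical helper, and the token counts in the example are made-up illustrative values, not measured figures.

```python
# USD per one million tokens, taken from the rates listed above.
PRICE_PER_MILLION = {
    "text": 5.00,
    "image_input": 10.00,
    "image_output": 40.00,
}


def estimate_cost(text_tokens=0, image_input_tokens=0, image_output_tokens=0):
    """Return the estimated cost in USD for one request."""
    total = (
        text_tokens * PRICE_PER_MILLION["text"]
        + image_input_tokens * PRICE_PER_MILLION["image_input"]
        + image_output_tokens * PRICE_PER_MILLION["image_output"]
    )
    return total / 1_000_000


# Illustrative token counts only (assumed, not taken from the API).
cost = estimate_cost(text_tokens=100, image_output_tokens=4000)
print(f"Estimated cost: ${cost:.4f}")
```

Output tokens dominate the total, which is why the per-image cost rises so sharply with quality.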
Troubleshooting
API Key Not Set
If you see "Error: OPENAI_API_KEY environment variable not set", ensure your API key is exported in your shell session.
Organization Not Verified
gpt-image-1 requires organization verification on platform.openai.com. Visit your OpenAI account settings to complete verification.
Invalid Size Error
Ensure you're using one of the supported sizes: 1024x1024, 1024x1536, or 1536x1024.