
Gemini Nanobanana (Image Generation)
Integrates Google's Gemini 2.5 Flash Image API into Claude conversations, allowing you to generate images from text prompts directly in chat.
Integrates with Google's Gemini 2.5 Flash Image API to provide text-to-image generation, single image editing with prompts, multi-image composition, and style transfer capabilities with automatic file saving and collision handling.
What it does
- Generate images from text descriptions
- Edit existing images with text prompts
- Compose multiple images together
- Apply style transfer to images
- Save generated images with collision handling
Best for
About Gemini Nanobanana (Image Generation)
Gemini Nanobanana (Image Generation) is a community-built MCP server published by junhan2 that provides AI assistants with tools and capabilities via the Model Context Protocol. Gemini Nanobanana: AI image generator for creating, editing, and composing stunning images using advanced artificial int It is categorized under ai ml.
How to install
You can install Gemini Nanobanana (Image Generation) in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.
License
Gemini Nanobanana (Image Generation) is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.
π¨ Gemini Nanobanana MCP
Generate images from text with Claude! Simply type "Draw a cute cat" and get instant AI-generated images.
A beginner-friendly Model Context Protocol (MCP) server that brings Google's Gemini 2.5 Flash Image generation directly into your Claude conversations.
Quick Start - Just 3 Steps
1οΈβ£ Get Your API Key (1 minute)
- Visit Google AI Studio
- Sign in with your Google account
- Click "Create API key" β Copy the key
2οΈβ£ Install in Your Claude Client (2 minutes)
Claude Desktop (Windows)
- Open Notepad
- Copy this code and replace
YOUR_API_KEYwith your actual key:
{
"mcpServers": {
"gemini-nanobanana-mcp": {
"command": "npx",
"args": ["gemini-nanobanana-mcp@latest"],
"env": {
"GEMINI_API_KEY": "YOUR_API_KEY"
}
}
}
}
- Save as:
%APPDATA%/Claude/claude_desktop_config.json - Restart Claude Desktop
Claude Desktop (Mac)
- Open Terminal (search "Terminal" in Spotlight)
- Run this command (replace
YOUR_API_KEY):
cat > ~/Library/Application\ Support/Claude/claude_desktop_config.json << 'EOF'
{
"mcpServers": {
"gemini-nanobanana-mcp": {
"command": "npx",
"args": ["gemini-nanobanana-mcp@latest"],
"env": {
"GEMINI_API_KEY": "YOUR_API_KEY"
}
}
}
}
EOF
- Restart Claude Desktop
Claude Code (Easiest!)
Just run this one command in your terminal (replace YOUR_API_KEY):
claude mcp add gemini-nanobanana-mcp -s user -e GEMINI_API_KEY="YOUR_API_KEY" -- npx -y gemini-nanobanana-mcp@latest
Cursor
- Go to
Cursor SettingsβMCPβAdd new MCP Server - Fill in:
- Name:
gemini-nanobanana-mcp - Command:
npx - Args:
gemini-nanobanana-mcp@latest - Environment Variables:
GEMINI_API_KEY=YOUR_API_KEY
- Name:
- Restart Cursor
3οΈβ£ Start Creating! (0 minutes)
Try these in Claude:
- "Generate a cute puppy playing in a garden"
- "Create a beautiful sunset over mountains"
- "Draw a red sports car"
- "Make an abstract colorful painting"
Your images automatically save to ~/Downloads/gemini-images/
See It In Action
Basic Usage
You: Generate a cozy coffee shop interior
Claude: [Generating image...]
Image generated and saved to: ~/Downloads/gemini-images/generate-2025-01-09-14-30-45.png
Size: 1.2MB | Format: PNG
Custom Save Location
You: Create a sunset landscape and save it as ./my-sunset.png
Claude: Image saved to: ./my-sunset.png
What You Can Do
Text-to-Image Generation
Create any image you can imagine from a text description.
Examples:
"A majestic dragon flying over a medieval castle""Modern minimalist living room with plants""Vintage bicycle on a cobblestone street"
Image Editing
Edit existing images with natural language instructions.
How to use:
- Upload an image to Claude
- Say:
"Make this image black and white" - Or:
"Add a sunset background to this photo"
Image Composition
Combine multiple images into one creative composition.
How to use:
- Upload 2-10 images to Claude
- Say:
"Combine these images into a collage" - Or:
"Blend these photos together artistically"
Style Transfer
Apply the artistic style of one image to another.
How to use:
- Upload two images: a content image and a style reference
- Say:
"Apply the style of the second image to the first"
Configuration Options
Environment Variables
| Variable | Default | Description |
|---|---|---|
GEMINI_API_KEY | Required | Your Google AI Studio API key |
AUTO_SAVE | true | Automatically save images when no path specified |
DEFAULT_SAVE_DIR | ~/Downloads/gemini-images | Default directory for saved images |
LOG_LEVEL | info | Logging level (error, warn, info, debug) |
Example with custom settings:
{
"mcpServers": {
"gemini-nanobanana-mcp": {
"command": "npx",
"args": ["gemini-nanobanana-mcp@latest"],
"env": {
"GEMINI_API_KEY": "your-api-key",
"AUTO_SAVE": "true",
"DEFAULT_SAVE_DIR": "~/Pictures/AI-Images",
"LOG_LEVEL": "debug"
}
}
}
}
Disable Auto-Save
To only save when you explicitly request it:
{
"env": {
"GEMINI_API_KEY": "your-api-key",
"AUTO_SAVE": "false"
}
}
Then images will only appear in the chat without saving to disk.
Instant Image Preview (Claude Code)
Want images to open automatically after generation? Set up Claude Code hooks for instant Quick Look previews!
One-Click Setup (Mac)
# Clone this repo and run the installer
git clone https://github.com/nanobanana/nanobanana-mcp.git
cd nanobanana-mcp
bash hooks/install.sh
What You Get
- Instant Preview: Generated images open automatically in Quick Look
- Zero Manual Work: No more finding and opening files
- Smart Detection: Only triggers for nanobanana image tools
- Press Space to Close: Standard Quick Look controls
Full setup guide: hooks/README.md
Troubleshooting
β "GEMINI_API_KEY not set" error
Solution:
- Double-check you replaced
YOUR_API_KEYwith your actual API key - Make sure there are no extra spaces around the key
- Restart your Claude client completely
- Verify your API key works at Google AI Studio
"No such file or directory" error
Solution:
- Install Node.js from nodejs.org (choose LTS version)
- Restart your terminal/Claude client
- Try the installation again
Images not generating
Checklist:
- API key correctly set?
- Internet connection working?
- Restart Claude after configuration?
- Try a simple prompt: "Generate a blue circle"
Images not saving automatically
Solution:
Check your configuration has AUTO_SAVE: "true" (default behavior).
If you want to disable auto-save, set it to "false".
Hook setup not working
Common fixes:
- Make sure you're using Claude Code (not Claude Desktop)
- Run the installer from the nanobanana-mcp directory
- Restart Claude Code after installation
- Check hooks/README.md for detailed troubleshooting
Tips for Better Images
Prompt Writing Tips
- Be specific: "A golden retriever puppy" vs "A dog"
- Include style: "in watercolor style", "photorealistic", "cartoon style"
- Add details: "with blue eyes", "in a sunny garden", "wearing a red collar"
- Set the mood: "cozy", "dramatic", "peaceful", "energetic"
Technical Details
- Supported formats: PNG, JPEG, WebP, GIF
- Default output: PNG format
- Image size: Optimized for quality and reasonable file size
- Rate limits: Managed automatically with retry logic
π Advanced Features
π HTTP Mode (for integrations)
Run as an HTTP server instead of stdio:
MCP_TRANSPORT=http MCP_HTTP_PORT=8080 npx gemini-nanobanana-mcp@latest
Access at http://localhost:8080/mcp
π Debug Logging
Enable detailed logging:
{
"env": {
"GEMINI_API_KEY": "your-key",
"LOG_LEVEL": "debug"
}
}
π‘ Need Help?
- π Quick Setup Guide: QUICK_START.md
- πͺ Auto-Preview Setup: hooks/README.md
- π Report Issues: GitHub Issues
- π¬ Feature Requests: Welcome!
π€ Contributing
Found a bug? Have a feature idea? Contributions are welcome!
- Fork the repository
- Create your feature branch
- Make your changes
- Submit a pull request
π License
MIT License - feel free to use this in your own projects!
β If this helped you, please star the repository on GitHub!
Built with β€οΈ for the Claude community
Alternatives
Related Skills
Browse all skillsGenerate or edit images using Google Gemini API via nanobanana. Use when the user asks to create, generate, edit images with nanobanana, or mentions image generation/editing tasks.
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.
Generate and edit images using the Gemini API (Nano Banana Pro). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
AI-powered PPT generation with document analysis and styled images
Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image
Generate website images with Gemini 3 Native Image Generation. Covers hero banners, service cards, infographics with legible text, and multi-turn editing. Includes Australian-specific imagery patterns. Use when stock photos don't fit, need text in images, or require consistent style across assets. Prevents 5 documented errors.