Gemini Nanobanana (Image Generation)

Name: Gemini Nanobanana (Image Generation)
Rating: 4.5 (30 reviews)
Author: junhan2

Integrates Google's Gemini 2.5 Flash Image API into Claude conversations, allowing you to generate images from text prompts directly in chat.

Integrates with Google's Gemini 2.5 Flash Image API to provide text-to-image generation, single image editing with prompts, multi-image composition, and style transfer capabilities with automatic file saving and collision handling.

11704 views4Local (stdio)

ai ml

GitHub

What it does

Generate images from text descriptions
Edit existing images with text prompts
Compose multiple images together
Apply style transfer to images
Save generated images with collision handling

Best for

Content creators needing quick image generationDesigners prototyping visual conceptsUsers wanting AI art within Claude conversations

Works directly in Claude DesktopAutomatic file savingBeginner-friendly 3-step setup

About Gemini Nanobanana (Image Generation)

Gemini Nanobanana (Image Generation) is a community-built MCP server published by junhan2 that provides AI assistants with tools and capabilities via the Model Context Protocol. Gemini Nanobanana: AI image generator for creating, editing, and composing stunning images using advanced artificial int It is categorized under ai ml.

How to install

You can install Gemini Nanobanana (Image Generation) in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Gemini Nanobanana (Image Generation) is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

🎨 Gemini Nanobanana MCP

Generate images from text with Claude! Simply type "Draw a cute cat" and get instant AI-generated images.

A beginner-friendly Model Context Protocol (MCP) server that brings Google's Gemini 2.5 Flash Image generation directly into your Claude conversations.

Quick Start - Just 3 Steps

1️⃣ Get Your API Key (1 minute)

Visit Google AI Studio
Sign in with your Google account
Click "Create API key" → Copy the key

2️⃣ Install in Your Claude Client (2 minutes)

Claude Desktop (Windows)

Open Notepad
Copy this code and replace YOUR_API_KEY with your actual key:

{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "YOUR_API_KEY"
      }
    }
  }
}

Save as: %APPDATA%/Claude/claude_desktop_config.json
Restart Claude Desktop

Claude Desktop (Mac)

Open Terminal (search "Terminal" in Spotlight)
Run this command (replace YOUR_API_KEY):

cat > ~/Library/Application\ Support/Claude/claude_desktop_config.json << 'EOF'
{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "YOUR_API_KEY"
      }
    }
  }
}
EOF

Restart Claude Desktop

Claude Code (Easiest!)

Just run this one command in your terminal (replace YOUR_API_KEY):

claude mcp add gemini-nanobanana-mcp -s user -e GEMINI_API_KEY="YOUR_API_KEY" -- npx -y gemini-nanobanana-mcp@latest

Cursor

Go to Cursor Settings → MCP → Add new MCP Server
Fill in:
- Name: gemini-nanobanana-mcp
- Command: npx
- Args: gemini-nanobanana-mcp@latest
- Environment Variables: GEMINI_API_KEY = YOUR_API_KEY
Restart Cursor

3️⃣ Start Creating! (0 minutes)

Try these in Claude:

"Generate a cute puppy playing in a garden"
"Create a beautiful sunset over mountains"
"Draw a red sports car"
"Make an abstract colorful painting"

Your images automatically save to ~/Downloads/gemini-images/

See It In Action

Basic Usage

You: Generate a cozy coffee shop interior
Claude: [Generating image...]
Image generated and saved to: ~/Downloads/gemini-images/generate-2025-01-09-14-30-45.png
Size: 1.2MB | Format: PNG

Custom Save Location

You: Create a sunset landscape and save it as ./my-sunset.png
Claude: Image saved to: ./my-sunset.png

What You Can Do

Text-to-Image Generation

Create any image you can imagine from a text description.

Examples:

"A majestic dragon flying over a medieval castle"
"Modern minimalist living room with plants"
"Vintage bicycle on a cobblestone street"

Image Editing

Edit existing images with natural language instructions.

How to use:

Upload an image to Claude
Say: "Make this image black and white"
Or: "Add a sunset background to this photo"

Image Composition

Combine multiple images into one creative composition.

How to use:

Upload 2-10 images to Claude
Say: "Combine these images into a collage"
Or: "Blend these photos together artistically"

Style Transfer

Apply the artistic style of one image to another.

How to use:

Upload two images: a content image and a style reference
Say: "Apply the style of the second image to the first"

Configuration Options

Environment Variables

Variable	Default	Description
`GEMINI_API_KEY`	Required	Your Google AI Studio API key
`AUTO_SAVE`	`true`	Automatically save images when no path specified
`DEFAULT_SAVE_DIR`	`~/Downloads/gemini-images`	Default directory for saved images
`LOG_LEVEL`	`info`	Logging level (`error`, `warn`, `info`, `debug`)

Example with custom settings:

{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "your-api-key",
        "AUTO_SAVE": "true",
        "DEFAULT_SAVE_DIR": "~/Pictures/AI-Images",
        "LOG_LEVEL": "debug"
      }
    }
  }
}

Disable Auto-Save

To only save when you explicitly request it:

{
  "env": {
    "GEMINI_API_KEY": "your-api-key",
    "AUTO_SAVE": "false"
  }
}

Then images will only appear in the chat without saving to disk.

Instant Image Preview (Claude Code)

Want images to open automatically after generation? Set up Claude Code hooks for instant Quick Look previews!

One-Click Setup (Mac)

# Clone this repo and run the installer
git clone https://github.com/nanobanana/nanobanana-mcp.git
cd nanobanana-mcp
bash hooks/install.sh

What You Get

Instant Preview: Generated images open automatically in Quick Look
Zero Manual Work: No more finding and opening files
Smart Detection: Only triggers for nanobanana image tools
Press Space to Close: Standard Quick Look controls

Full setup guide: hooks/README.md

Troubleshooting

❌ "GEMINI_API_KEY not set" error

Solution:

Double-check you replaced YOUR_API_KEY with your actual API key
Make sure there are no extra spaces around the key
Restart your Claude client completely
Verify your API key works at Google AI Studio

"No such file or directory" error

Solution:

Install Node.js from nodejs.org (choose LTS version)
Restart your terminal/Claude client
Try the installation again

Images not generating

Checklist:

API key correctly set?
Internet connection working?
Restart Claude after configuration?
Try a simple prompt: "Generate a blue circle"

Images not saving automatically

Solution: Check your configuration has AUTO_SAVE: "true" (default behavior). If you want to disable auto-save, set it to "false".

Hook setup not working

Common fixes:

Make sure you're using Claude Code (not Claude Desktop)
Run the installer from the nanobanana-mcp directory
Restart Claude Code after installation
Check hooks/README.md for detailed troubleshooting

Tips for Better Images

Prompt Writing Tips

Be specific: "A golden retriever puppy" vs "A dog"
Include style: "in watercolor style", "photorealistic", "cartoon style"
Add details: "with blue eyes", "in a sunny garden", "wearing a red collar"
Set the mood: "cozy", "dramatic", "peaceful", "energetic"

Technical Details

Supported formats: PNG, JPEG, WebP, GIF
Default output: PNG format
Image size: Optimized for quality and reasonable file size
Rate limits: Managed automatically with retry logic

🚀 Advanced Features

🔗 HTTP Mode (for integrations)

Run as an HTTP server instead of stdio:

MCP_TRANSPORT=http MCP_HTTP_PORT=8080 npx gemini-nanobanana-mcp@latest

Access at http://localhost:8080/mcp

📊 Debug Logging

Enable detailed logging:

{
  "env": {
    "GEMINI_API_KEY": "your-key",
    "LOG_LEVEL": "debug"
  }
}

💡 Need Help?

📖 Quick Setup Guide: QUICK_START.md
🪝 Auto-Preview Setup: hooks/README.md
🐛 Report Issues: GitHub Issues
💬 Feature Requests: Welcome!

🤝 Contributing

Found a bug? Have a feature idea? Contributions are welcome!

Fork the repository
Create your feature branch
Make your changes
Submit a pull request

📄 License

MIT License - feel free to use this in your own projects!

⭐ If this helped you, please star the repository on GitHub!

Built with ❤️ for the Claude community

Alternatives

Knowledge Graph Memory

anthropic

80.5k

Build persistent semantic networks for enterprise & engineering data management. Enable data persistence and memory acro

OfficialPopular

2.1k147

Context7

upstash

48.2k

Boost your AI code assistant with Context7: inject real-time API documentation from OpenAPI specification sources into y

OfficialRemotePopular

15.5k763

Blender

ahujasid

17.6k

Connect Blender to Claude AI for seamless 3D modeling. Use AI 3D model generator tools for faster, intuitive, interactiv

CommunityPopular

2.9k48

Google GenAI Toolbox

google

13.3k

Google GenAI Toolbox: open-source GenAI database agent and AI database connector for Google Cloud database—query Cloud S

OfficialPopular

215

Related Skills

Browse all skills

nanobanana-skill

Generate or edit images using Google Gemini API via nanobanana. Use when the user asks to create, generate, edit images with nanobanana, or mentions image generation/editing tasks.

nano-banana-pro

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

666

gemini-imagegen

Generate and edit images using the Gemini API (Nano Banana Pro). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

nanobanana-ppt-skills

AI-powered PPT generation with document analysis and styled images

ai-image-generation

Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image

image-gen

Generate website images with Gemini 3 Native Image Generation. Covers hero banners, service cards, infographics with legible text, and multi-turn editing. Includes Australian-specific imagery patterns. Use when stock photos don't fit, need text in images, or require consistent style across assets. Prevents 5 documented errors.