Gemini Nanobanana (Image Generation)

Gemini Nanobanana (Image Generation)

junhan2

Integrates Google's Gemini 2.5 Flash Image API into Claude conversations, allowing you to generate images from text prompts directly in chat.

Integrates with Google's Gemini 2.5 Flash Image API to provide text-to-image generation, single image editing with prompts, multi-image composition, and style transfer capabilities with automatic file saving and collision handling.

11704 views4Local (stdio)

What it does

  • Generate images from text descriptions
  • Edit existing images with text prompts
  • Compose multiple images together
  • Apply style transfer to images
  • Save generated images with collision handling

Best for

Content creators needing quick image generationDesigners prototyping visual conceptsUsers wanting AI art within Claude conversations
Works directly in Claude DesktopAutomatic file savingBeginner-friendly 3-step setup

About Gemini Nanobanana (Image Generation)

Gemini Nanobanana (Image Generation) is a community-built MCP server published by junhan2 that provides AI assistants with tools and capabilities via the Model Context Protocol. Gemini Nanobanana: AI image generator for creating, editing, and composing stunning images using advanced artificial int It is categorized under ai ml.

How to install

You can install Gemini Nanobanana (Image Generation) in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Gemini Nanobanana (Image Generation) is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

🎨 Gemini Nanobanana MCP

npm version License: MIT TypeScript Node.js

Generate images from text with Claude! Simply type "Draw a cute cat" and get instant AI-generated images.

A beginner-friendly Model Context Protocol (MCP) server that brings Google's Gemini 2.5 Flash Image generation directly into your Claude conversations.

Quick Start - Just 3 Steps

1️⃣ Get Your API Key (1 minute)

  1. Visit Google AI Studio
  2. Sign in with your Google account
  3. Click "Create API key" β†’ Copy the key

2️⃣ Install in Your Claude Client (2 minutes)

Claude Desktop (Windows)
  1. Open Notepad
  2. Copy this code and replace YOUR_API_KEY with your actual key:
{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "YOUR_API_KEY"
      }
    }
  }
}
  1. Save as: %APPDATA%/Claude/claude_desktop_config.json
  2. Restart Claude Desktop
Claude Desktop (Mac)
  1. Open Terminal (search "Terminal" in Spotlight)
  2. Run this command (replace YOUR_API_KEY):
cat > ~/Library/Application\ Support/Claude/claude_desktop_config.json << 'EOF'
{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "YOUR_API_KEY"
      }
    }
  }
}
EOF
  1. Restart Claude Desktop
Claude Code (Easiest!)

Just run this one command in your terminal (replace YOUR_API_KEY):

claude mcp add gemini-nanobanana-mcp -s user -e GEMINI_API_KEY="YOUR_API_KEY" -- npx -y gemini-nanobanana-mcp@latest
Cursor
  1. Go to Cursor Settings β†’ MCP β†’ Add new MCP Server
  2. Fill in:
    • Name: gemini-nanobanana-mcp
    • Command: npx
    • Args: gemini-nanobanana-mcp@latest
    • Environment Variables: GEMINI_API_KEY = YOUR_API_KEY
  3. Restart Cursor

3️⃣ Start Creating! (0 minutes)

Try these in Claude:

  • "Generate a cute puppy playing in a garden"
  • "Create a beautiful sunset over mountains"
  • "Draw a red sports car"
  • "Make an abstract colorful painting"

Your images automatically save to ~/Downloads/gemini-images/


See It In Action

Basic Usage

You: Generate a cozy coffee shop interior
Claude: [Generating image...]
Image generated and saved to: ~/Downloads/gemini-images/generate-2025-01-09-14-30-45.png
Size: 1.2MB | Format: PNG

Custom Save Location

You: Create a sunset landscape and save it as ./my-sunset.png
Claude: Image saved to: ./my-sunset.png

What You Can Do

Text-to-Image Generation

Create any image you can imagine from a text description.

Examples:

  • "A majestic dragon flying over a medieval castle"
  • "Modern minimalist living room with plants"
  • "Vintage bicycle on a cobblestone street"

Image Editing

Edit existing images with natural language instructions.

How to use:

  • Upload an image to Claude
  • Say: "Make this image black and white"
  • Or: "Add a sunset background to this photo"

Image Composition

Combine multiple images into one creative composition.

How to use:

  • Upload 2-10 images to Claude
  • Say: "Combine these images into a collage"
  • Or: "Blend these photos together artistically"

Style Transfer

Apply the artistic style of one image to another.

How to use:

  • Upload two images: a content image and a style reference
  • Say: "Apply the style of the second image to the first"

Configuration Options

Environment Variables
VariableDefaultDescription
GEMINI_API_KEYRequiredYour Google AI Studio API key
AUTO_SAVEtrueAutomatically save images when no path specified
DEFAULT_SAVE_DIR~/Downloads/gemini-imagesDefault directory for saved images
LOG_LEVELinfoLogging level (error, warn, info, debug)

Example with custom settings:

{
  "mcpServers": {
    "gemini-nanobanana-mcp": {
      "command": "npx",
      "args": ["gemini-nanobanana-mcp@latest"],
      "env": {
        "GEMINI_API_KEY": "your-api-key",
        "AUTO_SAVE": "true",
        "DEFAULT_SAVE_DIR": "~/Pictures/AI-Images",
        "LOG_LEVEL": "debug"
      }
    }
  }
}
Disable Auto-Save

To only save when you explicitly request it:

{
  "env": {
    "GEMINI_API_KEY": "your-api-key",
    "AUTO_SAVE": "false"
  }
}

Then images will only appear in the chat without saving to disk.


Instant Image Preview (Claude Code)

Want images to open automatically after generation? Set up Claude Code hooks for instant Quick Look previews!

One-Click Setup (Mac)

# Clone this repo and run the installer
git clone https://github.com/nanobanana/nanobanana-mcp.git
cd nanobanana-mcp
bash hooks/install.sh

What You Get

  • Instant Preview: Generated images open automatically in Quick Look
  • Zero Manual Work: No more finding and opening files
  • Smart Detection: Only triggers for nanobanana image tools
  • Press Space to Close: Standard Quick Look controls

Full setup guide: hooks/README.md


Troubleshooting

❌ "GEMINI_API_KEY not set" error

Solution:

  1. Double-check you replaced YOUR_API_KEY with your actual API key
  2. Make sure there are no extra spaces around the key
  3. Restart your Claude client completely
  4. Verify your API key works at Google AI Studio
"No such file or directory" error

Solution:

  1. Install Node.js from nodejs.org (choose LTS version)
  2. Restart your terminal/Claude client
  3. Try the installation again
Images not generating

Checklist:

  • API key correctly set?
  • Internet connection working?
  • Restart Claude after configuration?
  • Try a simple prompt: "Generate a blue circle"
Images not saving automatically

Solution: Check your configuration has AUTO_SAVE: "true" (default behavior). If you want to disable auto-save, set it to "false".

Hook setup not working

Common fixes:

  1. Make sure you're using Claude Code (not Claude Desktop)
  2. Run the installer from the nanobanana-mcp directory
  3. Restart Claude Code after installation
  4. Check hooks/README.md for detailed troubleshooting

Tips for Better Images

Prompt Writing Tips

  • Be specific: "A golden retriever puppy" vs "A dog"
  • Include style: "in watercolor style", "photorealistic", "cartoon style"
  • Add details: "with blue eyes", "in a sunny garden", "wearing a red collar"
  • Set the mood: "cozy", "dramatic", "peaceful", "energetic"

Technical Details

  • Supported formats: PNG, JPEG, WebP, GIF
  • Default output: PNG format
  • Image size: Optimized for quality and reasonable file size
  • Rate limits: Managed automatically with retry logic

πŸš€ Advanced Features

πŸ”— HTTP Mode (for integrations)

Run as an HTTP server instead of stdio:

MCP_TRANSPORT=http MCP_HTTP_PORT=8080 npx gemini-nanobanana-mcp@latest

Access at http://localhost:8080/mcp

πŸ“Š Debug Logging

Enable detailed logging:

{
  "env": {
    "GEMINI_API_KEY": "your-key",
    "LOG_LEVEL": "debug"
  }
}

πŸ’‘ Need Help?


🀝 Contributing

Found a bug? Have a feature idea? Contributions are welcome!

  1. Fork the repository
  2. Create your feature branch
  3. Make your changes
  4. Submit a pull request

πŸ“„ License

MIT License - feel free to use this in your own projects!


⭐ If this helped you, please star the repository on GitHub!

Built with ❀️ for the Claude community

Alternatives

Related Skills

Browse all skills
nanobanana-skill

Generate or edit images using Google Gemini API via nanobanana. Use when the user asks to create, generate, edit images with nanobanana, or mentions image generation/editing tasks.

6
nano-banana-pro

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

666
gemini-imagegen

Generate and edit images using the Gemini API (Nano Banana Pro). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

16
nanobanana-ppt-skills

AI-powered PPT generation with document analysis and styled images

7
ai-image-generation

Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image

6
image-gen

Generate website images with Gemini 3 Native Image Generation. Covers hero banners, service cards, infographics with legible text, and multi-turn editing. Includes Australian-specific imagery patterns. Use when stock photos don't fit, need text in images, or require consistent style across assets. Prevents 5 documented errors.

5