venice-ai-media

3views

1installs

Generate, edit, and upscale images; create videos from images or other videos via Venice AI. Supports text-to-image, image-to-video (Sora, WAN), video-to-video (Runway Gen4), upscaling, and AI editing.

Install

mkdir -p .claude/skills/venice-ai-media && curl -L -o skill.zip "https://mcp.directory/api/skills/download/7340" && unzip -o skill.zip -d .claude/skills/venice-ai-media && rm skill.zip

Installs to .claude/skills/venice-ai-media

About this skill

Venice AI Media

Generate images and videos using Venice AI APIs. Venice is an uncensored AI platform with competitive pricing.

Prerequisites

Python 3.10+ (brew install python or system Python)
Venice API key (free tier available)
requests library (auto-installed by scripts if missing)

Setup

1. Get Your API Key

Create account at venice.ai
Go to venice.ai/settings/api
Click "Create API Key"
Copy the key (starts with vn_...)

2. Configure the Key

Option A: Environment variable

export VENICE_API_KEY="vn_your_key_here"

Option B: Clawdbot config (recommended - persists across sessions)

Add to ~/.clawdbot/clawdbot.json:

{
  skills: {
    entries: {
      "venice-ai-media": {
        env: {
          VENICE_API_KEY: "vn_your_key_here",
        },
      },
    },
  },
}

3. Verify Setup

python3 {baseDir}/scripts/venice-image.py --list-models

If you see a list of models, you're ready!

Pricing Overview

Feature	Cost
Image generation	~$0.01-0.03 per image
Image upscale	~$0.02-0.04
Image edit	$0.04
Video (WAN)	~$0.10-0.50 depending on duration
Video (Sora)	~$0.50-2.00 depending on duration

Use --quote with video commands to check pricing before generation.

Quick Start

# Generate an image
python3 {baseDir}/scripts/venice-image.py --prompt "a serene canal in Venice at sunset"

# Upscale an image
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --scale 2

# Edit an image with AI
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "add sunglasses"

# Create a video from an image
python3 {baseDir}/scripts/venice-video.py --image photo.jpg --prompt "gentle camera pan" --duration 5s

Image Generation

python3 {baseDir}/scripts/venice-image.py --prompt "a serene canal in Venice at sunset"
python3 {baseDir}/scripts/venice-image.py --prompt "cyberpunk city" --count 4
python3 {baseDir}/scripts/venice-image.py --prompt "portrait" --width 768 --height 1024
python3 {baseDir}/scripts/venice-image.py --prompt "abstract art" --out-dir /tmp/venice
python3 {baseDir}/scripts/venice-image.py --list-models
python3 {baseDir}/scripts/venice-image.py --list-styles
python3 {baseDir}/scripts/venice-image.py --prompt "fantasy" --model flux-2-pro --no-validate
python3 {baseDir}/scripts/venice-image.py --prompt "photo" --style-preset "Cinematic" --embed-exif

Key flags: --prompt, --model (default: flux-2-max), --count (uses efficient batch API for same prompt), --width, --height, --format (webp/png/jpeg), --resolution (1K/2K/4K), --aspect-ratio, --negative-prompt, --style-preset (use --list-styles to see options), --cfg-scale (prompt adherence 0-20, default 7.5), --seed (for reproducible results), --safe-mode (disabled by default for uncensored output), --hide-watermark (only use if explicitly requested - watermark supports Venice), --embed-exif (embed prompt in image metadata), --lora-strength (0-100 for applicable models), --steps (inference steps, model-dependent), --enable-web-search, --no-validate (skip model check for new/beta models)

Image Upscale

python3 {baseDir}/scripts/venice-upscale.py photo.jpg --scale 2
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --scale 4 --enhance
python3 {baseDir}/scripts/venice-upscale.py photo.jpg --enhance --enhance-prompt "sharpen details"
python3 {baseDir}/scripts/venice-upscale.py --url "https://example.com/image.jpg" --scale 2

Key flags: --scale (1-4, default: 2), --enhance (AI enhancement), --enhance-prompt, --enhance-creativity (0.0-1.0), --replication (0.0-1.0, preserves lines/noise, default: 0.35), --url (use URL instead of local file), --output, --out-dir

Image Edit

python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "add sunglasses"
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "change the sky to sunset"
python3 {baseDir}/scripts/venice-edit.py photo.jpg --prompt "remove the person in background"
python3 {baseDir}/scripts/venice-edit.py --url "https://example.com/image.jpg" --prompt "colorize"

Key flags: --prompt (required - AI interprets what to modify), --url (use URL instead of local file), --output, --out-dir

Note: The edit endpoint uses the Qwen-Image model which has some content restrictions (unlike other Venice endpoints).

Video Generation

# Get price quote first (no generation)
python3 {baseDir}/scripts/venice-video.py --quote --model wan-2.6-image-to-video --duration 10s --resolution 720p

# Image-to-video (WAN 2.6 - default)
python3 {baseDir}/scripts/venice-video.py --image photo.jpg --prompt "camera pans slowly" --duration 10s

# Image-to-video (Sora)
python3 {baseDir}/scripts/venice-video.py --image photo.jpg --prompt "cinematic" \
  --model sora-2-image-to-video --duration 8s --aspect-ratio 16:9 --skip-audio-param

# List models (shows available durations per model)
python3 {baseDir}/scripts/venice-video.py --list-models

# Clean up a video downloaded with --no-delete
python3 {baseDir}/scripts/venice-video.py --complete <queue_id> --model <model>

Key flags: --image (required for generation), --prompt (required for generation), --model (default: wan-2.6-image-to-video), --duration (model-dependent, see --list-models), --resolution (480p/720p/1080p), --aspect-ratio, --audio/--no-audio, --skip-audio-param, --quote (price estimate), --timeout, --poll-interval, --no-delete (keep server media), --complete (cleanup previously downloaded video), --no-validate (skip model check)

Progress: During generation, the script shows estimated progress based on Venice's average execution time.

Model Notes

Use --list-models to see current availability and status. Models change frequently.

Image: Default is flux-2-max. Common options include flux, gpt-image, and nano-banana variants.

Video:

WAN models: Image-to-video, configurable audio, various durations (5s-21s)
Sora models: Requires --aspect-ratio, use --skip-audio-param

Tips:

Use --no-validate for new or beta models not yet in the model list
Use --quote for video to check pricing before generation
Safe mode is disabled by default (Venice is an uncensored API)

Output

Scripts print a MEDIA: /path/to/file line for Clawdbot auto-attach.

Tip: Use --out-dir /tmp/venice-$(date +%s) when generating media to send via iMessage (ensures accessibility across user accounts).

Troubleshooting

"VENICE_API_KEY not set"

Check your config in ~/.clawdbot/clawdbot.json
Or export the env var: export VENICE_API_KEY="vn_..."

"Invalid API key"

Verify your key at venice.ai/settings/api
Keys start with vn_

"Model not found"

Run --list-models to see available models
Use --no-validate for new/beta models

Video stuck/timeout

Videos can take 1-5 minutes depending on model and duration
Use --timeout 600 for longer videos
Check Venice status at venice.ai

"requests" module not found

Install it: pip3 install requests

More by openclaw

View all skills by openclaw →

a-stock-analysis

openclaw

A股实时行情与分时量能分析。获取沪深股票实时价格、涨跌、成交量，分析分时量能分布（早盘/尾盘放量）、主力动向（抢筹/出货信号）、涨停封单。支持持仓管理和盈亏分析。Use when: (1) 查询A股实时行情, (2) 分析主力资金动向, (3) 查看分时成交量分布, (4) 管理股票持仓, (5) 分析持仓盈亏。

458175

research-paper-writer

openclaw

Creates formal academic research papers following IEEE/ACM formatting standards with proper structure, citations, and scholarly writing style. Use when the user asks to write a research paper, academic paper, or conference paper on any topic.

50121

weread

openclaw

WeChat Reading (微信读书) CLI tool for fetching notes and highlights. Use when: (1) user asks about weread/微信读书 notes or highlights, (2) fetching today's or recent reading notes, (3) exporting book highlights, (4) managing reading bookshelf, (5) any task involving reading notes from WeChat Reading.

6376

gog

openclaw

Google Workspace CLI for Gmail, Calendar, Drive, Contacts, Sheets, and Docs.

17773

keyword-research

openclaw

Discovers high-value keywords with search intent analysis, difficulty assessment, and content opportunity mapping. Essential for starting any SEO or GEO content strategy.

31565

seedream-image-gen

openclaw

Generate images via Seedream API (doubao-seedream models). Synchronous generation.

4063

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

1,5581,556

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

1,8261,484

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

1,7061,235

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

1,610905

pdf-to-markdown

aliceisjustplaying

Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.

1,890835

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

1,435791

Related MCP Servers

Browse all servers

Recraft AI

Recraft AI is an ai image generator for creating, editing, and upscaling raster or vector images with advanced artificia

460 tools

Office PowerPoint

Create and edit PowerPoint presentations in Python with Office PowerPoint. Use python pptx or pptx python tools to add s

1,56237 tools

Nano-Banana (Gemini 2.5 Flash Image)

Generate and edit images from text with Nano-Banana, an AI image generator powered by Gemini 2.5 Flash. Fast, seamless,

1120 tools

Stability AI

Generate and edit images with Stability AI's powerful ai image generator using advanced Stable Diffusion models for stun

800 tools

Gemini Image Generator

Gemini Image Generator uses AI to create images with prompt enhancement, character consistency, and multi-image blending

790 tools

VChart

Create interactive visualizations and charts with VChart, a powerful data analysis tool and pie chart maker for flexible

4810 tools

Install

mkdir -p .claude/skills/venice-ai-media && curl -L -o skill.zip "https://mcp.directory/api/skills/download/7340" && unzip -o skill.zip -d .claude/skills/venice-ai-media && rm skill.zip

Installs to .claude/skills/venice-ai-media

Stats

Views

Installs

Author

openclaw

7 skills published

Links

Source Code

venice-ai-media

Install

About this skill

Venice AI Media

Prerequisites

Setup

1. Get Your API Key

2. Configure the Key

3. Verify Setup

Pricing Overview

Quick Start

Image Generation

Image Upscale

Image Edit

Video Generation

Model Notes

Output

Troubleshooting

More by openclaw

a-stock-analysis

research-paper-writer

weread

gog

keyword-research

seedream-image-gen

You might also like

ui-ux-pro-max

flutter-development

drawio-diagrams-enhanced

godot

pdf-to-markdown

nano-banana-pro

Related MCP Servers