open-webui

8
0
Source

Complete Open WebUI API integration for managing LLM models, chat completions, Ollama proxy operations, file uploads, knowledge bases (RAG), image generation, audio processing, and pipelines. Use this skill when interacting with Open WebUI instances via REST API - listing models, chatting with LLMs, uploading files for RAG, managing knowledge collections, or executing Ollama commands through the Open WebUI proxy. Requires OPENWEBUI_URL and OPENWEBUI_TOKEN environment variables or explicit parameters.

Install

mkdir -p .claude/skills/open-webui && curl -L -o skill.zip "https://mcp.directory/api/skills/download/3861" && unzip -o skill.zip -d .claude/skills/open-webui && rm skill.zip

Installs to .claude/skills/open-webui

About this skill

Open WebUI API Skill

Complete API integration for Open WebUI - a unified interface for LLMs including Ollama, OpenAI, and other providers.

When to Use

Activate this skill when the user wants to:

  • List available models from their Open WebUI instance
  • Send chat completions to models through Open WebUI
  • Upload files for RAG (Retrieval Augmented Generation)
  • Manage knowledge collections and add files to them
  • Use Ollama proxy endpoints (generate, embed, pull models)
  • Generate images or process audio through Open WebUI
  • Check Ollama status or manage models (load, unload, delete)
  • Create or manage pipelines

Do NOT activate for:

  • Installing or configuring Open WebUI server itself (use system admin skills)
  • General questions about what Open WebUI is (use general knowledge)
  • Troubleshooting Open WebUI server issues (use troubleshooting guides)
  • Local file operations unrelated to Open WebUI API

Prerequisites

Environment Variables (Recommended)

export OPENWEBUI_URL="http://localhost:3000"  # Your Open WebUI instance URL
export OPENWEBUI_TOKEN="your-api-key-here"    # From Settings > Account in Open WebUI

Authentication

  • Bearer Token authentication required
  • Token obtained from Open WebUI: Settings > Account
  • Alternative: JWT token for advanced use cases

Activation Triggers

Example requests that SHOULD activate this skill:

  1. "List all models available in my Open WebUI"
  2. "Send a chat completion to llama3.2 via Open WebUI with prompt 'Explain quantum computing'"
  3. "Upload /path/to/document.pdf to Open WebUI knowledge base"
  4. "Create a new knowledge collection called 'Research Papers' in Open WebUI"
  5. "Generate an embedding for 'Open WebUI is great' using the nomic-embed-text model"
  6. "Pull the llama3.2 model through Open WebUI Ollama proxy"
  7. "Get Ollama status from my Open WebUI instance"
  8. "Chat with gpt-4 using my Open WebUI with RAG enabled on collection 'docs'"
  9. "Generate an image using Open WebUI with prompt 'A futuristic city'"
  10. "Delete the old-model from Open WebUI Ollama"

Example requests that should NOT activate this skill:

  1. "How do I install Open WebUI?" (Installation/Admin)
  2. "What is Open WebUI?" (General knowledge)
  3. "Configure the Open WebUI environment variables" (Server config)
  4. "Troubleshoot why Open WebUI won't start" (Server troubleshooting)
  5. "Compare Open WebUI to other UIs" (General comparison)

Workflow

1. Configuration Check

  • Verify OPENWEBUI_URL and OPENWEBUI_TOKEN are set
  • Validate URL format (http/https)
  • Test connection with GET /api/models or /ollama/api/tags

2. Operation Execution

Use the CLI tool or direct API calls:

# Using the CLI tool (recommended)
python3 scripts/openwebui-cli.py --help
python3 scripts/openwebui-cli.py models list
python3 scripts/openwebui-cli.py chat --model llama3.2 --message "Hello"

# Using curl (alternative)
curl -H "Authorization: Bearer $OPENWEBUI_TOKEN" \
  "$OPENWEBUI_URL/api/models"

3. Response Handling

  • HTTP 200: Success - parse and present JSON
  • HTTP 401: Authentication failed - check token
  • HTTP 404: Endpoint/model not found
  • HTTP 422: Validation error - check request parameters

Core API Endpoints

Chat & Completions

EndpointMethodDescription
/api/chat/completionsPOSTOpenAI-compatible chat completions
/api/modelsGETList all available models
/ollama/api/chatPOSTNative Ollama chat completion
/ollama/api/generatePOSTOllama text generation

Ollama Proxy

EndpointMethodDescription
/ollama/api/tagsGETList Ollama models
/ollama/api/pullPOSTPull/download a model
/ollama/api/deleteDELETEDelete a model
/ollama/api/embedPOSTGenerate embeddings
/ollama/api/psGETList loaded models

RAG & Knowledge

EndpointMethodDescription
/api/v1/files/POSTUpload file for RAG
/api/v1/files/{id}/process/statusGETCheck file processing status
/api/v1/knowledge/GET/POSTList/create knowledge collections
/api/v1/knowledge/{id}/file/addPOSTAdd file to knowledge base

Images & Audio

EndpointMethodDescription
/api/v1/images/generationsPOSTGenerate images
/api/v1/audio/speechPOSTText-to-speech
/api/v1/audio/transcriptionsPOSTSpeech-to-text

Safety & Boundaries

Confirmation Required

Always confirm before:

  • Deleting models (DELETE /ollama/api/delete) - Irreversible
  • Pulling large models - May take significant time/bandwidth
  • Deleting knowledge collections - Data loss risk
  • Uploading sensitive files - Privacy consideration

Redaction & Security

  • Never log the full API token - Redact to sk-...XXXX format
  • Sanitize file paths - Verify files exist before upload
  • Validate URLs - Ensure HTTPS for external instances
  • Handle errors gracefully - Don't expose stack traces with tokens

Workspace Safety

  • File uploads default to workspace directory
  • Confirm before accessing files outside workspace
  • No sudo/root operations required (pure API client)

Examples

List Models

python3 scripts/openwebui-cli.py models list

Chat Completion

python3 scripts/openwebui-cli.py chat \
  --model llama3.2 \
  --message "Explain the benefits of RAG" \
  --stream

Upload File for RAG

python3 scripts/openwebui-cli.py files upload \
  --file /path/to/document.pdf \
  --process

Add File to Knowledge Base

python3 scripts/openwebui-cli.py knowledge add-file \
  --collection-id "research-papers" \
  --file-id "doc-123-uuid"

Generate Embeddings (Ollama)

python3 scripts/openwebui-cli.py ollama embed \
  --model nomic-embed-text \
  --input "Open WebUI is great for LLM management"

Pull Model (Confirmation Required)

python3 scripts/openwebui-cli.py ollama pull \
  --model llama3.2:70b
# Agent must confirm: "This will download ~40GB. Proceed? [y/N]"

Check Ollama Status

python3 scripts/openwebui-cli.py ollama status

Error Handling

ErrorCauseSolution
401 UnauthorizedInvalid or missing tokenVerify OPENWEBUI_TOKEN
404 Not FoundModel/endpoint doesn't existCheck model name spelling
422 Validation ErrorInvalid parametersCheck request body format
400 Bad RequestFile still processingWait for processing completion
Connection refusedWrong URLVerify OPENWEBUI_URL

Edge Cases

File Processing Race Condition

Files uploaded for RAG are processed asynchronously. Before adding to knowledge:

  1. Upload file → get file_id
  2. Poll /api/v1/files/{id}/process/status until status: "completed"
  3. Then add to knowledge collection

Large Model Downloads

Pulling models (e.g., 70B parameters) can take hours. Always:

  • Confirm with user before starting
  • Show progress if possible
  • Allow cancellation

Streaming Responses

Chat completions support streaming. Use --stream flag for real-time output or collect full response for non-streaming.

CLI Tool Reference

The included CLI tool (scripts/openwebui-cli.py) provides:

  • Automatic authentication from environment variables
  • Structured JSON output with optional formatting
  • Built-in help for all commands
  • Error handling with user-friendly messages
  • Progress indicators for long operations

Run python3 scripts/openwebui-cli.py --help for full usage.

seedream-image-gen

openclaw

Generate images via Seedream API (doubao-seedream models). Synchronous generation.

2359

ffmpeg-cli

openclaw

Comprehensive video/audio processing with FFmpeg. Use for: (1) Video transcoding and format conversion, (2) Cutting and merging clips, (3) Audio extraction and manipulation, (4) Thumbnail and GIF generation, (5) Resolution scaling and quality adjustment, (6) Adding subtitles or watermarks, (7) Speed adjustment (slow/fast motion), (8) Color correction and filters.

6623

context-optimizer

openclaw

Advanced context management with auto-compaction and dynamic context optimization for DeepSeek's 64k context window. Features intelligent compaction (merging, summarizing, extracting), query-aware relevance scoring, and hierarchical memory system with context archive. Logs optimization events to chat.

3622

a-stock-analysis

openclaw

A股实时行情与分时量能分析。获取沪深股票实时价格、涨跌、成交量,分析分时量能分布(早盘/尾盘放量)、主力动向(抢筹/出货信号)、涨停封单。支持持仓管理和盈亏分析。Use when: (1) 查询A股实时行情, (2) 分析主力资金动向, (3) 查看分时成交量分布, (4) 管理股票持仓, (5) 分析持仓盈亏。

9121

himalaya

openclaw

CLI to manage emails via IMAP/SMTP. Use `himalaya` to list, read, write, reply, forward, search, and organize emails from the terminal. Supports multiple accounts and message composition with MML (MIME Meta Language).

7921

garmin-connect

openclaw

Syncs daily health and fitness data from Garmin Connect into markdown files. Provides sleep, activity, heart rate, stress, body battery, HRV, SpO2, and weight data.

7321

You might also like

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

643969

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

591705

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

318398

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

339397

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

451339

fastapi-templates

wshobson

Create production-ready FastAPI projects with async patterns, dependency injection, and comprehensive error handling. Use when building new FastAPI applications or setting up backend API projects.

304231

Stay ahead of the MCP ecosystem

Get weekly updates on new skills and servers.