cellcog
#1 on DeepResearch Bench (Feb 2026). Any-to-Any AI for agents. Combines deep reasoning with all modalities through sophisticated multi-agent orchestration. Research, videos, images, audio, dashboards, presentations, spreadsheets, and more.
Install
mkdir -p .claude/skills/cellcog && curl -L -o skill.zip "https://mcp.directory/api/skills/download/8531" && unzip -o skill.zip -d .claude/skills/cellcog && rm skill.zipInstalls to .claude/skills/cellcog
About this skill
CellCog - Any-to-Any for Agents
The Power of Any-to-Any
CellCog is the only AI that truly handles any input → any output in a single request. No tool chaining. No orchestration complexity. One call, multiple deliverables.
CellCog pairs all modalities with frontier-level deep reasoning — as of April 2026, CellCog is #1 on the DeepResearch Bench: https://huggingface.co/spaces/muset-ai/DeepResearch-Bench-Leaderboard
Work With Multiple Files, Any Format
Reference as many documents as you need—all at once:
prompt = """
Analyze all of these together:
<SHOW_FILE>/data/q4_earnings.pdf</SHOW_FILE>
<SHOW_FILE>/data/competitor_analysis.pdf</SHOW_FILE>
<SHOW_FILE>/data/market_research.xlsx</SHOW_FILE>
<SHOW_FILE>/recordings/customer_interview.mp3</SHOW_FILE>
<SHOW_FILE>/designs/product_mockup.png</SHOW_FILE>
Give me a comprehensive market positioning analysis based on all these inputs.
"""
File paths must be absolute and enclosed in <SHOW_FILE> tags. CellCog understands PDFs, spreadsheets, images, audio, video, code files, and more.
⚠️ Without SHOW_FILE tags, CellCog only sees the path as text — not the file contents.
❌ Analyze /data/sales.csv — CellCog can't read the file
✅ Analyze <SHOW_FILE>/data/sales.csv</SHOW_FILE> — CellCog reads it
Request Multiple Outputs, Different Modalities
Ask for completely different output types in ONE request:
prompt = """
Based on this quarterly sales data:
<SHOW_FILE>/data/sales_q4_2025.csv</SHOW_FILE>
Create ALL of the following:
1. A PDF executive summary report with charts
2. An interactive HTML dashboard for the leadership team
3. A 60-second video presentation for the all-hands meeting
4. A slide deck for the board presentation
5. An Excel file with the underlying analysis and projections
"""
CellCog handles the entire workflow — analyzing, generating, and delivering all outputs with consistent insights across every format.
⚠️ Be explicit about output artifacts. Without explicit artifact language, CellCog may respond with text analysis instead of generating a file.
❌ "Quarterly earnings analysis for AAPL" — could produce text or any format
✅ "Create a PDF report and an interactive HTML dashboard analyzing AAPL quarterly earnings." — CellCog creates actual deliverables
Your sub-agent for quality work. Depth, accuracy, and real deliverables.
Quick Start
Setup
from cellcog import CellCogClient
If import fails:
pip install -U cellcog
Authentication
Environment variable (recommended): Set CELLCOG_API_KEY — the SDK picks it up automatically:
export CELLCOG_API_KEY="sk_..."
Get API key from: https://cellcog.ai/profile?tab=api-keys
status = client.get_account_status()
print(status) # {"configured": True, "email": "user@example.com", ...}
Agent Provider
agent_provider is required when creating a CellCogClient. It identifies which agent framework is calling CellCog — not your individual agent's name, but the platform/tool you're running inside.
Examples: "openclaw", "claude-code", "cursor", "aider", "windsurf", "perplexity", "hermes", "script" (for standalone scripts).
OpenClaw Agents
Fire-and-forget — your agent stays free while CellCog works:
client = CellCogClient(agent_provider="openclaw")
result = client.create_chat(
prompt="Research quantum computing advances in 2026",
notify_session_key="agent:main:main", # OpenClaw session key
task_label="quantum-research", # Label for notifications
chat_mode="agent",
)
# Returns IMMEDIATELY — daemon delivers results to your session when done
Requires sessions_send enabled on your gateway — see OpenClaw Reference below.
All Other Agents (Cursor, Claude Code, etc.)
Blocks until done — simplest pattern:
client = CellCogClient(agent_provider="cursor") # or "claude-code", "aider", "script", etc.
result = client.create_chat(
prompt="Research quantum computing advances in 2026",
task_label="quantum-research",
chat_mode="agent",
)
# Blocks until done — result contains everything
print(result["message"])
Credit Usage
CellCog orchestrates 21+ frontier foundation models. Credit consumption is unpredictable and varies by task complexity. Credits used are reported in every completion notification.
Creating Tasks
Notify on Completion (OpenClaw — Fire-and-Forget)
Returns immediately. A background daemon monitors via WebSocket and delivers results to your session when done. Your agent stays free to take new instructions, start other tasks, or continue working.
result = client.create_chat(
prompt="Your task description",
notify_session_key="agent:main:main", # Required — your OpenClaw session key
task_label="my-task", # Label shown in notifications
chat_mode="agent",
)
Requires OpenClaw Gateway with sessions_send enabled (disabled by default since OpenClaw 2026.4). See OpenClaw Reference below for one-time setup.
Wait for Completion (Universal)
Blocks until CellCog finishes. Works with any agent — OpenClaw, Cursor, Claude Code, or any Python environment.
result = client.create_chat(
prompt="Your task description",
task_label="my-task",
chat_mode="agent",
timeout=1800, # 30 min (default). Use 3600 for complex jobs.
)
print(result["message"])
print(result["status"]) # "completed" | "timeout"
When to Use Which
| Scenario | Best Mode | Why |
|---|---|---|
| OpenClaw + long task + stay free | Notify | Agent keeps working, gets notified when done |
| OpenClaw + chaining steps (research → summarize → PDF) | Wait | Each step feeds the next — simpler sequential workflows |
| OpenClaw + quick task | Either | Both return fast for simple tasks |
| Non-OpenClaw agent | Wait | Only option — no sessions_send available |
Notify mode is more productive (agent never blocks) but requires gateway configuration. Wait mode is simpler to reason about, but blocks your agent for the duration.
Continuing a Conversation
# Wait mode (default)
result = client.send_message(
chat_id="abc123",
message="Focus on hardware advances specifically",
)
# Notify mode (OpenClaw)
result = client.send_message(
chat_id="abc123",
message="Focus on hardware advances specifically",
notify_session_key="agent:main:main",
task_label="continue-research",
)
Resuming After Timeout
If create_chat() or wait_for_completion() times out, CellCog is still working. The timeout response includes recent progress:
completion = client.wait_for_completion(chat_id="abc123", timeout=1800)
Optional Parameters
result = client.create_chat(
prompt="...",
task_label="...",
chat_mode="agent", # See Chat Modes below
project_id="...", # install project-cog for details
agent_role_id="...", # install project-cog for details
enable_cowork=True, # install cowork-cog for details
cowork_working_directory="/Users/...", # install cowork-cog for details
)
Response Shape
Every SDK method returns the same shape:
{
"chat_id": str, # CellCog chat ID
"is_operating": bool, # True = still working, False = done
"status": str, # "completed" | "tracking" | "timeout" | "operating"
"message": str, # THE printable message — always print this in full
}
⚠️ Always print the entire result["message"]. Truncating or summarizing it will lose critical information including generated file paths, credits used, and follow-up instructions.
Utility Methods
get_history(chat_id) — Full chat history (when original delivery was missed or you need to review). Returns the same shape; if still operating, message shows progress so far.
result = client.get_history(chat_id="abc123")
get_status(chat_id) — Lightweight status check (no history fetch):
status = client.get_status(chat_id="abc123")
print(status["is_operating"]) # True/False
Chat Modes
| Mode | Best For | Speed | Min Credits |
|---|---|---|---|
"agent" | Most tasks — images, audio, dashboards, spreadsheets, presentations | Fast (seconds to minutes) | 100 |
"agent core" | Coding, co-work, terminal operations | Fast | 50 |
"agent team" | Deep research & multi-angled reasoning across every modality | Slower (5-60 min) | 500 |
"agent team max" | High-stakes work where extra reasoning depth justifies the cost | Slowest | 2,000 |
"agent"(default) — Most versatile. Handles most tasks excellently, including deep research when guided."agent core"— Lightweight context for code, terminal, and file operations. Multimedia tools load on demand. Requires Co-work (CellCog Desktop). Seecode-cog."agent team"— A team of agents that debates, cross-validates, and delivers comprehensive results. The only platform with deep reasoning across every modality."agent team max"— Same Agent Team with all settings maxed. Quality gain is incremental (5-10%) but meaningful for costly decisions.
Working with Files
Input: SHOW_FILE
Include local file paths in your prompt with <SHOW_FILE> tags (absolute paths required):
prompt = """
Analyze this sales data and create a report:
<SHOW_FILE>/path/to/sales.csv</SHOW_FILE>
"""
Output: GENERATE_FILE
Use <GENERATE_FILE> tags to specify where output files should be stored on your machine. Essential for deterministic workflows where the next step needs to know the file path in advance.
prompt = """
Create a PDF report on Q4 earnings:
<GENERATE_FILE>/workspace/reports/q4_
---
*Content truncated.*
More by openclaw
View all skills by openclaw →You might also like
flutter-development
aj-geddes
Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.
drawio-diagrams-enhanced
jgtolentino
Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.
ui-ux-pro-max
nextlevelbuilder
"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."
godot
bfollington
This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.
nano-banana-pro
garg-aayush
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.
pdf-to-markdown
aliceisjustplaying
Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.
Related MCP Servers
Browse all serversOfficial Bright Data MCP server. Web scraping, SERP data collection, and browser automation via Bright Data's proxy netw
Browser Use lets LLMs and agents access and scrape any website in real time, making web scraping and web page scraping e
Enhance software testing with Playwright MCP: Fast, reliable browser automation, an innovative alternative to Selenium s
MCP Toolbox for Databases by Google. An open-source server that lets AI agents query Cloud SQL, Spanner, AlloyDB, and ot
The fullstack MCP framework for developing MCP apps for ChatGPT, Claude, and building MCP servers for AI agents. Connect
Advanced MCP server enabling AI agents to autonomously run 150+ security and penetration testing tools. Covers reconnais
Stay ahead of the MCP ecosystem
Get weekly updates on new skills and servers.