model-registry-maintainer


Guide for maintaining the MassGen model and backend registry. This skill should be used when adding new models, updating model information (release dates, pricing, context windows), or ensuring the registry stays current with provider releases. Covers both the capabilities registry and the pricing/token manager.

Install

mkdir -p .claude/skills/model-registry-maintainer && curl -L -o skill.zip "https://mcp.directory/api/skills/download/8598" && unzip -o skill.zip -d .claude/skills/model-registry-maintainer && rm skill.zip

Installs to .claude/skills/model-registry-maintainer

About this skill

Model Registry Maintainer

This skill provides guidance for maintaining MassGen's model registry across two key files:

  1. massgen/backend/capabilities.py - Models, capabilities, release dates
  2. massgen/token_manager/token_manager.py - Pricing, context windows

When to Use This Skill

  • New model released by a provider
  • Model pricing changes
  • Context window limits updated
  • Model capabilities changed
  • New provider/backend added

Two Files to Maintain

File 1: capabilities.py (Models & Features)

What it contains:

  • List of available models per provider
  • Model capabilities (web search, code execution, vision, etc.)
  • Release dates
  • Default models

Used by:

  • Config builder (--quickstart, --generate-config)
  • Documentation generation
  • Backend validation

Always update this file for new models.

File 2: token_manager.py (Pricing & Limits)

What it contains:

  • Hardcoded pricing/context windows for models NOT in LiteLLM database
  • On-demand loading from LiteLLM database (500+ models)

Used by:

  • Cost estimation
  • Token counting
  • Context management

Pricing resolution order:

  1. LiteLLM database (fetched on-demand, cached 1 hour)
  2. Hardcoded PROVIDER_PRICING (fallback only)
  3. Pattern matching heuristics

Only update PROVIDER_PRICING if:

  • Model is NOT in LiteLLM database
  • LiteLLM pricing is incorrect/outdated
  • Model is custom/internal to your organization
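The three-step resolution order can be sketched as follows. This is an illustrative sketch, not MassGen's actual API: the function name, the cache dict, and the sample prices are all assumptions.

```python
from typing import Optional, Tuple

# Illustrative stand-ins for the real lookups; names and values are assumptions.
LITELLM_CACHE: dict = {"gpt-4o": (0.0025, 0.01)}  # ~1-hour cache of the LiteLLM DB
PROVIDER_PRICING: dict = {"OpenAI": {"my-internal-model": (0.001, 0.004)}}


def resolve_pricing(model: str, provider: str) -> Optional[Tuple[float, float]]:
    """Resolve (input_per_1k, output_per_1k) using the three-step order."""
    # 1. LiteLLM database (fetched on demand, cached ~1 hour)
    if model in LITELLM_CACHE:
        return LITELLM_CACHE[model]
    # 2. Hardcoded PROVIDER_PRICING (fallback only)
    exact = PROVIDER_PRICING.get(provider, {}).get(model)
    if exact is not None:
        return exact
    # 3. Pattern-matching heuristics (here: prefix match on known names)
    for known, price in PROVIDER_PRICING.get(provider, {}).items():
        if model.startswith(known):
            return price
    return None
```

The key consequence for maintainers: a hardcoded entry only takes effect when the LiteLLM lookup misses, which is why most new models need no token_manager.py change.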

Information to Gather for New Models

1. Release Date

  • Month and year the model became generally available (the registry uses "YYYY-MM")
  • Check the provider's announcement or API changelog

2. Context Window

  • Input context size (tokens)
  • Max output tokens
  • Look for: "context window", "max tokens", "input/output limits"

3. Pricing

  • Input and output cost per 1K tokens
  • Check the provider's official pricing page

4. Capabilities

  • Web search, code execution, vision, reasoning, etc.
  • Check official API documentation

5. Model Name

  • Exact API identifier (case-sensitive)
  • Check provider's model documentation

Adding a New Model - Complete Workflow

Step 1: Add to capabilities.py

Add model to the models list and model_release_dates:

# massgen/backend/capabilities.py

"openai": BackendCapabilities(
    # ... existing fields ...
    models=[
        "new-model-name",  # Add here (newest first)
        "gpt-5.1",
        # ... existing models ...
    ],
    model_release_dates={
        "new-model-name": "2025-12",  # Add here
        "gpt-5.1": "2025-11",
        # ... existing dates ...
    },
)

Step 2: Check if pricing is in LiteLLM (Usually Skip)

First, check if the model is already in LiteLLM database:

import requests

url = "https://raw.githubusercontent.com/BerriAI/litellm/main/model_prices_and_context_window.json"
pricing_db = requests.get(url).json()

if "new-model-name" in pricing_db:
    print("✅ Model found in LiteLLM - no need to update token_manager.py")
    print(f"Pricing: ${pricing_db['new-model-name']['input_cost_per_token']*1000}/1K input")
else:
    print("❌ Model NOT in LiteLLM - need to add to PROVIDER_PRICING")

Only if NOT in LiteLLM, add to PROVIDER_PRICING:

# massgen/token_manager/token_manager.py

PROVIDER_PRICING: Dict[str, Dict[str, ModelPricing]] = {
    "OpenAI": {
        # Format: ModelPricing(input_per_1k, output_per_1k, context_window, max_output)
        "new-model-name": ModelPricing(0.00125, 0.01, 300000, 150000),
        # ... existing models ...
    },
}

Provider name mapping:

  • "OpenAI" (not "openai")
  • "Anthropic" (not "claude")
  • "Google" (not "gemini")
  • "xAI" (not "grok")
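The mapping above can be kept as a small dict when writing maintenance scripts. This is a hypothetical helper, not something that exists in MassGen; the backend keys on the left are assumed from the capabilities.py examples in this guide.

```python
# Hypothetical mapping from capabilities.py backend keys to the capitalized
# provider names expected by PROVIDER_PRICING.
BACKEND_TO_PROVIDER = {
    "openai": "OpenAI",
    "claude": "Anthropic",
    "gemini": "Google",
    "grok": "xAI",
}
```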

Step 3: Update Capabilities (if new features)

If the model introduces new capabilities:

supported_capabilities={
    "web_search",
    "code_execution",
    "new_capability",  # Add here
}

Step 4: Update Default Model (if appropriate)

Only change if the new model should be the recommended default:

default_model="new-model-name"

Step 5: Validate and Test

# Run capabilities tests
uv run pytest massgen/tests/test_backend_capabilities.py -v

# Test config generation with new model
massgen --generate-config ./test.yaml --config-backend openai --config-model new-model-name

# Verify the config was created successfully
cat ./test.yaml

Step 6: Regenerate Documentation

uv run python docs/scripts/generate_backend_tables.py
cd docs && make html

Current Model Data

OpenAI Models (as of Nov 2025)

In capabilities.py:

models=[
    "gpt-5.1",        # 2025-11
    "gpt-5-codex",    # 2025-09
    "gpt-5",          # 2025-08
    "gpt-5-mini",     # 2025-08
    "gpt-5-nano",     # 2025-08
    "gpt-4.1",        # 2025-04
    "gpt-4.1-mini",   # 2025-04
    "gpt-4.1-nano",   # 2025-04
    "gpt-4o",         # 2024-05
    "gpt-4o-mini",    # 2024-07
    "o4-mini",        # 2025-04
]

In token_manager.py (add missing models):

"OpenAI": {
    "gpt-5": ModelPricing(0.00125, 0.01, 400000, 128000),
    "gpt-5-mini": ModelPricing(0.00025, 0.002, 400000, 128000),
    "gpt-5-nano": ModelPricing(0.00005, 0.0004, 400000, 128000),
    "gpt-4o": ModelPricing(0.0025, 0.01, 128000, 16384),
    "gpt-4o-mini": ModelPricing(0.00015, 0.0006, 128000, 16384),
    # Missing: gpt-5.1, gpt-5-codex, gpt-4.1 family, o4-mini
}

Claude Models (as of Nov 2025)

In capabilities.py:

models=[
    "claude-haiku-4-5-20251001",    # 2025-10
    "claude-sonnet-4-5-20250929",   # 2025-09
    "claude-opus-4-1-20250805",     # 2025-08
    "claude-sonnet-4-20250514",     # 2025-05
]

In token_manager.py:

"Anthropic": {
    "claude-haiku-4-5": ModelPricing(0.001, 0.005, 200000, 65536),
    "claude-sonnet-4-5": ModelPricing(0.003, 0.015, 200000, 65536),
    "claude-opus-4.1": ModelPricing(0.015, 0.075, 200000, 32768),
    "claude-sonnet-4": ModelPricing(0.003, 0.015, 200000, 8192),
}

Gemini Models (as of Nov 2025)

In capabilities.py:

models=[
    "gemini-3-pro-preview",  # 2025-11
    "gemini-2.5-flash",      # 2025-06
    "gemini-2.5-pro",        # 2025-06
]

In token_manager.py (missing gemini-2.5 and gemini-3):

"Google": {
    "gemini-1.5-pro": ModelPricing(0.00125, 0.005, 2097152, 8192),
    "gemini-1.5-flash": ModelPricing(0.000075, 0.0003, 1048576, 8192),
    # Missing: gemini-2.5-pro, gemini-2.5-flash, gemini-3-pro-preview
}

Grok Models (as of Nov 2025)

In capabilities.py:

models=[
    "grok-4-1-fast-reasoning",      # 2025-11
    "grok-4-1-fast-non-reasoning",  # 2025-11
    "grok-code-fast-1",             # 2025-08
    "grok-4",                       # 2025-07
    "grok-4-fast",                  # 2025-09
    "grok-3",                       # 2025-02
    "grok-3-mini",                  # 2025-05
]

In token_manager.py (missing grok-3, grok-4 families):

"xAI": {
    "grok-2-latest": ModelPricing(0.005, 0.015, 131072, 131072),
    "grok-2": ModelPricing(0.005, 0.015, 131072, 131072),
    "grok-2-mini": ModelPricing(0.001, 0.003, 131072, 65536),
    # Missing: grok-3, grok-4, grok-4-1 families
}

Model Name Matching

Important: The names in PROVIDER_PRICING use simplified patterns:

  • "gpt-5" matches gpt-5, gpt-5-preview, gpt-5-*
  • "claude-sonnet-4-5" matches claude-sonnet-4-5-* (any date suffix)
  • "gemini-2.5-pro" is an exact match

The token manager uses prefix matching for flexibility.
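A longest-prefix rule matters here: "claude-sonnet-4-5-20250929" must resolve to "claude-sonnet-4-5", not the shorter "claude-sonnet-4". A minimal sketch of that behavior, with the caveat that the function name is illustrative and MassGen's real matcher may differ in detail:

```python
from typing import Optional


def find_pricing_key(model: str, provider_pricing: dict) -> Optional[str]:
    """Return the longest PROVIDER_PRICING key that the model name
    exactly equals or starts with; None if nothing matches."""
    candidates = [key for key in provider_pricing
                  if model == key or model.startswith(key)]
    return max(candidates, key=len) if candidates else None
```

Usage: with keys {"gpt-5", "claude-sonnet-4", "claude-sonnet-4-5"}, the dated model name "claude-sonnet-4-5-20250929" picks "claude-sonnet-4-5" because it is the longest matching prefix.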

Common Tasks

Task: Add brand new GPT-5.2 model

  1. Research: Release date, pricing, context window, capabilities
  2. Add to capabilities.py models list and release_dates
  3. Add to token_manager.py PROVIDER_PRICING["OpenAI"]
  4. Run tests
  5. Regenerate docs

Task: Update pricing for existing model

  1. Verify new pricing from official source
  2. Update only token_manager.py PROVIDER_PRICING
  3. No need to touch capabilities.py
  4. Document change in notes if significant

Task: Add new capability to model

  1. Update supported_capabilities in capabilities.py
  2. Add to notes explaining when/how capability works
  3. Update backend implementation if needed
  4. Run tests

Validation Commands

# Test capabilities registry
uv run pytest massgen/tests/test_backend_capabilities.py -v

# Test token manager
uv run pytest massgen/tests/test_token_manager.py -v

# Generate config with new model
massgen --generate-config ./test.yaml --config-backend openai --config-model new-model

# Build docs to verify tables
cd docs && make html

Programmatic Model Updates

LiteLLM Pricing Database (RECOMMENDED)

The easiest way to get comprehensive model pricing and context window data:

URL: https://raw.githubusercontent.com/BerriAI/litellm/main/model_prices_and_context_window.json

Coverage: 500+ models across 30+ providers including:

  • OpenAI, Anthropic, Google, xAI
  • Together AI, Groq, Cerebras, Fireworks
  • AWS Bedrock, Azure, Cohere, and more

Data Available:

{
  "gpt-4o": {
    "input_cost_per_token": 0.0000025,
    "output_cost_per_token": 0.00001,
    "max_input_tokens": 128000,
    "max_output_tokens": 16384,
    "supports_vision": true,
    "supports_function_calling": true,
    "supports_prompt_caching": true
  }
}

Usage:

import requests

# Fetch latest pricing
url = "https://raw.githubusercontent.com/BerriAI/litellm/main/model_prices_and_context_window.json"
pricing_db = requests.get(url).json()

# Get info for a model
model_info = pricing_db.get("gpt-4o")
input_per_1k = model_info["input_cost_per_token"] * 1000
output_per_1k = model_info["output_cost_per_token"] * 1000

