report-research

0views

1installs

Write a complete Numerai experiment report in experiment.md (abstract, methods, results tables, decisions, next steps) and generate/link the standard show_experiment plot(s). Use after running any Numerai research experiments, or when a user asks for a “full report”, “write up”, “experiment.md update”, or “generate the standard plot”.

Install

mkdir -p .claude/skills/report-research && curl -L -o skill.zip "https://mcp.directory/api/skills/download/6030" && unzip -o skill.zip -d .claude/skills/report-research && rm skill.zip

Installs to .claude/skills/report-research

About this skill

Report Research

Overview

This skill turns an experiment run into a durable write-up: a full experiment.md plus the standard show_experiment plot(s) linked from the report.

Workflow (do all steps)

1) Locate the experiment folder

Use the folder that contains:

configs/ (the configs you ran)
results/ (JSON metrics output)
predictions/ (OOF parquet output)
experiment.md (the report you will write/update)

2) Inventory what was actually run

List configs that exist.
Determine which ones were executed by checking for matching results/*.json and predictions/*.parquet.
Identify the “best” model(s) using bmc_mean and bmc_last_200_eras.mean (primary), with corr_mean as a sanity check.
If experiments were run in rounds, summarize each round’s intent (what changed) and whether it improved the current best.

3) Extract metrics for the report

For each run you report, include at least:

corr_mean
bmc_mean
bmc_last_200_eras.mean
avg_corr_with_benchmark (from the BMC summary)

Prefer a single markdown table with one row per model.

4) Write a full report in experiment.md

Update/create experiment.md with these sections (keep it crisp but complete):

Title + Date
Abstract (what was tested + headline result)
Hypothesis / Motivation (why this should help BMC)
Method (data split, CV, feature set, model type/hparams, any transforms)
Experiments run (one subsection per config that actually ran; include output artifacts)
Results (the metrics table; mention best run + trade-offs)
Standard plot (embed the PNG and include the generating command)
Decisions made (what you chose and why; e.g., per-era vs global, feature set choice, sweep choices)
Stopping rationale (why you stopped iterating; e.g., plateau after N rounds, confirmatory scale step, diminishing returns)
Findings (what worked / didn’t; interpret the plot)
Next experiments (2–5 concrete follow-ups)
Repro commands (train + plot commands from repo root)

5) Generate the standard plot(s) and link them

Default standard plot (baseline = benchmark predictions):

PYTHONPATH=numerai python3 -m agents.code.analysis.show_experiment benchmark <best_model_results_name> \
  --base-benchmark-model v52_lgbm_ender20 \
  --benchmark-data-path numerai/v5.2/full_benchmark_models.parquet \
  --start-era 575 --dark \
  --output-dir numerai/agents/experiments/<experiment_name> \
  --baselines-dir numerai/agents/baselines

Then embed it in experiment.md with a relative link:

![benchmark vs best model](plots/<generated_plot_name>.png)

If you have multiple candidate models, either:

generate one plot with multiple experiment models, or
generate one plot per candidate (and link all of them).

6) Final checks

Plot files exist under plots/.
experiment.md links resolve (use relative paths).
Metrics table matches results/*.json.
Report clearly states what was run vs what is only planned/configured.

More by numerai

View all skills by numerai →

numerai-model-upload

numerai

Create Numerai Tournament model upload pickles (.pkl) with a self-contained predict() function. Use when preparing upload artifacts, debugging numerai_predict import errors, or documenting model-upload requirements and testing steps.

numerai-experiment-design

numerai

Design and manage Numerai experiments in this repo for any model idea.

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

1,6841,428

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

1,2621,324

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

1,5331,147

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

1,353807

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

1,263727

pdf-to-markdown

aliceisjustplaying

Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.

1,481684

Related MCP Servers

Browse all servers

Excel File Manipulation

Automate Excel file tasks without Microsoft Excel using openpyxl and xlsxwriter for formatting, formulas, charts, and ad

3,43225 tools

Filesystem

Learn how to use Python to read a file and manipulate local files safely through the Filesystem API.

80,52714 tools

Mastra Docs

Mastra Docs: AI assistants with direct access to Mastra.ai’s full knowledge base for faster, smarter support and insight

21,8110 tools

Arize Phoenix

Arize Phoenix — unified interface for managing prompts, exploring datasets, and running LLM experiments across providers

8,7850 tools

Deep Research MCP

Deep Research MCP — an AI research assistant and LLM research tool for multi-step web search, content analysis, and synt

4,5010 tools

Google Maps

Find official MCP servers for Google Maps. Explore resources to build, integrate, and extend apps with Google directions

3,3520 tools

Install

mkdir -p .claude/skills/report-research && curl -L -o skill.zip "https://mcp.directory/api/skills/download/6030" && unzip -o skill.zip -d .claude/skills/report-research && rm skill.zip

Installs to .claude/skills/report-research

Stats

Views

Installs

Author

numerai

3 skills published

Links

Source Code

report-research

Install

About this skill

Report Research

Overview

Workflow (do all steps)

1) Locate the experiment folder

2) Inventory what was actually run

3) Extract metrics for the report

4) Write a full report in experiment.md

5) Generate the standard plot(s) and link them

6) Final checks

More by numerai

numerai-model-upload

numerai-experiment-design

You might also like

flutter-development

ui-ux-pro-max

drawio-diagrams-enhanced

godot

nano-banana-pro

pdf-to-markdown

Related MCP Servers