collecting-infrastructure-metrics
Collect comprehensive infrastructure performance metrics across compute, storage, network, containers, load balancers, and databases. Use when monitoring system performance or troubleshooting infrastructure issues. Trigger with phrases like "collect infrastructure metrics", "monitor server performance", or "track system resources".
Install
mkdir -p .claude/skills/collecting-infrastructure-metrics && curl -L -o skill.zip "https://mcp.directory/api/skills/download/8719" && unzip -o skill.zip -d .claude/skills/collecting-infrastructure-metrics && rm skill.zipInstalls to .claude/skills/collecting-infrastructure-metrics
About this skill
Infrastructure Metrics Collector
Collect and centralize infrastructure metrics across compute, storage, network, containers, load balancers, and databases using Prometheus, Datadog, or CloudWatch.
Overview
This skill automates the process of setting up infrastructure metrics collection. It identifies key performance indicators (KPIs) across various infrastructure layers, configures agents to collect these metrics, and assists in setting up central aggregation and visualization.
How It Works
- Identify Infrastructure Layers: Determines the infrastructure layers to monitor (compute, storage, network, containers, load balancers, databases).
- Configure Metrics Collection: Sets up agents (Prometheus, Datadog, CloudWatch) to collect metrics from the identified layers.
- Aggregate Metrics: Configures central aggregation of the collected metrics for analysis and visualization.
- Create Dashboards: Generates infrastructure dashboards for health monitoring, performance analysis, and capacity tracking.
When to Use This Skill
This skill activates when you need to:
- Monitor the performance of your infrastructure.
- Identify bottlenecks in your system.
- Set up dashboards for real-time monitoring.
Examples
Example 1: Setting up basic monitoring
User request: "Collect infrastructure metrics for my web server."
The skill will:
- Identify compute, storage, and network layers relevant to the web server.
- Configure Prometheus to collect CPU, memory, disk I/O, and network bandwidth metrics.
Example 2: Troubleshooting database performance
User request: "I'm seeing slow database queries. Can you help me monitor the database performance?"
The skill will:
- Identify the database layer and relevant metrics such as connection pool usage, replication lag, and cache hit rates.
- Configure Datadog to collect these metrics and create a dashboard to visualize performance trends.
Best Practices
- Agent Selection: Choose the appropriate agent (Prometheus, Datadog, CloudWatch) based on your existing infrastructure and monitoring tools.
- Metric Granularity: Balance the granularity of metrics collection with the storage and processing overhead. Collect only the essential metrics for your use case.
- Alerting: Configure alerts based on thresholds for key metrics to proactively identify and address performance issues.
Integration
This skill can be integrated with other plugins for deployment, configuration management, and alerting to provide a comprehensive infrastructure management solution. For example, it can be used with a deployment plugin to automatically configure metrics collection after deploying new infrastructure.
Prerequisites
- Access to infrastructure monitoring systems (Prometheus, Datadog, CloudWatch)
- System permissions for metrics agent installation
- Network access to monitored infrastructure components
- Storage for metrics data in ${CLAUDE_SKILL_DIR}/metrics/
Instructions
- Identify infrastructure layers to monitor (compute, storage, network, databases)
- Select appropriate metrics collection agent based on environment
- Configure agent with target endpoints and metric types
- Set up central aggregation for collected metrics
- Create dashboards for visualization
- Configure alerts for critical metrics thresholds
Output
- Metrics collection configuration files
- Agent installation and setup scripts
- Dashboard definitions for infrastructure monitoring
- Metric export configurations
- Alert rules for critical thresholds
Error Handling
If metrics collection fails:
- Verify agent installation and permissions
- Check network connectivity to targets
- Validate authentication credentials
- Review firewall and security group rules
- Confirm metric endpoint availability
Resources
- Prometheus documentation for metric collection
- Datadog agent configuration guides
- AWS CloudWatch metrics reference
- Infrastructure monitoring best practices
More by jeremylongshore
View all skills by jeremylongshore →You might also like
flutter-development
aj-geddes
Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.
drawio-diagrams-enhanced
jgtolentino
Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.
ui-ux-pro-max
nextlevelbuilder
"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."
godot
bfollington
This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.
nano-banana-pro
garg-aayush
Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.
pdf-to-markdown
aliceisjustplaying
Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.
Related MCP Servers
Browse all serversOptimize Facebook ad campaigns with AI-driven insights, creative analysis, and campaign control in Meta Ads Manager for
Integrate seamlessly with the Mux Video and Data Platform for easy video uploads, live stream control, analytics, and pe
Scorecard: Evaluate and optimize LLM systems with thorough testing, actionable metrics, and performance insights to impr
Boost Postgres performance with Postgres MCP Pro—AI-driven index tuning, health checks, and safe, intelligent SQL optimi
Connect MongoDB databases to chat interfaces. Manage AWS with MongoDB, explore Atlas cost, and inspect collections secur
Cloudflare Workers empowers MCP to deploy scalable, low-latency AI services at the network edge for optimal performance.
Stay ahead of the MCP ecosystem
Get weekly updates on new skills and servers.