
Vectorize
Connects Claude to Vectorize.io's vector database services for document retrieval and text extraction. Requires Vectorize API credentials to access their vector search capabilities.
Bridges Claude with Vectorize.io's vector database services for advanced document retrieval, text extraction, and research capabilities through TypeScript-based tools that handle authentication via organization IDs and API tokens.
What it does
- Query vector databases for document similarity search
- Extract text content from various document formats
- Retrieve semantically similar documents
- Search through vectorized document collections
- Access Vectorize.io pipeline configurations
About Vectorize
Vectorize is an official MCP server published by vectorize-io that gives AI assistants tools and capabilities via the Model Context Protocol. It connects Claude with Vectorize.io's vector database to extract text from images and enable advanced retrieval for research. It is categorized under AI/ML and Analytics/Data.
How to install
You can install Vectorize in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.
License
Vectorize is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.
Vectorize MCP Server
A Model Context Protocol (MCP) server implementation that integrates with Vectorize for advanced vector retrieval and text extraction.
Installation
Running with npx
export VECTORIZE_ORG_ID=YOUR_ORG_ID
export VECTORIZE_TOKEN=YOUR_TOKEN
export VECTORIZE_PIPELINE_ID=YOUR_PIPELINE_ID
npx -y @vectorize-io/vectorize-mcp-server@latest
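Before launching the server, it can help to confirm that the three variables exported above are actually set. The following is a minimal sketch, assuming all three are required; `missingVectorizeEnv` is a hypothetical helper written for illustration and is not part of the Vectorize package.

```typescript
// Names of the environment variables exported in the commands above.
const REQUIRED_VARS = [
  "VECTORIZE_ORG_ID",
  "VECTORIZE_TOKEN",
  "VECTORIZE_PIPELINE_ID",
] as const;

type Env = Record<string, string | undefined>;

// Hypothetical helper (assumption, not part of the server):
// returns the names of required variables that are unset or blank.
function missingVectorizeEnv(env: Env): string[] {
  return REQUIRED_VARS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Example with an inline object; a real script would pass process.env instead.
const missing = missingVectorizeEnv({ VECTORIZE_ORG_ID: "my-org" });
if (missing.length > 0) {
  console.log(`Missing: ${missing.join(", ")}`);
}
```

Taking the environment as a parameter rather than reading it directly keeps the check easy to exercise with test values.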
VS Code Installation
Manual Installation
Add the following JSON block to your User Settings (JSON) file in VS Code. To open it, press Ctrl + Shift + P and run Preferences: Open User Settings (JSON).
{
  "mcp": {
    "inputs": [
      {
        "type": "promptString",
        "id": "org_id",
        "description": "Vectorize Organization ID"
      },
      {
        "type": "promptString",
        "id": "token",
        "description": "Vectorize Token",
        "password": true
      },
      {
        "type": "promptString",
        "id": "pipeline_id",
        "description": "Vectorize Pipeline ID"
      }
    ],
    "servers": {
      "vectorize": {
        "command": "npx",
        "args": ["-y", "@vectorize-io/vectorize-mcp-server@latest"],
        "env": {
          "VECTORIZE_ORG_ID": "${input:org_id}",
          "VECTORIZE_TOKEN": "${input:token}",
          "VECTORIZE_PIPELINE_ID": "${input:pipeline_id}"
        }
      }
    }
  }
}
Optionally, you can add the following to a file called .vscode/mcp.json in your workspace to share the configuration with others:
{
  "inputs": [
    {
      "type": "promptString",
      "id": "org_id",
      "description": "Vectorize Organization ID"
    },
    {
      "type": "promptString",
      "id": "token",
      "description": "Vectorize Token",
      "password": true
    },
    {
      "type": "promptString",
      "id": "pipeline_id",
      "description": "Vectorize Pipeline ID"
    }
  ],
  "servers": {
    "vectorize": {
      "command": "npx",
      "args": ["-y", "@vectorize-io/vectorize-mcp-server@latest"],
      "env": {
        "VECTORIZE_ORG_ID": "${input:org_id}",
        "VECTORIZE_TOKEN": "${input:token}",
        "VECTORIZE_PIPELINE_ID": "${input:pipeline_id}"
      }
    }
  }
}
Configuration on Claude/Windsurf/Cursor/Cline
{
  "mcpServers": {
    "vectorize": {
      "command": "npx",
      "args": ["-y", "@vectorize-io/vectorize-mcp-server@latest"],
      "env": {
        "VECTORIZE_ORG_ID": "your-org-id",
        "VECTORIZE_TOKEN": "your-token",
        "VECTORIZE_PIPELINE_ID": "your-pipeline-id"
      }
    }
  }
}
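If you manage several client configurations, generating this block from values can avoid copy-paste mistakes. The sketch below reproduces the structure shown above; `buildVectorizeConfig` and the `McpServerEntry` type are hypothetical names introduced here for illustration, not an API provided by Vectorize or any MCP client.

```typescript
// Shape of one server entry, mirroring the JSON shown above.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

// Hypothetical helper (assumption): builds the "mcpServers" block
// for the Vectorize server from the three credential values.
function buildVectorizeConfig(
  orgId: string,
  token: string,
  pipelineId: string
): { mcpServers: Record<string, McpServerEntry> } {
  return {
    mcpServers: {
      vectorize: {
        command: "npx",
        args: ["-y", "@vectorize-io/vectorize-mcp-server@latest"],
        env: {
          VECTORIZE_ORG_ID: orgId,
          VECTORIZE_TOKEN: token,
          VECTORIZE_PIPELINE_ID: pipelineId,
        },
      },
    },
  };
}

// Serialize with two-space indentation, ready to paste into a client config file.
const configJson = JSON.stringify(
  buildVectorizeConfig("your-org-id", "your-token", "your-pipeline-id"),
  null,
  2
);
console.log(configJson);
```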
Tools
Retrieve documents
Perform vector search and retrieve documents (see official API):
{
  "name": "retrieve",
  "arguments": {
    "question": "Financial health of the company",
    "k": 5
  }
}
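MCP clients normally send a tool call like the one above wrapped in a JSON-RPC 2.0 request with the method `tools/call`. The sketch below shows that envelope for the `retrieve` tool; `toolCallRequest` is a hypothetical helper for illustration, and in practice an MCP client library would build and send this message for you.

```typescript
// JSON-RPC 2.0 envelope for an MCP tool invocation.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper (assumption): wraps a tool name and its arguments
// in the request shape an MCP client sends to the server.
function toolCallRequest(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Same name and arguments as the retrieve example above.
const retrieveRequest = toolCallRequest(1, "retrieve", {
  question: "Financial health of the company",
  k: 5,
});
console.log(JSON.stringify(retrieveRequest, null, 2));
```

The same wrapper applies to the other tools; only the `name` and `arguments` change.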
Text extraction and chunking (Any file to Markdown)
Extract text from a document and chunk it into Markdown format (see official API):
{
  "name": "extract",
  "arguments": {
    "base64document": "base64-encoded-document",
    "contentType": "application/pdf"
  }
}
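The `base64document` field expects the file's bytes encoded as base64. The following is a minimal sketch of producing that value and assembling the `extract` arguments; `toBase64` and `extractArguments` are hypothetical helpers for illustration, and the sample bytes are a stand-in for a real file's contents.

```typescript
// Encode raw bytes as base64 using the global btoa
// (available in browsers and in recent Node.js versions).
function toBase64(bytes: Uint8Array): string {
  let binary = "";
  // Build a binary string one byte at a time; this avoids the argument-count
  // limits that spreading a large array into String.fromCharCode can hit.
  bytes.forEach((byte) => {
    binary += String.fromCharCode(byte);
  });
  return btoa(binary);
}

// Hypothetical helper (assumption): assembles the arguments object
// shown in the extract example above.
function extractArguments(bytes: Uint8Array, contentType: string) {
  return { base64document: toBase64(bytes), contentType };
}

// Stand-in bytes (the ASCII text "hello"); a real call would use the bytes
// of the document being extracted, for example as read from disk.
const sampleBytes = new Uint8Array([104, 101, 108, 108, 111]);
const extractArgs = extractArguments(sampleBytes, "application/pdf");
console.log(JSON.stringify({ name: "extract", arguments: extractArgs }, null, 2));
```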
Deep Research
Generate a private deep research report from your pipeline (see official API):
{
  "name": "deep-research",
  "arguments": {
    "query": "Generate a financial status report about the company",
    "webSearch": true
  }
}
Development
npm install
npm run dev
Release
Change the package.json version and then:
git commit -am "x.y.z"
git tag x.y.z
git push origin
git push origin --tags
Contributing
- Fork the repository
- Create your feature branch
- Submit a pull request