mistral-ocr-mcp
mistral-ocr-mcp is a Python library that utilizes OCR (Optical Character Recognition) to extract text from images. By integrating it into automated workflows, it reduces manual data entry and enhances efficiency. It is particularly useful for simplifying document management and data collection processes.
GitHub Stars
0
User Rating
Not Rated
Favorites
0
Views
19
Forks
0
Issues
0
Mistral OCR MCP Server
A Model Context Protocol (MCP) server that enables Claude to perform OCR (Optical Character Recognition) on local files using Mistral AI's document processing capabilities.
Setup
Get a Mistral API key from Mistral AI Console
Add to Claude Desktop config (
~/Library/Application Support/Claude/claude_desktop_config.json):
{
"mcpServers": {
"mistral-ocr": {
"command": "uvx",
"args": ["--from", "/path/to/mistral-ocr-mcp", "mistral-ocr-mcp"],
"env": {
"MISTRAL_API_KEY": "your_api_key_here"
}
}
}
}
Replace /path/to/mistral-ocr-mcp with the actual path to this directory.
Usage
ocr_local_file
Process local files with OCR and convert to markdown format.
file_path: Path to the local file to processoutput_path: Optional output path for markdown file (defaults to same name with .md extension)include_image_base64: Whether to include base64 encoded images in response
Examples
Process this document with OCR: /path/to/document.pdf
Extract text from this image: /path/to/image.jpg
OCR this file and save to custom location: /path/to/input.png with output /path/to/output.md
Troubleshooting
- API key error: Set
MISTRAL_API_KEYin your environment - File not found: Check that the file path exists and is accessible
- Unsupported format: Ensure the file is a supported image or document format
- Rate limit: Wait and try again