mistral-ocr-mcp

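mistral-ocr-mcp is a Python library that uses OCR (optical character recognition) to extract text from images. Integrated into automated workflows, it can reduce manual data entry and improve efficiency. It is particularly useful for simplifying document management and data collection processes.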

Mistral OCR MCP Server

A Model Context Protocol (MCP) server that enables Claude to perform OCR (Optical Character Recognition) on local files using Mistral AI's document processing capabilities.

Setup
  1. Get a Mistral API key from the Mistral AI Console.

  2. Add the server to the Claude Desktop config (on macOS: ~/Library/Application Support/Claude/claude_desktop_config.json):

{
  "mcpServers": {
    "mistral-ocr": {
      "command": "uvx",
      "args": ["--from", "/path/to/mistral-ocr-mcp", "mistral-ocr-mcp"],
      "env": {
        "MISTRAL_API_KEY": "your_api_key_here"
      }
    }
  }
}

Replace /path/to/mistral-ocr-mcp with the actual path to this directory.
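
If the config file already lists other MCP servers, the entry above has to be merged in rather than pasted over the file. The following is a minimal sketch of that merge, not part of this project: the helper names `add_mistral_ocr_entry` and `update_config_file` are hypothetical, and the code assumes the config file contains a single JSON object.

```python
import json
from pathlib import Path


def add_mistral_ocr_entry(config: dict, server_dir: str, api_key: str) -> dict:
    """Return a copy of `config` with the mistral-ocr entry added.

    Hypothetical helper for illustration; other servers are left untouched.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["mistral-ocr"] = {
        "command": "uvx",
        "args": ["--from", server_dir, "mistral-ocr-mcp"],
        "env": {"MISTRAL_API_KEY": api_key},
    }
    merged["mcpServers"] = servers
    return merged


def update_config_file(config_path: Path, server_dir: str, api_key: str) -> None:
    """Read the config if it exists, add the entry, and write it back."""
    existing = json.loads(config_path.read_text()) if config_path.exists() else {}
    updated = add_mistral_ocr_entry(existing, server_dir, api_key)
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(updated, indent=2) + "\n")


# Example call (macOS location from step 2; not executed here):
# update_config_file(
#     Path.home() / "Library" / "Application Support" / "Claude"
#     / "claude_desktop_config.json",
#     "/path/to/mistral-ocr-mcp",
#     "your_api_key_here",
# )
```

Restart Claude Desktop after editing the file so that it reloads the server list.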

Usage
ocr_local_file

Processes a local file with OCR and converts the result to Markdown.

  • file_path: Path to the local file to process
  • output_path: Optional output path for the Markdown file (defaults to the same name with a .md extension)
  • include_image_base64: Whether to include base64 encoded images in response
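
The output_path default can be illustrated with a short sketch. It assumes that "same name with .md extension" means the original extension is replaced (document.pdf becomes document.md); the server's actual behavior may differ, and `default_output_path` is a hypothetical name used only here.

```python
from pathlib import Path
from typing import Optional


def default_output_path(file_path: str, output_path: Optional[str] = None) -> Path:
    """Pick where the Markdown would be written.

    Assumption: when output_path is omitted, the input file's extension
    is swapped for .md in the same directory.
    """
    if output_path is not None:
        return Path(output_path)
    return Path(file_path).with_suffix(".md")


print(default_output_path("/path/to/document.pdf"))
print(default_output_path("/path/to/input.png", "/path/to/output.md"))
```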

Examples
Process this document with OCR: /path/to/document.pdf

Extract text from this image: /path/to/image.jpg

OCR this file and save to custom location: /path/to/input.png with output /path/to/output.md
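
Behind prompts like these, an MCP client sends a JSON-RPC `tools/call` request to the server. The sketch below builds such a request for the third example. The `build_tool_call` helper is hypothetical, and which optional arguments Claude actually sends for a given prompt is an assumption.

```python
import json
from typing import Optional


def build_tool_call(
    request_id: int,
    file_path: str,
    output_path: Optional[str] = None,
    include_image_base64: Optional[bool] = None,
) -> dict:
    """Build an MCP tools/call request for ocr_local_file (illustrative only)."""
    arguments = {"file_path": file_path}
    # Optional parameters are sent only when the caller supplies them.
    if output_path is not None:
        arguments["output_path"] = output_path
    if include_image_base64 is not None:
        arguments["include_image_base64"] = include_image_base64
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "ocr_local_file", "arguments": arguments},
    }


request = build_tool_call(1, "/path/to/input.png", output_path="/path/to/output.md")
print(json.dumps(request, indent=2))
```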

Troubleshooting
  • API key error: Set MISTRAL_API_KEY in the env block of the Claude Desktop config, or in your environment
  • File not found: Check that the file path exists and is accessible
  • Unsupported format: Ensure the file is a supported image or document format
  • Rate limit: Wait and try again
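
The checks above can be sketched as a small pre-flight routine plus a retry helper. This is not code from this project: `preflight`, `retry_with_backoff`, and `ASSUMED_EXTENSIONS` are hypothetical names, the extension list is a guess because the README does not enumerate supported formats, and how a rate-limit error is recognized is left to the caller.

```python
import os
import time
from pathlib import Path

# Assumption for illustration: the README does not list supported formats.
ASSUMED_EXTENSIONS = {".pdf", ".png", ".jpg", ".jpeg"}


def preflight(file_path: str, env=None) -> list:
    """Return a list of problems found before calling the OCR tool."""
    env = os.environ if env is None else env
    problems = []
    if not env.get("MISTRAL_API_KEY"):
        problems.append("API key error: MISTRAL_API_KEY is not set")
    path = Path(file_path)
    if not path.is_file():
        problems.append(f"File not found: {file_path}")
    elif path.suffix.lower() not in ASSUMED_EXTENSIONS:
        problems.append(f"Unsupported format: {path.suffix or '(no extension)'}")
    return problems


def retry_with_backoff(call, is_rate_limited, attempts=4, base_delay=1.0,
                       sleep=time.sleep):
    """Run call(); on a rate-limit error, wait and retry with doubling delays."""
    for attempt in range(attempts):
        try:
            return call()
        except Exception as exc:
            # Re-raise anything that is not a rate limit, or the final failure.
            if not is_rate_limited(exc) or attempt == attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.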