textin-mcp

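TextIn OCR MCP is a tool for extracting text from documents and performing OCR. It supports document text recognition, ID recognition, and invoice recognition, and can also convert documents to Markdown. It handles several file formats (PDF, images, and Microsoft Office documents) and can automatically identify and extract user-specified information, which can make document processing more efficient.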


TextIn OCR MCP Server

TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

Tools
  • recognition_text

    • Text recognition from images, Word documents, and PDF files.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Text of the document.
    • Supports conversion for:
      • PDF
      • Image (JPEG, JPG, PNG, BMP)
  • doc_to_markdown

    • Convert images, PDFs, and Word documents to Markdown.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Markdown of the document.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (JPEG, JPG, PNG, BMP)
  • general_information_extration

    • Automatically identify and extract information from documents, or identify and extract user-specified information.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
      • key (string[], optional): The non-tabular text fields to identify, given as an array of strings.
      • table_header (string[], optional): The table headers to identify, given as an array of strings.
    • Return: JSON containing the extracted key information.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (JPEG, JPG, PNG, BMP)
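
The format lists above can be captured as a small lookup table, for instance to pre-check a file before passing it to a tool. The following Python sketch is only an illustration and is not part of the server. The helper names `format_family` and `is_listed_as_supported` are hypothetical, and the extension mapping is an assumption, since the README names formats rather than file extensions.

```python
from pathlib import PurePosixPath
from typing import Optional
from urllib.parse import urlparse

# Format families per tool, copied from the "Supports conversion for" lists.
SUPPORTED = {
    "recognition_text": {"pdf", "image"},
    "doc_to_markdown": {"pdf", "office", "image"},
    "general_information_extration": {"pdf", "office", "image"},
}

# ASSUMPTION: this extension-to-family mapping is illustrative only.
EXTENSIONS = {
    ".pdf": "pdf",
    ".jpeg": "image", ".jpg": "image", ".png": "image", ".bmp": "image",
    ".doc": "office", ".docx": "office", ".xls": "office", ".xlsx": "office",
}


def format_family(path: str) -> Optional[str]:
    """Return the format family for a local path or URL, or None if unknown."""
    parsed = urlparse(path)
    # For HTTP/HTTPS URLs, inspect only the path part so query strings are ignored.
    name = parsed.path if parsed.scheme in ("http", "https") else path
    return EXTENSIONS.get(PurePosixPath(name).suffix.lower())


def is_listed_as_supported(tool: str, path: str) -> bool:
    """Check a file against the format list documented for the given tool."""
    family = format_family(path)
    return family is not None and family in SUPPORTED.get(tool, set())
```

A check like this only mirrors the documented lists. The server itself remains the authority on what it accepts.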

When the input is a URL, protected resources (for example, URLs that require authentication) are not supported.
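
For readers who want to see what a call to one of these tools looks like on the wire, here is a minimal Python sketch that builds an MCP `tools/call` request (JSON-RPC 2.0) from the inputs described above. MCP clients normally construct this message for you. The function name `build_tool_call` and its validation rules are assumptions made for illustration. Note that a scheme check cannot tell whether a URL is protected; it only rejects URL schemes other than HTTP/HTTPS.

```python
import json
from urllib.parse import urlparse

# Hypothetical helper: not part of the TextIn server or any MCP SDK.
EXTRACTION_TOOL = "general_information_extration"


def build_tool_call(request_id, tool, path, key=None, table_header=None):
    """Build an MCP `tools/call` JSON-RPC 2.0 request as a plain dict."""
    # The README documents local file paths and HTTP/HTTPS URLs only.
    if "://" in path and urlparse(path).scheme.lower() not in ("http", "https"):
        raise ValueError("only HTTP/HTTPS URLs or local file paths are documented")

    arguments = {"path": path}
    # `key` and `table_header` are documented only for the extraction tool.
    if (key is not None or table_header is not None) and tool != EXTRACTION_TOOL:
        raise ValueError("key/table_header apply only to " + EXTRACTION_TOOL)
    if key is not None:
        arguments["key"] = list(key)
    if table_header is not None:
        arguments["table_header"] = list(table_header)

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_tool_call(
        1,
        EXTRACTION_TOOL,
        "https://example.com/invoice.pdf",  # placeholder URL
        key=["Invoice number"],
        table_header=["Item", "Amount"],
    )
    print(json.dumps(request, indent=2))
```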

Setup
APP_ID and APP_SECRET

Click here to register for a TextIn account.

Get your TextIn APP_ID and APP_SECRET by following the instructions here.

NPX
{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}
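
If you prefer to generate this configuration rather than edit it by hand, a short script can fill in the credentials and catch placeholders left unreplaced. The Python sketch below is one possible approach. The function name `render_config` is hypothetical, and the timeout values are copied unchanged from the configuration above.

```python
import json


def render_config(app_id: str, app_secret: str) -> str:
    """Return the NPX configuration as a JSON string with credentials filled in."""
    for name, value in (("APP_ID", app_id), ("APP_SECRET", app_secret)):
        # Reject empty values and the "<YOUR_...>" placeholders from the README.
        if not value or value.startswith("<YOUR_"):
            raise ValueError(f"{name} is missing or still a placeholder")

    config = {
        "mcpServers": {
            "textin-ocr": {
                "command": "npx",
                "args": ["-y", "@intsig/server-textin"],
                "env": {
                    "APP_ID": app_id,
                    "APP_SECRET": app_secret,
                    "MCP_SERVER_REQUEST_TIMEOUT": "600000",
                },
                "timeout": 600,
            }
        }
    }
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    # Placeholder credentials for demonstration only.
    print(render_config("example-app-id", "example-app-secret"))
```

Where the resulting JSON belongs depends on your MCP client, so check its documentation for the configuration file location.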
License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Author
IntSig

To try TextIn document parsing, visit https://cc.co/16YSIy
