kreuzberg

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.

GitHub Stars

2,340

User Rating

Not Rated

Favorites

0

Views

80

Forks

95

Issues

5