A lightweight MCP (Model Context Protocol) server designed for seamlessly integrating ComIDP with Claude Desktop.
Updated Sep 17, 2025 - Python
My dissertation project, MarinEnGPT: an intelligent assistant for marine engineering manuals.
This Odoo module streamlines data extraction from supplier invoices and expenses and automates contact info pre-filling, utilizing NTBIES services for enhanced operational efficiency.
Self-hostable PDF parsing engine for structured text and table extraction. Works locally or via Docker.
DigiParser Documentation and API reference
PDF ETL toolkit with table extraction, CSV/JSONL export, a multi-provider detection cascade, and a Streamlit UI.
Production-ready code examples for integrating the ParserData API to extract structured financial data from invoices, receipts, and statements.
🤖 AI-Powered Document Automation Platform (MVP): A production-grade RAG & Document Governance system built to automate high-volume document workflows (Legal, Finance, HR). Features intelligent boundary detection, multi-model routing (Gemini/Mistral/Phi-2), and automated audit reporting to ensure context fidelity and fewer hallucinations.
Data Extraction from PDF Readme
Official website and download page for Valido - Professional PDF validation and data extraction tool for Windows
A platform for extracting structured product data from PDF documents using NLP and storing it in a PostgreSQL database.
Python tool for extracting named sections from structured PDFs using TOC detection and keyword scanning, built for IFAD development research workflows.