Invoice Parser
OpenClaw plugin to parse PDF invoices: extract line items, totals, vendor info, tables, and structured data from invoice PDFs using OCR and table detection. Powered by the PDFAPIHub API. Get a free API key at pdfapihub.com.
# Invoice Parser - OpenClaw Plugin
Parse, extract, and convert invoice PDFs using the [PDFAPIHub](https://pdfapihub.com) API. This OpenClaw plugin gives your AI agent the ability to read invoices, extract line items and tables, OCR scanned invoices, and export data to Excel and CSV.
## What It Does
Extract structured data from invoice PDFs: text, tables, line items, totals, and metadata. Handle both digital and scanned invoices with OCR. Export extracted data to Excel and CSV for accounting, analysis, and bookkeeping.
### Features
- **Full Invoice Parsing**: Extract text, layout, and table data in a single call
- **Table Extraction**: Pull out line items, pricing tables, and itemized charges
- **Excel Export**: Export invoice tables directly to XLSX spreadsheets
- **CSV Export**: Export to CSV for database import and accounting software
- **Scanned Invoice OCR**: Read text from scanned or image-based invoice PDFs at 300 DPI
- **Multi-Language OCR**: English, German, French, Spanish, and many more
- **Page Selection**: Parse specific pages or the entire document
- **PDF Metadata**: Check page count, file size, encryption, author, and creation date
## Tools
| Tool | Description |
|------|-------------|
| `parse_invoice` | Parse invoice PDF: extract text, layout, tables, or all |
| `extract_invoice_tables` | Extract tables to Excel (XLSX) |
| `ocr_scanned_invoice` | OCR scanned invoice PDFs |
| `invoice_to_csv` | Extract tables to CSV |
| `invoice_info` | Get invoice PDF metadata |
## Installation
```bash
openclaw plugins install clawhub:pdfapihub-invoice-parser
```
## Configuration
Add your API key in `~/.openclaw/openclaw.json`:
```json
{
  "plugins": {
    "entries": {
      "pdfapihub-invoice-parser": {
        "enabled": true,
        "env": {
          "PDFAPIHUB_API_KEY": "your-api-key-here"
        }
      }
    }
  }
}
```
Get your **free API key** at [https://pdfapihub.com](https://pdfapihub.com).
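A quick way to confirm the configuration above is in place is a small script that reads the config file and checks the plugin entry. The following is a minimal sketch, not part of the plugin: the function names and status strings are hypothetical, and it assumes `~/.openclaw/openclaw.json` is plain JSON without comments.

```python
import json
from pathlib import Path

PLUGIN_ID = "pdfapihub-invoice-parser"
PLACEHOLDER = "your-api-key-here"  # the sample value from the config snippet


def api_key_status(config: dict) -> str:
    """Return 'ok', 'disabled', 'placeholder', or 'missing' for the plugin entry.

    Hypothetical helper: the status names are illustrative only.
    """
    entry = config.get("plugins", {}).get("entries", {}).get(PLUGIN_ID)
    if entry is None:
        return "missing"
    if not entry.get("enabled", False):
        return "disabled"
    key = entry.get("env", {}).get("PDFAPIHUB_API_KEY", "")
    if not key:
        return "missing"
    if key == PLACEHOLDER:
        return "placeholder"
    return "ok"


def check_config_file(path: Path = Path.home() / ".openclaw" / "openclaw.json") -> str:
    """Load the config file (assumed to be plain JSON) and report the key status."""
    if not path.exists():
        return "missing"
    return api_key_status(json.loads(path.read_text(encoding="utf-8")))


if __name__ == "__main__":
    print(check_config_file())
```

A result of `placeholder` means the sample value from the snippet was never replaced with a real key.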
## Usage Examples
Just ask your OpenClaw agent:
- *"Parse this invoice and show me the line items"*
- *"Extract the invoice table to an Excel file"*
- *"OCR this scanned invoice and extract the text"*
- *"Export the invoice data as CSV"*
- *"How many pages is this invoice?"*
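The agent decides which tool to call for each request. As a rough guide, the prompts above line up with the tools in the Tools table as sketched below. The tool names come from that table, but the pairing is an inference from the tool descriptions, not documented behavior, and `likely_tool` is a hypothetical helper.

```python
from typing import Optional

# Tool names are from the Tools table; the prompt-to-tool pairing is an
# inference from each tool's description, not a documented guarantee.
LIKELY_TOOL_FOR_PROMPT = {
    "Parse this invoice and show me the line items": "parse_invoice",
    "Extract the invoice table to an Excel file": "extract_invoice_tables",
    "OCR this scanned invoice and extract the text": "ocr_scanned_invoice",
    "Export the invoice data as CSV": "invoice_to_csv",
    "How many pages is this invoice?": "invoice_info",
}


def likely_tool(prompt: str) -> Optional[str]:
    """Look up the tool most likely to serve one of the example prompts."""
    return LIKELY_TOOL_FOR_PROMPT.get(prompt)


if __name__ == "__main__":
    for prompt, tool in LIKELY_TOOL_FOR_PROMPT.items():
        print(f"{tool:24} <- {prompt}")
```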
## Use Cases
- **Accounts Payable**: Extract invoice data for automated payment processing
- **Bookkeeping**: Parse invoices and export to Excel/CSV for accounting records
- **Expense Management**: Extract line items and amounts from vendor invoices
- **Audit Trails**: Parse and archive invoice data for compliance and auditing
- **Data Entry Automation**: Replace manual invoice data entry with automated extraction
- **ERP Integration**: Export invoice data in CSV format for ERP system import
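For bookkeeping or ERP workflows like those above, an exported CSV usually needs a small post-processing step before import. The sketch below totals a line-item column with Python's standard library. The column name `Amount` and the sample rows are made up for illustration; the actual layout of the exported CSV depends on the tables in your invoice.

```python
import csv
import io
from decimal import Decimal


def total_line_items(csv_text: str, amount_column: str = "Amount") -> Decimal:
    """Sum one numeric column of an exported invoice CSV.

    `amount_column` is an assumed header name; adjust it to match your export.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    total = Decimal("0")
    for row in reader:
        # Strip thousands separators and a leading dollar sign before parsing.
        raw = (row.get(amount_column) or "").replace(",", "").replace("$", "").strip()
        if raw:
            total += Decimal(raw)
    return total


# Illustrative data only, not real plugin output.
sample = """Description,Quantity,Amount
Widget,2,"1,200.00"
Shipping,1,35.50
"""

print(total_line_items(sample))  # prints 1235.50
```

`Decimal` is used instead of `float` so currency amounts are summed without binary rounding error.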
## API Documentation
Full API docs: [https://pdfapihub.com/docs](https://pdfapihub.com/docs)
## License
MIT