Pdf Ocr

Name: Pdf Ocr
Rating: 3.5 (1 review)
Author: PdfApiHub

OpenClaw plugin for OCR text extraction from scanned PDFs and images — Tesseract OCR with 100+ languages, configurable DPI, word-level bounding boxes, character whitelisting, image preprocessing (grayscale, sharpen, threshold, resize), and document scanning with perspective correction. Powered by PDFAPIHub API. Free API key at pdfapihub.com


# PDF & Image OCR — OpenClaw Plugin

Extract text from scanned PDFs and images using the [PDFAPIHub](https://pdfapihub.com) API. This OpenClaw plugin provides Tesseract OCR with 100+ languages, document photo enhancement, and visual document comparison.

## What It Does

Read text from scanned documents and photos using OCR, clean up document photos into professional scans, and compare documents for visual similarity.

### Features

- **PDF OCR** — Rasterise and OCR scanned PDFs with configurable DPI (72-400)
- **Image OCR** — OCR photos of receipts, documents, signs, business cards, meter readings
- **100+ Languages** — Tesseract language packs, combine with `+` (e.g. `eng+hin+fra`)
- **Word-Level Bounding Boxes** — Per-word positions and confidence scores
- **Character Whitelisting** — Restrict to digits only for invoice amounts or meter readings
- **Image Preprocessing** — Grayscale, sharpen, threshold, resize/upscale for noisy inputs
- **Document Scan Enhancement** — Edge detection, perspective correction, brightness/contrast
- **Color Modes** — B&W (best for text), grayscale, enhanced color
- **PDF Output** — Export scanned documents as single-page PDFs
- **Document Comparison** — Visual similarity scoring with feature matching, SSIM, or phash
- **Confidence Scores** — Per-page and per-word OCR confidence percentages
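
The language and DPI options above can be prepared before a request is made. The following is a minimal sketch, not part of the plugin: `build_language_spec` and `clamp_dpi` are hypothetical helper names, and the only facts taken from this README are the `+` separator for language packs and the 72-400 DPI range.

```python
MIN_DPI, MAX_DPI = 72, 400  # DPI range stated in the feature list


def build_language_spec(codes):
    """Join Tesseract language codes with '+', e.g. ['eng', 'hin'] -> 'eng+hin'."""
    cleaned = [code.strip() for code in codes if code.strip()]
    if not cleaned:
        raise ValueError("at least one language code is required")
    for code in cleaned:
        # '+' is the separator, so it cannot appear inside a single code.
        if "+" in code or " " in code:
            raise ValueError(f"invalid language code: {code!r}")
    # Drop duplicates while preserving the caller's order.
    return "+".join(dict.fromkeys(cleaned))


def clamp_dpi(dpi):
    """Clamp a requested DPI into the documented 72-400 range."""
    return max(MIN_DPI, min(MAX_DPI, int(dpi)))
```

For example, `build_language_spec(["eng", "hin", "fra"])` yields `eng+hin+fra`, and `clamp_dpi(600)` yields `400`. Whether the service rejects or clamps out-of-range DPI values is not stated here, so clamping on the client side is only one possible choice.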

## Tools

| Tool | Description |
|------|-------------|
| `ocr_pdf` | OCR scanned PDFs with multi-language Tesseract |
| `ocr_image` | OCR images with preprocessing options |
| `scan_enhance` | Clean up document photos into professional scans |
| `compare_documents` | Compare two images/PDFs for visual similarity |

## Installation

```bash
openclaw plugins install clawhub:pdf-ocr
```

## Configuration

Add your API key in `~/.openclaw/openclaw.json`:

```json
{
  "plugins": {
    "entries": {
      "pdf-ocr": {
        "enabled": true,
        "env": {
          "PDFAPIHUB_API_KEY": "your-api-key-here"
        }
      }
    }
  }
}
```

Get your **free API key** at [https://pdfapihub.com](https://pdfapihub.com).
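
If the plugin does not pick up the key, it can help to check the configuration file directly. The sketch below assumes the file is plain JSON with exactly the structure shown above; `load_plugin_api_key` is a hypothetical helper for illustration and is not shipped with OpenClaw or this plugin.

```python
import json
from pathlib import Path


def load_plugin_api_key(config_path, plugin="pdf-ocr", var="PDFAPIHUB_API_KEY"):
    """Return the API key configured for the plugin, or raise with a clear reason."""
    path = Path(config_path).expanduser()
    config = json.loads(path.read_text(encoding="utf-8"))

    # Walk plugins -> entries -> <plugin>, mirroring the JSON shown above.
    entry = config.get("plugins", {}).get("entries", {}).get(plugin)
    if entry is None:
        raise KeyError(f"no plugin entry named {plugin!r} in {path}")
    if not entry.get("enabled", False):
        raise ValueError(f"plugin {plugin!r} is present but not enabled")

    key = entry.get("env", {}).get(var, "")
    # The README placeholder value is treated as "not configured".
    if not key or key == "your-api-key-here":
        raise ValueError(f"{var} is missing or still set to the placeholder")
    return key
```

Calling `load_plugin_api_key("~/.openclaw/openclaw.json")` returns the key when the entry is enabled and a real value is set.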

## Usage Examples

Just ask your OpenClaw agent:

- *"Extract text from this scanned PDF"*
- *"OCR this document in English and Hindi at 300 DPI"*
- *"Extract only the numbers from this invoice scan"*
- *"Read the text from this receipt photo"*
- *"Clean up this document photo to look like a scan"*
- *"Scan this photo then OCR the result"*
- *"How similar are these two documents?"*

## Use Cases

- **Invoice Processing** — OCR scanned invoices to extract line items and totals
- **Receipt Scanning** — Extract text from receipt photos for expense tracking
- **Document Digitization** — Convert legacy paper documents to searchable text
- **Multi-Language Documents** — Process documents in Hindi, French, German, Arabic, etc.
- **Business Card Reading** — Extract name, phone, and email from card photos
- **Meter Reading** — Extract digits from utility meter photos with character whitelisting
- **Document Photo Cleanup** — Turn phone photos into clean, professional scans
- **Fraud Detection** — Compare documents for visual similarity
- **QA Testing** — Compare rendered documents before and after changes
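
Several of these use cases end with a post-processing step on the extracted text. As a rough sketch of that step, assuming only that OCR returns plain text, the helpers below pull contact details from a business card and digits from a meter reading. The function names, the regular expressions, and the sample strings are illustrative assumptions; this runs locally and is separate from the plugin's server-side character whitelisting.

```python
import re

# Deliberately loose patterns: OCR output is noisy, so these are heuristics.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d ().-]{6,}\d")


def extract_contacts(text):
    """Return email addresses and phone-like strings found in OCR text."""
    return {
        "emails": EMAIL_RE.findall(text),
        "phones": PHONE_RE.findall(text),
    }


def digits_only(text):
    """Keep only the digits of a string, e.g. for a meter reading."""
    return re.sub(r"\D", "", text)


if __name__ == "__main__":
    # Made-up sample text standing in for OCR output.
    card = "Jane Doe\njane.doe@example.com\n+1 (555) 010-9999"
    print(extract_contacts(card))
    print(digits_only("Reading: 0 4 2 1 7 kWh"))
```

Filtering after the fact like this cannot recover characters that OCR misread, which is why restricting the character set during recognition is usually the better first step for digit-only inputs.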

## API Documentation

Full API docs: [https://pdfapihub.com/docs](https://pdfapihub.com/docs)

## License

MIT
