Ship Next Generation
Document Automations

Complete developer platform and SDK for shipping state-of-the-art document
processing in the age of LLMs. Extract data from any document format.

{ "invoice": { "number": "INV-2024-001", "date": "2024-01-15", "vendor": { "name": "Acme Corp", "address": "123 Main St" }, "items": [ { "desc": "Consulting", "qty": 10, "rate": 150, "amount": 1500 } ], "total": 1500, "tax": 150, "due": 1650 } }

{ "applicant": { "firstName": "John", "lastName": "Smith", "email": "john@email.com", "phone": "+1234567890", "address": { "street": "456 Oak Ave", "city": "Springfield", "zip": "12345" } }, "agreed": true, "date": "2024-01-15" }

{ "store": "Target Store #1234", "date": "2024-01-15", "time": "14:23", "items": [ { "name": "Milk 2%", "price": 4.99 }, { "name": "Bread", "price": 2.49 } ], "subtotal": 7.48, "tax": 0.52, "total": 8.00, "payment": "VISA *1234" }

{ "contract": { "title": "Service Agreement", "parties": { "client": "ABC Corp", "provider": "XYZ Services" }, "terms": { "duration": "12 months", "value": 50000, "startDate": "2024-02-01" }, "signatures": [ { "party": "client", "signed": true } ] } }

{ "statement": { "account": "****1234", "period": { "from": "2024-01-01", "to": "2024-01-31" }, "balance": { "opening": 1250.00, "closing": 890.50 }, "transactions": [ { "date": "2024-01-05", "desc": "Payment", "amount": -500.00 } ] } }

{ "report": { "title": "Q1 Sales Report", "period": "2024-Q1", "metrics": { "revenue": 125000, "growth": "15.2%", "customers": 342 }, "summary": { "highlights": [ "Strong Q1 performance", "New market expansion" ], "nextSteps": "Retention" } } }

{ "invoice": { "number": "INV-2024-001", "date": "2024-01-15", "vendor": { "name": "Acme Corp", "address": "123 Main St" }, "items": [ { "desc": "Consulting", "qty": 10, "rate": 150, "amount": 1500 } ], "total": 1500, "tax": 150, "due": 1650 } }

{ "applicant": { "firstName": "John", "lastName": "Smith", "email": "john@email.com", "phone": "+1234567890", "address": { "street": "456 Oak Ave", "city": "Springfield", "zip": "12345" } }, "agreed": true, "date": "2024-01-15" }

{ "store": "Target Store #1234", "date": "2024-01-15", "time": "14:23", "items": [ { "name": "Milk 2%", "price": 4.99 }, { "name": "Bread", "price": 2.49 } ], "subtotal": 7.48, "tax": 0.52, "total": 8.00, "payment": "VISA *1234" }

{ "contract": { "title": "Service Agreement", "parties": { "client": "ABC Corp", "provider": "XYZ Services" }, "terms": { "duration": "12 months", "value": 50000, "startDate": "2024-02-01" }, "signatures": [ { "party": "client", "signed": true } ] } }

{ "statement": { "account": "****1234", "period": { "from": "2024-01-01", "to": "2024-01-31" }, "balance": { "opening": 1250.00, "closing": 890.50 }, "transactions": [ { "date": "2024-01-05", "desc": "Payment", "amount": -500.00 } ] } }

{ "report": { "title": "Q1 Sales Report", "period": "2024-Q1", "metrics": { "revenue": 125000, "growth": "15.2%", "customers": 342 }, "summary": { "highlights": [ "Strong Q1 performance", "New market expansion" ], "nextSteps": "Retention" } } }

How it works

Why developers choose us as their goto platform

Join the growing number of customers and champions who trust Retab for document processing and automation.

Built-in preprocessing

We've thought about every edge case so you don't have to - we guarantee the best preprocessing, no matter the file.

40%
Accuracy

Auto Project and Improvement

Retab automatically evaluates the accuracy of your data extraction and provides feedback on how to improve the schema or the model settings.

Automatic Dataset Labeling

Multiple models label your docs automatically. You only review the few cells where models disagree in a simple table UI.

File NameTypeVendorAmount

Auto-model routing

Continuous benchmarking picks the best model for each document based on your accuracy and latency goals.

Integrations

Many automation triggers: email forwarding, Outlook plugin, webhooks, Google Sheets, Excel

Traceable Source Locations

View exactly where each piece of information was extracted from in the source document. See the model's reasoning traces before data extraction.

Vendor Name
Invoice Date
Total Amount
Due Date
Line Items

Simple to implement.

Only a few lines of code.

from retab import Retab

# Initialize the client with your API key
client = Retab(api_key="YOUR_RETAB_API_KEY")

# Submit a single document
completion = client.deployments.extract(
    project_id="proj_WaWbfpmqHfLCdcK_T4l_5",
    iteration_id="base-configuration",
    document="path/to/document.pdf"
)

print(completion)

Use cases

Built for every industry

Enterprise Ready

Entreprize grade security

Industry-leading document processing without compromising trust.

Secure, private, and compliant. Always.

  • SOC2 Type II

  • HIPAA

  • CCPA

Get started for free

No credit card required. No commitment.

Free

Unlimited platform access. Perfect for trying out Retab

$0/mo

1000 credits / mo included

  • Schema Designer & Reusable Schemas
  • Drag-and-Drop Ground Truth Table
  • Source Highlights & Reasoning Traces
  • Prompt Iteration + Field-Level Reasoning
  • Multi-LLM Consensus
  • Advanced Automations: Email, Outlook, API, Webhooks
  • Team Management & Role-based Access Control
  • Community Support
  • Continuous Model Selection & Auto-Routing

Scale

Unlimited platform access. For teams & production workloads

Custom

1000+ credits / mo

  • Schema Designer & Reusable Schemas
  • Drag-and-Drop Ground Truth Table
  • Source Highlights & Reasoning Traces
  • Prompt Iteration + Field-Level Reasoning
  • Multi-LLM Consensus
  • Advanced Automations: Email, Outlook, API, Webhooks
  • Team Management & Role-based Access Control
  • Continuous Model Selection & Auto-Routing
  • Real-time Monitoring & Analytics Dashboard
  • 24/7 Priority Support
  • Dedicated Account Manager
  • Custom Integrations & Whitelabeling
  • Advanced Security & Compliance (SOC 2, GDPR)

Preprocessing

Document optimization: orientation correction, table conversion, OCR enhancement

0.50credits /page

Auto-large extraction

Premium AI models (GPT-4.1, Gemini 2.5 Pro) for complex documents and maximum accuracy

2credits /page

Auto-small extraction

Fast, cost-effective extraction for simpler documents (routing to GPT-4.1, Gemini 2.5 Flash Lite)

0.50credits /page

Auto-micro extraction

Ultra-fast, budget-friendly extraction for high volumes (routing to GPT-4.1 Nano, Gemini 2.5 Flash Lite)

0.10credits /page

FAQ

Frequently asked questions

Rapid-fire answers to common questions. Have a question of your own? Our Support Engineers are ready to help. Get in touch

Retab can process a wide variety of documents including invoices, receipts, contracts, forms, reports, and more. Our AI models are trained to handle both structured and unstructured documents across multiple formats like PDF, PNG, JPEG, and TIFF.

Retab - Ship next generation document automations