Retab

Retab – AI-powered document automation for developers

Parse, validate, and structure PDFs, emails, and images with reliable AI. Simple SDKs. Production-ready.

Ship Next Generation Document Automations

Complete developer platform and SDK for shipping state-of-the-art document
processing in the age of LLMs. Extract data from the most complex documents.

How it works

Why developers choose us as their go-to platform

Join the growing number of developers and enterprises that trust Retab for document processing automations.

Built-in preprocessing

We've thought about every edge case so you don't have to. We guarantee the best preprocessing, no matter the file.

Evaluations and self-optimizing schemas

Retab automatically evaluates the accuracy of your data extraction and provides feedback on how to improve the schema or the model settings.
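
To make the idea of field-by-field evaluation concrete, here is a minimal sketch of how per-field accuracy could be computed against a labeled reference set. It is an illustration only, not Retab's implementation: the `field_accuracy` function, the field names, and the sample values are all hypothetical, and it assumes exact-match comparison.

```python
from collections import defaultdict


def field_accuracy(predictions, ground_truth):
    """Return, for each field, the fraction of documents whose extracted
    value exactly matches the reference value (hypothetical helper)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for predicted, expected in zip(predictions, ground_truth):
        for field, expected_value in expected.items():
            total[field] += 1
            if predicted.get(field) == expected_value:
                correct[field] += 1
    return {field: correct[field] / total[field] for field in total}


# Illustrative data: two documents, two fields, one wrong amount.
predictions = [
    {"vendor": "Acme", "total_amount": 120.0},
    {"vendor": "Globex", "total_amount": 75.5},
]
ground_truth = [
    {"vendor": "Acme", "total_amount": 120.0},
    {"vendor": "Globex", "total_amount": 57.5},
]

scores = field_accuracy(predictions, ground_truth)
print(scores)
```

A low score on a single field is the kind of signal that would suggest revisiting that field's schema description or trying a different model.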

Automatic Dataset Labeling

Multiple models label your docs automatically.
You review only the few cells where the models disagree, which takes minutes.

File Name | Type | Vendor | Amount
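
The review-only-disagreements workflow can be sketched as a majority vote across model outputs. This is a simplified illustration, not Retab's actual algorithm: `label_with_consensus`, the model names, and the sample labels are hypothetical, and it assumes every model returns the same files and fields with hashable values.

```python
from collections import Counter


def label_with_consensus(labels_by_model):
    """Majority-vote each cell and collect the cells that need human review.

    labels_by_model maps model name -> file name -> field -> value.
    """
    models = list(labels_by_model)
    reference = labels_by_model[models[0]]
    consensus = {}
    needs_review = []
    for file_name, fields in reference.items():
        consensus[file_name] = {}
        for field in fields:
            votes = Counter(
                labels_by_model[model][file_name].get(field) for model in models
            )
            value, count = votes.most_common(1)[0]
            consensus[file_name][field] = value
            # Any cell without unanimous agreement is flagged for review.
            if count < len(models):
                needs_review.append((file_name, field))
    return consensus, needs_review


labels = {
    "model_a": {"invoice_001.pdf": {"vendor": "Acme", "amount": "120.00"}},
    "model_b": {"invoice_001.pdf": {"vendor": "Acme", "amount": "120.00"}},
    "model_c": {"invoice_001.pdf": {"vendor": "Acme", "amount": "12.00"}},
}

consensus, needs_review = label_with_consensus(labels)
print(consensus)
print(needs_review)
```

Here only the `amount` cell is flagged, so a reviewer checks one cell instead of the whole row.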

Auto-model routing

Continuous benchmarking picks the best model for each document based on your accuracy and latency goals.
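
As a rough sketch of what routing on accuracy and latency goals might look like, the example below filters benchmark results by both thresholds and then picks the cheapest eligible model. The `BenchmarkResult` type, the `pick_model` function, the tier names, the cost-based tie-break, and all numbers are assumptions made for illustration; they do not describe Retab's internal routing.

```python
from dataclasses import dataclass


@dataclass
class BenchmarkResult:
    model: str
    accuracy: float            # fraction of fields extracted correctly
    latency_seconds: float     # average time per document
    cost_per_document: float   # illustrative cost unit


def pick_model(results, min_accuracy, max_latency_seconds):
    """Return the cheapest model meeting both goals, or None if none does."""
    eligible = [
        r for r in results
        if r.accuracy >= min_accuracy and r.latency_seconds <= max_latency_seconds
    ]
    if not eligible:
        return None
    return min(eligible, key=lambda r: r.cost_per_document).model


# Hypothetical benchmark numbers for three placeholder tiers.
results = [
    BenchmarkResult("large", accuracy=0.98, latency_seconds=6.0, cost_per_document=0.05),
    BenchmarkResult("small", accuracy=0.95, latency_seconds=2.0, cost_per_document=0.01),
    BenchmarkResult("micro", accuracy=0.88, latency_seconds=0.8, cost_per_document=0.002),
]

chosen = pick_model(results, min_accuracy=0.94, max_latency_seconds=3.0)
print(chosen)
```

Returning `None` when no model satisfies both goals keeps the trade-off explicit: the caller has to relax either the accuracy or the latency target.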

Integrations

Many automation triggers: email forwarding, Outlook plugin, webhooks, Google Sheets, Excel, and many more.

Traceable Source Locations

View exactly where each piece of information was extracted from in the source document. See the model's reasoning traces before data extraction.

Vendor Name
Invoice Date
Total Amount
Due Date
Line Items

Simple to implement

Only a few lines of code

from retab import Retab

# Initialize the client with your API key
client = Retab(api_key="YOUR_RETAB_API_KEY")

# Submit a single document
completion = client.projects.extract(
    project_id="proj_WaWbfpmqHfLCdcK_T4l_5",
    document="path/to/document.pdf"
)

print(completion)

Use cases

Built for every industry

Enterprise

Enterprise-ready security

Industry-leading document processing without compromising trust.

Read our Privacy Policy

Secure, private, and compliant. Always.

  • SOC2 Type II

  • HIPAA

  • CCPA

Pricing

No credit card required. No commitment. Get started for free.

  • Free

    Best to explore document processing

    $0 / month

    1,000 credits

    • State-of-the-art preprocessing for any file (PDF, email, Excel, …)
    • UI for schema design
    • Automated dataset labeling
    • Source highlighting
    • Model + schema evaluations with detailed field-by-field metrics
    • Different models for various latency requirements
    • Dashboard to monitor extractions
    • Access to parse/extract APIs and the online platform
  • Pro

    Best for early-stage teams

    $300 / month

    30,000 credits, then $0.01 per credit

    • Everything in Free, plus:
    • Multi-LLM consensus
    • Team Management & role-based access control
    • Continuous model selection & auto-routing
    • Agent for schema/model optimization
    • Higher rate limits
  • Enterprise

    Best for full control & custom needs

    Custom

    • Everything in Pro, plus:
    • Option to have us build a custom processor for you
    • Volume-based discounts
    • Premium rate limits
    • Forward deployed support and maintenance
    • Custom SLAs/SLOs
    • Advanced Security & SOC/GDPR compliance
    • Custom data retention
    • Custom deployment & integrations
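
For budgeting, the Pro plan figures above ($300 per month, 30,000 included credits, $0.01 per credit after that) can be turned into a quick estimate. The sketch below assumes overage is billed per credit with no rounding or minimums, which the pricing table does not spell out; the function name `pro_monthly_cost` is hypothetical.

```python
PRO_BASE_CENTS = 300 * 100       # $300 per month
PRO_INCLUDED_CREDITS = 30_000    # credits included in the base price
PRO_OVERAGE_CENTS = 1            # $0.01 per additional credit


def pro_monthly_cost(credits_used):
    """Estimate the monthly Pro bill in dollars for a given credit usage."""
    overage_credits = max(0, credits_used - PRO_INCLUDED_CREDITS)
    total_cents = PRO_BASE_CENTS + overage_credits * PRO_OVERAGE_CENTS
    return total_cents / 100


for credits in (10_000, 30_000, 45_000):
    print(credits, pro_monthly_cost(credits))
```

Working in integer cents avoids floating-point drift when usage is large.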

Auto-large extraction

Premium AI models (GPT-5, Gemini 2.5 Pro, Claude 4 Sonnet) for complex documents and maximum accuracy

Auto-small extraction

Fast, cost-effective extraction for simpler documents (routing to GPT-5-mini, Gemini 2.5 Flash)

Auto-micro extraction

Ultra-fast, budget-friendly extraction for high volumes (routing to GPT-5 Nano, Gemini 2.5 Flash Lite)

FAQ

Frequently asked questions

Everything about Retab.

Retab can process a wide variety of documents including invoices, receipts, contracts, forms, reports, and more. Our AI models are trained to handle both structured and unstructured documents across multiple formats like PDF, PNG, JPEG, and TIFF.