Built by Crescentic LLC

The document
spine for
regulated
workflows.

docABL ingests, classifies, extracts, validates, and reconciles the documents your regulated process depends on — with full lineage, tenant isolation, multi-format export, and an audit trail that holds up.

Get started Sign in

Row-level security

Multi-tenant

Lineage-first

docABL — live extraction pipeline

Watch a 1099 move from upload to reconciled insight·

Official document infrastructure for

Gyrence

PDF & MS Office render / raster provider

wealthABL

Regulatory document provider

JNL Square

Regulatory document provider

tokenizABL

Regulatory document provider

CorpAction Chain

Regulatory document provider

bridgeABL

Regulatory document provider

Gyrence

PDF & MS Office render / raster provider

wealthABL

Regulatory document provider

JNL Square

Regulatory document provider

tokenizABL

Regulatory document provider

CorpAction Chain

Regulatory document provider

bridgeABL

Regulatory document provider

Three-lane parse front

One intake. Three engines. No documents left behind.

Every submission is routed to the lane that will actually read it. The choice is recorded, so the audit trail tells you not just what was extracted but how.

Native-text PDF

Direct text extraction from machine-generated PDFs — the fast path, with structure and coordinates preserved for downstream classifiers.

Image OCR

Scanned and photographed documents pass through an OCR pipeline tuned for regulated-form vocabulary, with confidence scoring per region.

Image-PDF rasterizer

Hybrid PDFs whose pages are really images get rasterized server-side and routed back through OCR — no silent data loss.

End-to-end workflow

From upload to reconciled, with the audit trail attached.

docABL is more than a parser. The same document moves through intake, workspace, download, and reconciliation — and every state transition is recorded.

Upload

Drop PDFs, scans, or image-PDFs into the intake. Idempotent submissions — re-uploading a file you already processed returns the same document, not a duplicate.

Workspace

Every submission shows up immediately with live status: queued, processing, extracted, or failed — with reason, retry, and a link to the parsed result.

Download

Open any document and pull the parsed output in the format you need — normalized JSON, markdown, plain text, or the original bytes.

Compare & reconcile

Diff extracted structures across versions, reconcile against templates, and surface field-level dispositions: matched, parse miss, or reconciled.

Current inventory

What's live today.

Everything below is shipped and reachable from your workspace once you sign in.

For operators

Idempotent intake

Resumable, dedup-by-hash uploads with per-submission status.

Live workspace

All your files, sortable by status, type, and last update.

Multi-format export

Normalized JSON, markdown, text, and original source bytes.

Compare & reconcile

Side-by-side structural diffs with field-level dispositions.

For platform & governance

Templates & catalog

Versioned templates and a document catalog scoped to your tenant.

Process jobs

Operate the parse front: requeue, inspect, and tail workers.

Ping central

Health surface for storage, vision cache, and gateway routes.

Coverage matrix

See which form codes and templates are exercised — and which aren't.

Passthrough audit

Every byte handoff to the sensitive lane is logged and replayable.

Tenant-scoped RLS

Postgres row-level security on every table — no app-layer shortcuts.

Status, transparent by default

Every document has one of four states.

The Workspace surfaces these live. No black-box "processing forever" — if something failed, you'll see why.

Queued

Accepted, hash recorded, awaiting a worker.

Processing

Routed to a parse lane; partial progress visible.

Extracted

Structure, fields, and downloads are ready.

Failed

Reason captured. Retry is one click; the original bytes are preserved.

Tenants & accounts

Isolation that's enforced where it counts.

Every document, classification, and validation issue belongs to a tenant. Postgres row-level security — not application code — decides who can see what. Personas (service, reviewer, operator) layer on top without bending the model.

Tenant

The boundary. Your org's data never crosses it.

Accounts

Real people, mapped to auth.uid via a server-side trigger.

document_access

Per-document grants for cross-team reviews.

Ready when your documents are.

Create an account

The documentspine forregulatedworkflows.

One intake. Three engines. No documents left behind.

Native-text PDF

Image OCR

Image-PDF rasterizer

From upload to reconciled, with the audit trail attached.

Upload

Workspace

Download

Compare & reconcile

What's live today.

Every document has one of four states.

Isolation that's enforced where it counts.

Ready when your documents are.

The document
spine for
regulated
workflows.