Form-aware extraction
W-9, 1040, ACORD claims forms, vendor onboarding packets. Fields snap to your schema even when layouts drift.
- Pre-built schemas for 200+ common forms
- Custom schemas via JSON-Schema or Zod
- Active learning from corrections
Features
Every Vellum extraction returns the same JSON shape regardless of input — form, table, scan, multilingual mix. Your downstream code doesn’t care which mode ran.
W-9, 1040, ACORD claims forms, vendor onboarding packets. Fields snap to your schema even when layouts drift.
Multi-page tables, nested rows, header propagation. Preserves the cell-to-row relationships your downstream system needs.
Signature presence, signer name, stamp text, notary marks, dated approvals — all extracted with bounding boxes.
Mixed scripts in one document — English contract with Mandarin appendix and Arabic notary stamp. Output normalized to your locale.
Drop a 10K-document folder. Vellum fans it out across pods, fires per-doc webhooks, and rolls up failures into a triage queue.
Detect → redact → log per pipeline. Tenant-isolated extraction means your documents never train shared models.
How it works
Four steps. The first three are sub-second; the fourth runs async and emits a webhook when it’s done.
Vellum auto-detects whether to run form / table / signature / multilingual mode. Override per pipeline.
Field-level confidence. Below your threshold? Auto-routed to human-in-the-loop queue.
Every field gets a bounding box reference. Output is signed with a content hash for the audit log.
JSON to your webhook, row to your warehouse, record to your ERP. Pick one or fan out to many.
Integrations
Vellum is the layer between your documents and your stack. Every extraction can write to multiple destinations in parallel.
NetSuite
Bi-directional invoice + PO sync
Workday
AP automation, expense routing
Zapier
5,000+ apps via Vellum action
Snowflake
Direct table write, change feed
Salesforce
Contract intake → opportunity
REST + webhooks
OpenAPI spec, signed events
Drop a PDF, watch it become structured data, copy the JSON. Three minutes from open to wow.
100 free pages / month, forever