Skip to main content

Case Auto-Formation

Talonic automatically groups related documents into cases by detecting shared entities like vendor names, contract IDs, and invoice numbers across extractions.

How cases form

When the four-phase pipeline extracts data from a document, it identifies entities — values linked to Field Registry entries marked as linkable. If two documents share the same vendor name or invoice number, Talonic clusters them into a single case. This inference-based clustering replaces manual document grouping.

Entity linking

Entity linking uses the Schema Graph to match values across schemas. A schema field marked as a linking entity (e.g., vendor_name) triggers cross-document matching. The 529-type ontology provides entity type definitions that guide the matching process.

Case resolution

Cases resolve automatically as new documents are extracted. Adding a delivery note that shares an order_id with an existing invoice case will merge the delivery note into that case. Each document in a case retains its own per-cell provenance trail.

Viewing cases

Cases are visible in the Talonic platform and can be queried through the search API. Use the filter endpoint to find all documents belonging to a specific case. Cases group documents across sources, enabling cross-source entity resolution.

Use cases

Cases are valuable for accounts payable (grouping invoices, purchase orders, and delivery notes for a vendor), compliance (linking contracts with amendments and addenda), and logistics (connecting bills of lading with customs declarations). The extraction data endpoint returns case membership for each extraction.