Skip to content

Reconciliation Jobs

A reconciliation job (or “check”) compares data between two connections. It consists of two phases: extraction and comparison.

DataRecs runs your configured query against each connection and pulls the result sets into a staging area. Extractors run as isolated Kubernetes Jobs, ensuring resource limits and tenant isolation.

Once both extractions complete, a Comparator analyses the two result sets. It identifies:

  • Matching rows — rows present in both sources with identical values.
  • Missing rows — rows present in one source but not the other.
  • Mismatched rows — rows present in both but with differing values in one or more columns.

You can organise related jobs into job groups and run them together on a schedule or on-demand. Job groups track run history and aggregate status.

Jobs can be triggered:

  • Manually from the Console or CLI.
  • On a schedule via the built-in scheduler (backed by Apache Airflow).
  • Programmatically via the REST API.