Runner Scaling Model

Assumptions

Runner state (saga state, dedupe markers, checkpoints, outbox, schedules) is stored in a local MDBX database via edge_storage.
Correctness for a given tenant+saga depends on reading/writing the same storage instance over time.

Run multiple Runner instances, each responsible for a disjoint set of tenants, and give each instance its own storage volume.

Use RUNNER_TENANT_ALLOWLIST to bind an instance to tenants.
Or use NATS KV placement: set RUNNER_TENANT_PLACEMENT_BUCKET and RUNNER_SHARD_ID.
Streams/consumers can be shared; subjects are tenant-qualified, and per-instance consumers filter by tenant subjects.

Example:

If RUNNER_TENANT_PLACEMENT_BUCKET and RUNNER_SHARD_ID are set, the Runner watches a NATS KV bucket where:

and dynamically updates the set of per-tenant consumers it is polling without restarting.

If two replicas for the same tenant use different local storages, they will not share:

and can duplicate work.

To support same-tenant replicas, storage must be shared/replicated (not implemented here).

Use the drain endpoint before stopping a process:

Controlled replay exists for operational/debug use: