Lot-Size Quantization & Residual Inventory Slippage Playbook

2026-03-12 · finance

Lot-Size Quantization & Residual Inventory Slippage Playbook

Date: 2026-03-12
Category: research
Audience: small quant execution teams running child-order schedulers across venues with discrete lot/notional constraints


Why this playbook exists

Most execution simulators optimize in continuous size space, but live venues fill in discrete units:

That mismatch creates quantization residuals. At first they look tiny, then they accumulate near deadlines and force expensive late-stage catch-up (crossing wider spreads, paying higher markout risk, and blowing up p95 costs).

If your slippage model ignores quantity discretization, you systematically underprice tail execution risk.


Core idea

Treat size discretization as a first-class branch in execution cost, not post-trade noise.

For parent target Q, continuous child suggestion q*_k, and venue lot size L_k:

Total expected cost:

E[Cost] = C_base + C_quant + C_rescue

Where:


Failure mechanism (how tiny errors become expensive)

  1. Continuous scheduler emits many sub-lot recommendations.
  2. Rounding/down-clipping creates small underfills each interval.
  3. Residual inventory drifts upward while clock time decays.
  4. Urgency spikes near deadline and forces aggressive marketable flow.
  5. Tail slippage increases nonlinearly (especially in thin books).

This is a convexity problem: small early quantization errors can create large late execution tax.


Cost decomposition

Cost = IS + QuantizationDriftTax + ResidualConvexityTax + TerminalMissPenalty

1) Quantization Drift Tax

Incremental cost from repeated rounding mismatch while schedule is still flexible.

2) Residual Convexity Tax

Extra cost from urgency escalation because residual remained unresolved too long.

3) Terminal Miss Penalty

Opportunity cost (or forced crossing premium) when non-zero residual hits hard deadline.


Key metrics

Quantization Pressure Ratio (QPR)

QPR_t = |r_t| / max(RemainingTarget_t, epsilon)

How much unfinished inventory is purely discretization-driven relative to remaining task.

Residual Catch-up Convexity (RCC)

RCC = d(Slippage) / d(Urgency) | high-QPR

Measures how expensive urgency becomes when residual pressure is already elevated.

Discrete Fill Friction (DFF)

DFF = Slippage_discrete - Slippage_continuous_counterfactual

Matched by volatility/spread/participation regime.

Rounding Bias Drift (RBD)

RBD = mean(sign(q*_k - q_k))

Persistent sign means systematic over- or under-rounding bias.

Lot Feasibility Stress (LFS)

LFS = Fraction(sub-lot recommendations over window)

High LFS means scheduler is operating outside feasible size grid too often.


State machine

ALIGNED

DRIFTING

CATCHUP_RISK

DEADLINE_RESCUE

SAFE


Control policy

In DRIFTING

In CATCHUP_RISK

In DEADLINE_RESCUE

In SAFE


Execution objective (lot-aware)

For decision a_k at step k:

J(a_k) = E[BaseCost_k] + λ1*|r_k| + λ2*TailRisk_k + λ3*TerminalRisk_k

Subject to:

This prevents infeasible “fractional optimal” schedules from leaking into live slippage.


Data contract (must-have fields)

Per child recommendation + dispatch + fill:

Without recommendation-vs-dispatch lineage, quantization tax is invisible.


Calibration loop

Intraday (5-10 min)

Daily

Weekly


Practical implementation patterns

  1. Stochastic rounding with bias control
    Reduces systematic drift vs hard floor rounding (with risk guardrails).

  2. Residual escrow bucket
    Hold a controlled residual buffer that gets amortized earlier, not at deadline.

  3. Lot-aware optimizer layer
    Project continuous plan onto feasible grid at planning time, not only at dispatch.

  4. Venue increment arbitration
    Prefer finer increment venues when residual pressure is high and toxicity is acceptable.

  5. Deadline-aware residual damping
    Increase residual penalty as time-to-close shrinks to avoid panic rescue.


Common failure modes

  1. Continuous-only backtests
    Simulated fills are infeasible; live slippage appears as “unexpected drift.”

  2. Static floor rounding
    Persistent underfill bias accumulates into terminal catch-up spikes.

  3. No residual state in policy
    Router reacts only to market state, ignoring self-created inventory pressure.

  4. Late rescue with no aggression cap
    Small remaining quantity still causes outsized markout/impact tails.

  5. No post-trade quantization attribution
    Team cannot tell model error vs discretization mechanics.


Dashboard minimum


Practical takeaway

Discrete size constraints are not implementation detail—they are execution physics.

If you optimize in continuous space and round at the edge, residual inventory will eventually force convex late-stage slippage. Model quantization explicitly, control residual drift early, and treat deadline rescue as a governed mode, not a last-minute reflex.


One-line implementation mantra

Plan on the executable size grid from the start, or pay the rounding bill at the worst possible time.