Off-Exchange Internalization and Lit Residual Toxicity: A Slippage Modeling Playbook
Date: 2026-03-09
Category: research (execution microstructure)
Why this matters
When benign retail flow is heavily internalized off-exchange, displayed lit liquidity can become a selection-biased remainder:
- quotes still look tight,
- spread capture looks tempting,
- but fills on lit books become increasingly toxic,
- and passive strategies pay through post-fill markouts.
So the desk is not just routing across venues. It is routing across different information pools.
Failure pattern (seen in production)
- Router sees stable NBBO and healthy displayed depth.
- Passive lit posting is increased to harvest spread/rebates.
- Benign contra flow gets internalized elsewhere first.
- Lit fills arrive disproportionately when informed flow sweeps through.
- Short-horizon markouts worsen; queue alpha goes negative.
- Model still attributes losses to “volatility/noise” instead of venue-flow selection.
Net effect: apparent spread quality masks worsening adverse selection.
Mechanism in one equation
Let
- (\lambda_B): benign contra-arrival intensity
- (\lambda_T): toxic contra-arrival intensity
- (p_B): probability benign flow is absorbed off-exchange
- (p_T): probability toxic flow is absorbed off-exchange
Then toxic share among lit fills is
[ \tau_{lit}=\frac{(1-p_T)\lambda_T}{(1-p_B)\lambda_B + (1-p_T)\lambda_T} ]
If (p_B > p_T) (common in practice), (\tau_{lit}) rises even when headline spread is unchanged.
That is the residual-toxicity channel.
Cost decomposition
For parent notional (Q):
[ C_{total}=C_{px}+C_{fee}+C_{markout}+C_{delay}+C_{catchup} ]
where
- (C_{px}): immediate execution vs benchmark
- (C_{fee}): maker/taker + routing economics
- (C_{markout}): adverse post-fill drift (core residual-toxicity cost)
- (C_{delay}): waiting cost if passive policy underfills
- (C_{catchup}): late aggressive completion cost
In this regime, (C_{markout}) dominates long-run leakage.
Core signals (add to dashboard)
Off-Exchange Absorption Skew (OEAS)
Estimated differential absorption of benign vs toxic flow (proxy for (p_B-p_T)).Lit Residual Toxicity Index (LRTI)
Difference between realized lit-fill markout and blended-venue baseline markout.Quote-to-Print Dissociation (QPD)
Frequency of stable quotes but unstable execution outcomes/markouts.Passive Edge Decay (PED)
Rolling decay of expected passive edge: spread+rebate minus expected markout.Residual Completion Pressure (RCP)
Risk that conservative posting now forces expensive catch-up later.
Modeling architecture
1) Two-layer fill model
Layer A: fill probability by venue/policy.
Layer B: conditional markout distribution given fill.
Do not collapse these into one average-cost model. Separation is required to expose selection effects.
2) Branch model per child decision
For each action (lit post / midpoint / take):
- branch 1: fill + favorable markout
- branch 2: fill + adverse markout
- branch 3: no fill + queue decay
- branch 4: forced aggressive catch-up
Optimize expected cost with tail penalty:
[ J(a)=\mathbb{E}[C_{total}|x_t,a] + \lambda,\text{CVaR}{95}(C{total}|x_t,a) ]
3) Regime feature set
- off-exchange share by symbol/time bucket
- microprice drift and OFI
- queue depletion velocity
- lit-vs-off-ex trade-size mix
- rolling markout asymmetry (buy vs sell)
Practical state controller
STATE A: BALANCED FLOW
LRTI low, PED stable → normal passive mix.STATE B: BENIGN-DRAIN
OEAS rising, passive fill quality degrading → reduce lit passive dwell time, increase conditional midpoint probing.STATE C: LIT-TOXIC
LRTI high, markout tails widening → downweight lit passive; prioritize faster completion paths with tighter risk caps.STATE D: SAFE MODE
Tail budget breached or completion at risk → enforce conservative participation cap + deterministic fallback rules.
Use hysteresis to avoid rapid mode flapping.
Guardrails for live deployment
Markout-first venue scoring
Score venues by net post-fill outcome, not quoted spread alone.Passive-edge kill switch
If PED < 0 beyond threshold window, auto-throttle passive posting.Time-left residual cap
Prevent late forced catches by explicit residual budget checkpoints.Symbol-bucket segmentation
Internalization effects vary by symbol/price/retail intensity; avoid global policy.Counterfactual replay lane
Daily replay current policy vs toxicity-aware policy before expanding rollout.
Minimal rollout checklist
- Add OEAS/LRTI/PED/RCP telemetry to execution dashboard
- Split fill model from post-fill markout model in training pipeline
- Add CVaR95 objective and state-machine action map
- Enable passive-edge kill switch and residual checkpoints
- Run 2-week shadow A/B by symbol buckets
- Promote only if completion stays stable and q95 net slippage improves
Bottom line
Off-exchange internalization can quietly transform lit books into a more toxic residual pool.
If your slippage model ignores that selection effect, passive fills look cheap until markouts and late catch-up costs settle the bill.