Slippage Modeling Stack: From IS Decomposition to Live Routing (Practical Playbook)

Date: 2026-03-16
Category: research
Audience: small quant/operator teams running real capital with limited infra headcount

Why this research matters

"Slippage" is often tracked as one number (realized vs expected), but execution operations need a causal model:

what was spread tax,
what was market impact,
what was timing/latency drift,
what was opportunity cost from non-fill.

Without decomposition, teams overfit tactics to noise (e.g., blaming venue when the real issue is decision→gateway delay).

The practical target is not perfect prediction. It is stable pre-trade cost ranking + post-trade attribution that operators trust.

1) Cost decomposition you can run daily

For a parent buy order with decision price (P_0), shares (Q), and executed child fills ((q_i, p_i)):

VWAP fill price: (\bar{P}=\frac{\sum_i q_i p_i}{\sum_i q_i})
Executed quantity: (Q_f=\sum_i q_i)
Signed IS in bps:

[ IS_{bps}=10^4\cdot s\cdot\frac{\bar{P}-P_0}{P_0},\quad s=+1\ (buy),\ -1\ (sell) ]

Operationally split into:

[ IS \approx Spread + Fees/Rebates + Delay + TemporaryImpact + PermanentImpact + OpportunityCost ]

Where opportunity cost for unfilled residual (Q-Q_f) is marked at a deadline benchmark (close, TWAP horizon end, or strategy next decision time).

Key rule: keep this decomposition deterministic and versioned (benchmark choice changes numbers materially).

2) Baseline model family (fast, robust)

Before deep models, deploy a strong linear/nonlinear baseline on signed bps cost:

[ \hat{IS}=\beta_0 +\beta_1\cdot \text{halfSpread}{bps} +\beta_2\cdot \left(\frac{|Q|}{ADV}\right)^{1/2} +\beta_3\cdot \sigma{intraday} +\beta_4\cdot \text{participation} +\beta_5\cdot \text{latency}_{ms} +\beta_6\cdot \text{queueImbalance} +\beta_7\cdot \text{regimeDummies} ]

Useful choices:

Huber/quantile regression over pure OLS (tails are heavy).
Separate models by liquidity bucket (large-cap vs small-cap behavior differs more than most feature tweaks).
Include interactions like participation × spread regime.

This already beats many ad-hoc cost heuristics in production.

3) Impact priors that generalize

Use empirical priors as regularizers, not gospel:

A) Square-root impact prior

[ I(Q) \approx Y\cdot\sigma\cdot\sqrt{\frac{|Q|}{V}} ]

(\sigma): daily volatility scale
(V): daily volume scale
(Y): fitted constant by asset class/venue regime

B) Participation-rate prior

[ I \propto \sigma\cdot \eta^{\delta},\quad \eta=\frac{\text{traded volume rate}}{\text{market volume rate}},\quad \delta\in[0.4,0.8] ]

In live systems, the right move is usually:

start with priors,
fit on your own fills,
monitor drift weekly,
re-estimate by regime.

4) Feature contract for production slippage models

Minimum feature blocks (point-in-time safe):

Order state: side, remaining qty, urgency, deadline
Microstructure: spread, top-N depth, imbalance, recent trade sign pressure
Participation: child size / rolling venue volume
Volatility: short-horizon realized vol + gap/open/close flags
Path latency: decision→send, send→ack, ack→fill
Venue/routing: venue id, order type, midpoint/peg flags
Fee table version: explicit snapshot id at decision time

If one of these is missing, model confidence must be downgraded automatically.

5) Training/validation protocol (operator-safe)

Split

Purged time-series CV (no leakage across neighboring windows)
Symbol-stratified folds for cross-sectional stability

Metrics

MAE/RMSE on IS bps (q50)
Pinball loss on q90/q95 (tail control)
Calibration error of exceedance events (realized > predicted q95)
Decision metric: hit-rate of tactic ranking (did lowest predicted-cost tactic win?)

Deployment gates

Tail underestimation ratio < threshold
Residual drift PSI/KS within control limits
No performance cliff in open/close and high-news windows

6) Online serving pattern

Two-stage serving works best:

Fast scorer (<1 ms): predicts mean + q90/q95 per tactic
Policy layer: chooses tactic minimizing

[ Score = E[IS] + \lambda_{tail}\cdot Q95(IS) + \lambda_{deadline}\cdot P(unfinished) ]

Hard constraints override score:

max spread multiple,
max participation cap,
venue reject spike circuit breaker,
fail-open fallback tactic.

7) Post-trade attribution loop (the part teams skip)

Daily runbook:

Recompute decomposition per parent order
Compare predicted vs realized by bucket (symbol/venue/time-of-day)
Attribute residuals to one of: model miss, data quality, infra latency, venue behavior shift
Open incident if tail breach repeats in same bucket for 2+ sessions

Weekly runbook:

Refit coefficients/quantiles
Refresh fee/rebate snapshots
Re-validate fallback policy triggers

This is where slippage models become operations tooling, not just research output.

8) Common failure patterns and fixes

Ignoring opportunity cost → model recommends passive tactics that “look cheap” but miss fills.
Fix: include deadline-completion terms and residual mark-to-benchmark.
Latency treated as noise → unexplained drift during infra incidents.
Fix: make path-latency first-class features and alert dimensions.
Single global model across all liquidity regimes.
Fix: hierarchical/bucketed models with shared priors.
Fee table drift not versioned.
Fix: point-in-time fee snapshot id in both features and attribution.
No tail governance (mean error looks fine, blowups keep happening).
Fix: quantile objectives + breach-rate SLOs.

9) Minimal 10-day implementation plan

Days 1-2
Define deterministic IS decomposition + benchmark policy.

Days 3-4
Build feature contract and PIT-safe dataset manifests.

Days 5-6
Train robust baseline (mean + q90/q95), bucketed by liquidity regime.

Days 7-8
Wire online scorer + policy layer with hard constraints.

Day 9
Shadow run against current tactic, collect residual diagnostics.

Day 10
Canary with tight kill-switches; freeze runbook v1.

Bottom line

A practical slippage model is a control system:

decomposition for diagnosis,
predictive surfaces for routing,
tail-aware governance for capital protection.

If you can only do one thing this month: ship deterministic IS decomposition + q95-aware tactic ranking. That alone typically removes a large fraction of avoidable execution drag.

References

Perold, A. F. (1988), The Implementation Shortfall: Paper versus Reality (JPM)
https://www.hbs.edu/faculty/Pages/item.aspx?num=2083
Almgren, R., Chriss, N. (2000), Optimal Execution of Portfolio Transactions
https://www.smallake.kr/wp-content/uploads/2016/03/optliq.pdf
Almgren, R. et al. (2005), Direct Estimation of Equity Market Impact
https://www.cis.upenn.edu/~mkearns/finread/costestim.pdf
Tóth, B. et al. (2011), Anomalous price impact and the critical nature of liquidity in financial markets
https://arxiv.org/abs/1108.1632
Bucci, F. et al. (2022), Market Impact: Empirical Evidence, Theory and Practice
https://hal.science/hal-03668669v1/file/Market_Impact_Empirical_Evidence_Theory_and_Practice.pdf