Execution-Induced Volatility Feedback (EIVF) Slippage Playbook

Date: 2026-03-01
Category: research (slippage modeling)

Why this matters

Most execution models treat volatility as exogenous (market gives it, we react to it).
In practice, when participation and aggression rise in thin books, our own flow can:

widen spread,
increase short-horizon return variance,
trigger additional cancellations,
worsen queue survival for the next child orders.

That creates a feedback loop:

more urgency → more local volatility → worse expected slippage → even more urgency near deadline.

If this loop is not explicitly modeled, average slippage may look acceptable while p95/p99 blows up during fragile intervals.

Core idea

Model slippage as a coupled system:

direct cost (spread + impact + delay/opportunity)
feedback cost (today’s child order increases near-future volatility, which increases cost of remaining inventory)

So each child-order decision should optimize not only immediate fill quality, but also its future volatility footprint.

Minimal state and notation

At decision time (t):

(x_t): remaining shares
(\tau_t): remaining time to deadline
(u_t): control action (POV/aggression level)
(s_t): spread
(\sigma_t): short-horizon realized volatility estimate
(r_t): refill rate (depth recovery)
(k_t): cancel intensity near touch
(z_t): regime features (OFI, auction proximity, event flags, latency)

Target objective:

[ \min_{u_{t:T}} ; \mathbb{E}[\text{IS}] + \lambda_{95} \cdot Q_{0.95}(\text{IS}) + \lambda_{\text{miss}}\cdot \Pr(\text{underfill by deadline}) ]

with safety constraints (hard):

max participation cap,
max short-horizon expected markout,
kill-switch at stress thresholds.

Model decomposition

1) Base slippage model

Predict baseline conditional quantiles:

[ \hat{q}{\alpha}^{\text{base}}(t, u_t) = f{\alpha}(x_t, \tau_t, s_t, \sigma_t, r_t, k_t, z_t, u_t) ]

Use quantile model family (LightGBM/XGBoost quantile, or conformalized residual layer).

2) Volatility-feedback transition model

Estimate near-future volatility response from own action:

[ \Delta \sigma_{t+\delta} = g(\sigma_t, r_t, k_t, z_t, u_t) + \epsilon_t ]

Practical parameterization:

[ \sigma_{t+\delta} = a\sigma_t + b_1\cdot \text{POV}_t + b_2\cdot \text{crossFrac}_t + b_3\cdot \text{cancelShock}_t + b_4\cdot \text{latencyJitter}_t + \eta_t ]

where (b_1,b_2) represent execution-induced volatility sensitivity.

3) Feedback-adjusted slippage

For each candidate action (u_t):

predict immediate cost quantile (\hat{q}^{\text{base}})
simulate one-step volatility transition (\hat{\sigma}_{t+\delta}(u_t))
estimate marginal cost increase on remaining inventory

[ \text{EIVF}(u_t) \approx \frac{\partial \hat{q}{0.95}}{\partial \sigma} \cdot \big(\hat{\sigma}{t+\delta}(u_t)-\hat{\sigma}_{t+\delta}(u_t^{\text{neutral}})\big) ]

Final score:

[ \text{Score}(u_t)=\hat{q}{0.95}^{\text{base}}(u_t)+\gamma\cdot \text{EIVF}(u_t)+\lambda{\text{miss}}\cdot \widehat{\Pr}(\text{miss}|u_t) ]

Pick action with minimal score under constraints.

Causal estimation notes (important)

Naive regression overestimates/underestimates feedback because urgency is endogenous. Use at least one:

Micro-randomization windows: tiny randomized perturbation in aggression bucket under safe bounds.
Instrumental variables: queue-priority tie-break randomness, exchange micro-latency noise proxy.
Doubly robust correction: propensity model + outcome model for action assignment.
Regime-stratified estimation: estimate (b_1,b_2) by liquidity regime to avoid Simpson reversals.

Without this, EIVF control can become a confidence theater.

Controller design (state machine)

Define EIVF stress index:

[ \text{EIVFIndex}t = \frac{\hat{\sigma}{t+\delta}(u^{\text{planned}})-\hat{\sigma}{t+\delta}(u^{\text{neutral}})}{\hat{\sigma}{t+\delta}(u^{\text{neutral}})+\epsilon} ]

States

CALM: low EIVFIndex, healthy refill.
WATCH: rising EIVFIndex or cancel intensity spike.
STRESS: high EIVFIndex + weak refill + spread widening.
SAFE: hard risk breach.

Policy sketch

CALM → standard POV band, passive-first.
WATCH → reduce crossing fraction, increase child spacing, tighten venue filters.
STRESS → cap aggression, pause toxic venues, prioritize completion via safer slices.
SAFE → emergency throttling / temporary halt / human review rule.

Use hysteresis and minimum dwell time to prevent state flapping.

Online algorithm (pseudo)

for each decision tick t:
  observe state S_t = (x_t, tau_t, market features)
  for u in candidate_actions:
    q_base[u] = BaseQuantileModel.predict_q95(S_t, u)
    sigma_next[u] = VolFeedbackModel.predict(S_t, u)
    eivf[u] = dQdSigma(S_t) * (sigma_next[u] - sigma_next[neutral])
    miss[u] = FillModel.predict_miss_prob(S_t, u)
    score[u] = q_base[u] + gamma*eivf[u] + lambda_miss*miss[u]
  feasible = apply_hard_constraints(candidate_actions)
  u_t = argmin(score[u] for u in feasible)
  execute(u_t)
  update online diagnostics/calibration

What to log in production

Per child order:

selected action + top-3 rejected actions and score decomposition
predicted q50/q90/q95, predicted EIVF term
realized fill, markout(5s/30s/60s), spread change, volatility change
state transitions and trigger reasons

Daily diagnostics:

Calibration of (\widehat{\Delta\sigma}): MAE, directional accuracy.
q95 coverage reliability (target vs realized exceedance).
Regime-wise EIVF benefit: p95(IS)_baseline - p95(IS)_EIVF.
Completion-rate impact (avoid “tail improvement by underfilling”).
Action entropy (guard against controller collapse to one action).

Rollout plan (safe)

Shadow mode (2–4 weeks): score and recommend actions, do not control.
Canary traffic: 5% notional with strict kill criteria.
Progressive ramp: 5% → 15% → 30% if guardrails remain green.
Champion–challenger governance: weekly review with fixed promotion thresholds.

Hard rollback triggers:

q95 IS degradation > threshold,
miss probability surge,
calibration drift in (\Delta\sigma) model,
operational instability (state flapping, venue reject spikes).

Failure modes and mitigations

Over-throttling near deadline
- Mitigation: time-decay in (\gamma), explicit deadline penalty.
False positive stress detection
- Mitigation: multi-signal confirmation + dwell-time hysteresis.
Non-stationary microstructure
- Mitigation: rolling retrain + conformal residual monitor + drift alarms.
Cross-venue spillover not modeled
- Mitigation: venue interaction features and venue-level stress caps.
Policy gaming via metric focus
- Mitigation: monitor multiple objectives (q95 + completion + markout + rejects).

Practical implementation checklist

Define candidate action lattice (POV × aggression × spacing × venue mask)
Build base quantile slippage model with robust tails
Add one-step volatility feedback model with causal correction
Implement EIVF score decomposition and hard constraints
Add state machine with hysteresis and emergency safe mode
Instrument detailed decision logs for attribution
Run shadow→canary→ramp with explicit rollback criteria

References (starting points)

Almgren, R. & Chriss, N. (2000), optimal execution framework.
Bouchaud et al., propagator / transient impact literature.
Gatheral and follow-up work on no-dynamic-arbitrage impact structure.
Bacry, Mastromatteo, Muzy on Hawkes-style order-flow dynamics.
Recent TCA/quantile risk-control practice for p95/CVaR-focused execution.

Bottom line

A good slippage controller should optimize future path quality, not just immediate fill cost.
EIVF modeling turns “our flow changed the game board” from a postmortem sentence into an online control signal.