Latency-Arbitrage Shadowing & Quote-Fade Slippage Playbook

2026-03-10 · finance

Latency-Arbitrage Shadowing & Quote-Fade Slippage Playbook

Date: 2026-03-10
Category: research (quant execution / slippage modeling)

Why this matters

Many execution stacks still assume that if top-of-book looks good at decision time, passive joins or light aggression should be fine.

In practice, when fast participants detect stale or vulnerable quotes, they can shadow and fade before slower flow lands:

This is not just generic "high volatility". It is a latency-arbitrage interaction regime with distinct timing signatures.


Core concept: Shadow-Fade Tax (SFT)

Model expected child-order cost as:

[ \text{SFT} = \text{StaleHitMarkout} + \text{FadeMissChaseTax} + \text{RetryTimingTax} - \text{SpreadCapture} ]

Execution objective:

[ \min_a; \mathbb{E}[C|a] + \lambda,\mathrm{CVaR}_{95}(C|a) + \eta,\mathrm{MissRisk}(a) ]

where action (a) controls join/improve/take choice, price offsets, retry cooldown, and venue mix.


Detection signals (shadowing observability)

Build a Shadowing Pressure Score (SPS) from:

  1. Quote Fade Lead (QFL)
    Median time between decision-time top quote and its cancellation/price shift before arrival.

  2. Stale Interaction Rate (SIR)
    Fraction of child orders interacting with quotes that were top-of-book only briefly before adverse move.

  3. Microprice Drift-to-Latency Ratio (MDLR)
    Microprice drift during order transit divided by end-to-end decision latency.

  4. Retry Adverse Escalation (RAE)
    Incremental cost between first attempt and subsequent retries in short windows.

  5. Queue Quality Collapse (QQC)
    Effective executable depth decay vs displayed depth under interaction pressure.

State mapping example:

Use hysteresis + minimum dwell time to avoid flapping.


Modeling stack

Layer 1 — Child branch model

Estimate branch outcomes:

Recommended targets:

Layer 2 — Episode model

Aggregate child outcomes into parent-order episode cost:

This prevents “average fill looked okay” blindness when tail losses concentrate in SHADOWED/PREDATORY windows.


Control policy design

  1. Latency-aware aggression floor
    In SHADOWED/PREDATORY states, raise minimum aggression to avoid repeated stale touches.

  2. Retry cooldown + cap
    Hard-limit rapid retries when RAE exceeds threshold; enforce cooldown before re-entry.

  3. Dynamic quote validity window
    Tighten acceptable quote-age/validity gates as MDLR rises.

  4. Venue resilience weighting
    Route toward venues with better realized quote survival under stress, not just headline spread.

  5. Tail-budget kill switch
    If q95 burn rate breaches limit, move to SAFE profile (completion-first, lower path complexity).


Data contract (minimum)

Per child order:

Without timestamp quality and attempt linkage, shadowing attribution collapses into noise.


Calibration & monitoring loop

Weekly

Daily

Monitor:

Intraday guards


Rollout plan

  1. Shadow mode (2 weeks): compute SPS and simulated actions only.
  2. Canary (5–10% notional): enable retry controls + quote-validity gates.
  3. Scale-up: add venue weighting and state-specific aggression floors.
  4. Rollback triggers: completion drop, q95 blowout, or state instability.

Common failure modes


Bottom line

When latency-arbitrage shadowing is active, slippage is driven less by displayed spread and more by who reaches the quote first.

Model stale-hit and fade-miss branches explicitly, then control retries and quote-validity dynamically. That is how you keep small timing disadvantages from compounding into large tail-cost leakage.