L4S DualQ ECN Misclassification Slippage Playbook
Date: 2026-03-26
Category: research
Audience: quant execution engineers operating latency-sensitive routers over mixed network paths
Why this matters
Many execution stacks now run over heterogeneous links (colo cross-connect, WAN backup, cloud overlays). If some paths use L4S DualQ behavior and others behave as classic ECN/drop networks, packet marking dynamics can shift abruptly.
That shift propagates into order-routing latency shape (not just mean):
- fill/ack timing decouples,
- cancel safety windows shrink,
- microburst risk increases,
- and realized slippage tails widen.
If your slippage model only sees "median RTT", it will miss this regime switch.
1) Cost decomposition with network-regime term
For child order (i):
[ C_i = C_{micro}(s_i, a_i) + C_{lat}(\Delta t_i) + C_{regime}(z_i) + \epsilon_i ]
Where:
- (C_{micro}): spread + queue position + impact terms,
- (C_{lat}): delay-linked drift/opportunity term,
- (C_{regime}): additive penalty from path regime state (z_i),
- (z_i \in {\text{Classic-stable}, \text{L4S-stable}, \text{Mixed/uncertain}}).
A practical approximation:
[ C_{regime} \approx \beta_1,p_{mix} + \beta_2,\text{CEBurst95} + \beta_3,\text{RTTBimodality} ]
Interpretation: mixed signaling + bursty CE marks + bimodal RTT are direct predictors of tail slippage.
2) Data contract (point-in-time required)
A) Transport/network features
- ECN codepoint usage ratios: Not-ECT / ECT(0) / ECT(1)
- CE mark rate and burst metrics (p95 burst length, inter-burst gap)
- RTT quantiles + bimodality score (e.g., Hartigan dip / 2-component GMM weight)
- Retransmission, reordering, pacing gain-cycle proxies
- Path ID / NIC queue / egress class metadata
B) Execution-path features
- decision→gateway latency
- gateway→venue ack latency
- cancel→ack latency and p99 tail
- reject/retry frequency by venue and route
C) Market microstructure features
- spread, top depth, queue imbalance
- quote age / fade intensity
- local event flags (open/close auction windows, macro release windows)
Freshness SLO: transport features older than 1–2s in fast markets should invalidate aggressive routing policies.
3) Modeling stack
A) Regime classifier (online)
Train a 3-state classifier for (z_i):
- Classic-stable
- L4S-stable
- Mixed/uncertain
Use online-calibrated probabilities (p(z_i = k)) rather than hard labels.
B) Conditional slippage model
Predict (q50/q90/q95) slippage conditioned on:
- microstructure state,
- urgency,
- and regime probabilities (p(z)).
Gradient-boosted quantile models or distributional models (e.g., NGBoost) work well in practice.
C) Tail-risk correction
Maintain EVT/CVaR overlay per regime bucket:
[ \widehat{CVaR}{\alpha} = f\big(\widehat{q}{\alpha}, \text{tail index}, z\big) ]
Use this for hard execution guardrails, not for dashboard vanity.
4) Policy layer (turn model into action)
Score each candidate route/action (a):
[ \text{Score}(a)=\mathbb{E}[C|a] + \lambda_{tail}\widehat{CVaR}{0.95}(a) + \lambda{unc},p_{mix}(a) ]
Then enforce policy states:
- GREEN: low (p_{mix}), normal participation.
- YELLOW: rising CE bursts or RTT bimodality; reduce child size, tighten cancel guards.
- ORANGE: persistent mixed regime; prefer robust venues/routes, lower aggressiveness caps.
- RED: severe instability; fail safe (wider safety buffers, throttle routing fan-out, optional passive-only mode on fragile symbols).
No manual heroics. Encode transitions and rollback triggers explicitly.
5) Production KPIs
- MRR (Mixed-Regime Ratio): fraction of flow with (p_{mix} > \tau)
- CEB95: p95 CE-burst length
- RBI (RTT Bimodality Index): regime-separation score
- CAA95: cancel-ack p95/p99 inflation vs baseline
- TEX95: tail exceedance rate (realized > predicted q95)
Alert pattern worth paging:
- MRR up + CAA95 up + TEX95 up simultaneously for >10–15 min.
6) Validation ladder
- Historical replay with reconstructed transport telemetry.
- Counterfactual route test: compare policies under identical market slices.
- Shadow mode: score-only, no action changes.
- Canary rollout on low-risk symbols/venues with automatic RED rollback.
Critical trap: training with telemetry aggregated post-trade (non-causal). Use strictly point-in-time joins.
7) Two-week implementation sketch
Days 1–3
Define PIT schema for ECN/RTT/ack/cancel telemetry, with freshness checks.
Days 4–6
Train online regime classifier + calibration monitor.
Days 7–9
Train regime-conditioned quantile slippage model + tail overlay.
Days 10–11
Integrate GREEN/YELLOW/ORANGE/RED policy logic and route scoring.
Days 12–13
Run shadow evaluation and stress-event backtests.
Day 14
Canary launch with conservative tail budget and automatic rollback hooks.
Bottom line
L4S-era execution risk is less about average latency and more about regime instability.
If you model mixed ECN/queue behavior explicitly (rather than burying it inside generic latency features), you can control slippage tails before they become incident-level losses.
References
RFC 9330 — L4S Internet Service: Architecture
https://www.rfc-editor.org/rfc/rfc9330.htmlRFC 9331 — ECN Semantics for L4S
https://www.rfc-editor.org/rfc/rfc9331.htmlRFC 9332 — DualQ Coupled AQM for L4S
https://www.rfc-editor.org/rfc/rfc9332.htmlRFC 3168 — The Addition of ECN to IP
https://www.rfc-editor.org/rfc/rfc3168RFC 8311 — Relaxing ECN Restrictions for Experimentation
https://www.rfc-editor.org/rfc/rfc8311.html