L4S DualQ ECN Misclassification Slippage Playbook

Date: 2026-03-26
Category: research
Audience: quant execution engineers operating latency-sensitive routers over mixed network paths

Why this matters

Many execution stacks now run over heterogeneous links (colo cross-connect, WAN backup, cloud overlays). If some paths use L4S DualQ behavior and others behave as classic ECN/drop networks, packet marking dynamics can shift abruptly.

That shift propagates into order-routing latency shape (not just mean):

fill/ack timing decouples,
cancel safety windows shrink,
microburst risk increases,
and realized slippage tails widen.

If your slippage model only sees "median RTT", it will miss this regime switch.

1) Cost decomposition with network-regime term

For child order (i):

[ C_i = C_{micro}(s_i, a_i) + C_{lat}(\Delta t_i) + C_{regime}(z_i) + \epsilon_i ]

Where:

(C_{micro}): spread + queue position + impact terms,
(C_{lat}): delay-linked drift/opportunity term,
(C_{regime}): additive penalty from path regime state (z_i),
(z_i \in {\text{Classic-stable}, \text{L4S-stable}, \text{Mixed/uncertain}}).

A practical approximation:

[ C_{regime} \approx \beta_1,p_{mix} + \beta_2,\text{CEBurst95} + \beta_3,\text{RTTBimodality} ]

Interpretation: mixed signaling + bursty CE marks + bimodal RTT are direct predictors of tail slippage.

2) Data contract (point-in-time required)

A) Transport/network features

ECN codepoint usage ratios: Not-ECT / ECT(0) / ECT(1)
CE mark rate and burst metrics (p95 burst length, inter-burst gap)
RTT quantiles + bimodality score (e.g., Hartigan dip / 2-component GMM weight)
Retransmission, reordering, pacing gain-cycle proxies
Path ID / NIC queue / egress class metadata

B) Execution-path features

decision→gateway latency
gateway→venue ack latency
cancel→ack latency and p99 tail
reject/retry frequency by venue and route

C) Market microstructure features

spread, top depth, queue imbalance
quote age / fade intensity
local event flags (open/close auction windows, macro release windows)

Freshness SLO: transport features older than 1–2s in fast markets should invalidate aggressive routing policies.

3) Modeling stack

A) Regime classifier (online)

Train a 3-state classifier for (z_i):

Classic-stable
L4S-stable
Mixed/uncertain

Use online-calibrated probabilities (p(z_i = k)) rather than hard labels.

B) Conditional slippage model

Predict (q50/q90/q95) slippage conditioned on:

microstructure state,
urgency,
and regime probabilities (p(z)).

Gradient-boosted quantile models or distributional models (e.g., NGBoost) work well in practice.

C) Tail-risk correction

Maintain EVT/CVaR overlay per regime bucket:

[ \widehat{CVaR}{\alpha} = f\big(\widehat{q}{\alpha}, \text{tail index}, z\big) ]

Use this for hard execution guardrails, not for dashboard vanity.

4) Policy layer (turn model into action)

Score each candidate route/action (a):

[ \text{Score}(a)=\mathbb{E}[C|a] + \lambda_{tail}\widehat{CVaR}{0.95}(a) + \lambda{unc},p_{mix}(a) ]

Then enforce policy states:

GREEN: low (p_{mix}), normal participation.
YELLOW: rising CE bursts or RTT bimodality; reduce child size, tighten cancel guards.
ORANGE: persistent mixed regime; prefer robust venues/routes, lower aggressiveness caps.
RED: severe instability; fail safe (wider safety buffers, throttle routing fan-out, optional passive-only mode on fragile symbols).

No manual heroics. Encode transitions and rollback triggers explicitly.

5) Production KPIs

MRR (Mixed-Regime Ratio): fraction of flow with (p_{mix} > \tau)
CEB95: p95 CE-burst length
RBI (RTT Bimodality Index): regime-separation score
CAA95: cancel-ack p95/p99 inflation vs baseline
TEX95: tail exceedance rate (realized > predicted q95)

Alert pattern worth paging:

MRR up + CAA95 up + TEX95 up simultaneously for >10–15 min.

6) Validation ladder

Historical replay with reconstructed transport telemetry.
Counterfactual route test: compare policies under identical market slices.
Shadow mode: score-only, no action changes.
Canary rollout on low-risk symbols/venues with automatic RED rollback.

Critical trap: training with telemetry aggregated post-trade (non-causal). Use strictly point-in-time joins.

7) Two-week implementation sketch

Days 1–3
Define PIT schema for ECN/RTT/ack/cancel telemetry, with freshness checks.

Days 4–6
Train online regime classifier + calibration monitor.

Days 7–9
Train regime-conditioned quantile slippage model + tail overlay.

Days 10–11
Integrate GREEN/YELLOW/ORANGE/RED policy logic and route scoring.

Days 12–13
Run shadow evaluation and stress-event backtests.

Day 14
Canary launch with conservative tail budget and automatic rollback hooks.

Bottom line

L4S-era execution risk is less about average latency and more about regime instability.

If you model mixed ECN/queue behavior explicitly (rather than burying it inside generic latency features), you can control slippage tails before they become incident-level losses.

References

RFC 9330 — L4S Internet Service: Architecture
https://www.rfc-editor.org/rfc/rfc9330.html
RFC 9331 — ECN Semantics for L4S
https://www.rfc-editor.org/rfc/rfc9331.html
RFC 9332 — DualQ Coupled AQM for L4S
https://www.rfc-editor.org/rfc/rfc9332.html
RFC 3168 — The Addition of ECN to IP
https://www.rfc-editor.org/rfc/rfc3168
RFC 8311 — Relaxing ECN Restrictions for Experimentation
https://www.rfc-editor.org/rfc/rfc8311.html