TSO/GSO Segmentation-Burst Serialization Slippage Playbook

2026-03-19 · finance

TSO/GSO Segmentation-Burst Serialization Slippage Playbook

Why this exists

Execution stacks can show acceptable median latency while still leaking p95/p99 implementation shortfall.

One under-modeled source is transmit-path burst serialization from offload behavior:

When this path is ignored, desks often classify the outcome as "market randomness" instead of a repeatable host-side timing tax.


Core failure mode

In low-latency execution, child-order schedules are usually designed in fine time granularity. But TX offload can warp the realized wire-time pattern:

  1. Strategy issues smooth child intents.
  2. Kernel/NIC aggregates payloads (TSO/GSO path).
  3. Wire sees short microbursts with larger serialization blocks.
  4. ACK/feedback timing gets phase-distorted (sometimes compressed, sometimes delayed).
  5. Follow-up decisions run on mis-timed control feedback.
  6. Queue placement quality degrades and fallback aggression rises.

Result: tail slippage inflation without obvious spread/volatility anomalies.


Slippage decomposition with TX-burst term

For parent order (i):

[ IS_i = C_{delay} + C_{impact} + C_{miss} + C_{tx-burst} ]

Where:

[ C_{tx-burst} = C_{wire-phase} + C_{feedback-distortion} + C_{queue-reset} ]


Feature set (production-ready)

1) TX-path/offload features

2) Control-loop timing features

3) Outcome features


Practical metrics

Track by host class, NIC model/driver, kernel version, and session segment (open/mid/close/event windows).


Model architecture

Use baseline + infra-overlay:

  1. Baseline slippage model
    • spread/impact/urgency/deadline in infra-clean assumptions
  2. TX-burst overlay model
    • incremental mean/tail uplift from SBF/WST95/ACR/CPE

Final estimator:

[ \hat{IS}{final} = \hat{IS}{baseline} + \Delta\hat{IS}_{tx-burst} ]

Calibration rule: compare like-for-like market states (liquidity, vol, participation bucket) so offload-induced timing distortion is not confounded with regime volatility.


Regime controller

State A: WIRE_CLEAN

State B: BURSTING

State C: DEPHASED

State D: SAFE_TX_CONTAIN

Use hysteresis and minimum dwell time to avoid oscillatory policy flips.


Mitigation ladder

  1. Offload policy by role
    • execution-critical interfaces may need stricter TSO/GSO posture than throughput-oriented roles
  2. Queueing discipline alignment
    • validate qdisc + pacing assumptions against real wire shape
  3. TX queue topology hygiene
    • map critical threads and NIC queues to minimize incidental contention
  4. Control-loop damping
    • avoid high-frequency amend/cancel reactions when ACK compression rises
  5. Post-change recalibration
    • retrain overlay after kernel, driver, NIC firmware, or qdisc/pacing changes

Failure drills (must run)

  1. Synthetic burst-shape replay
    • verify WIRE_CLEAN -> BURSTING -> DEPHASED transitions
  2. Offload A/B drill
    • controlled TSO/GSO policy comparison with BTU tail impact tracking
  3. ACK-path stress drill
    • validate controller behavior under compressed/delayed feedback timing
  4. Containment failover drill
    • deterministic shift to safer host/path profile under sustained dephasing

Anti-patterns


Bottom line

TSO/GSO behavior can silently reshape execution timing by turning smooth intent into bursty wire reality.

If you model that transmit-path distortion explicitly, slippage tails become attributable and controllable. If you do not, you keep paying a recurring basis-point tax and mislabeling it as market noise.