Trade-Report Latency and Volume-Illusion Drift in POV Slippage Models
Date: 2026-04-11
Category: research (execution / slippage modeling)
Why this playbook exists
A percent-of-volume (POV) algo looks simple:
- observe market volume,
- target a participation rate,
- speed up when behind,
- slow down when ahead.
In production, the dangerous assumption is hidden in step one.
The algo usually does not observe true actionable market volume in real time. It observes a noisy, delayed, condition-filtered, sometimes corrected reported-volume stream.
That gap matters because pacing logic is denominator-driven.
If your denominator is wrong, the controller misclassifies its own aggressiveness:
- late prints make you look too aggressive, so you slow down when you should not,
- bursty catch-up reports make you look too passive, so you lunge when you should not,
- ineligible or specially conditioned prints pollute the volume baseline,
- corrections and cancellations rewrite the history the controller thought it was tracking.
The result is a distinctive slippage pattern:
- early-session pacing error,
- later catch-up urgency,
- wrong-way aggression during already-stressed windows,
- benchmark drift versus the participation target you thought you followed.
This note turns that failure mode into a modeling and control blueprint.
The core failure mode: observed volume is not decision-grade volume
For a live POV controller, at least three clocks matter:
- Execution clock — when market trades actually occur.
- Observation clock — when those trades become visible to your controller.
- Eligibility clock — when you know whether those trades should count toward your pacing denominator at all.
Those clocks drift because of things like:
- off-exchange reporting latency,
- trade-report batching,
- odd-lot handling,
- sale-condition filtering,
- corrected / canceled prints,
- venue-specific reporting paths,
- session-state transitions around open, reopen, and close.
A raw tape-volume number is therefore not the same thing as:
market volume that was real, already knowable, and eligible for your pacing logic.
That mismatch is the volume illusion.
Mechanism map
1. Early under-observation
The controller sees less market volume than has actually occurred.
So measured participation looks too high:
- you think you already traded 8% of volume,
- true participation may only be 5.5%,
- controller slows down,
- residual inventory accumulates.
2. Delayed denominator repair
Late prints land in a burst.
Now measured participation suddenly looks too low:
- denominator jumps,
- controller thinks it is badly behind schedule,
- child-order aggression increases abruptly,
- slippage rises exactly when the market may already be moving.
3. Ineligible-volume contamination
Not all disseminated prints should necessarily drive the same pacing response.
Examples:
- late reports that describe a trading opportunity that has already passed,
- special-condition trades with weak relevance for near-term lit liquidity,
- prints that alter historical stats but say little about the current execution opportunity,
- odd-lot or off-exchange flow that changes the denominator without changing available fill opportunity in the same proportion.
If the controller treats every visible print as equally actionable volume, it learns the wrong market clock.
4. Correction / cancel rewrite
A later cancel, correction, or reclassification changes the denominator after the controller already acted on it.
That creates a hidden feedback loop:
stale denominator -> pacing error -> catch-up impact -> revised denominator -> misleading TCA if replay is not as-of correct
A better abstraction: true eligible volume vs observed reported volume
Define:
- (V^*_{elig}(t)): cumulative event-time eligible market volume up to decision time (t),
- (V_{obs}(t)): cumulative as-of observed reported volume available to the controller at time (t),
- (Q(t)): shares already executed by the algo,
- (\rho): target participation rate.
The controller usually wants:
[ Q^*_{target}(t) = \rho \cdot V^*_{elig}(t) ]
But if it uses observed volume instead, it actually tracks:
[ Q_{target}^{obs}(t) = \rho \cdot V_{obs}(t) ]
The pacing error you should care about is:
[ e^*(t) = Q(t) - \rho V^*_{elig}(t) ]
What the controller thinks is:
[ e_{obs}(t) = Q(t) - \rho V_{obs}(t) ]
The difference is the volume-illusion term:
[ \Delta_V(t) = V_{obs}(t) - V^*_{elig}(t) ]
So:
[ e_{obs}(t) - e^*(t) = -\rho \Delta_V(t) ]
Interpretation:
- if (\Delta_V < 0), observed volume is too low and the controller thinks it is more aggressive than reality,
- if (\Delta_V > 0), observed volume is inflated and the controller thinks it is more passive than reality.
That sign mistake is enough to flip pacing decisions.
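The identity above is easy to sanity-check numerically. A minimal sketch, with made-up volumes and the symbols defined in this section (rho, V_elig, V_obs, Q):

```python
# Numeric check of the volume-illusion identity, with illustrative values.
rho = 0.08            # target participation rate
V_elig = 1_000_000    # true eligible market volume so far (shares)
V_obs = 850_000       # volume the controller has actually seen (late prints missing)
Q = 76_000            # shares the algo has executed

e_true = Q - rho * V_elig     # pacing error vs. reality (negative: truly behind)
e_obs = Q - rho * V_obs       # pacing error the controller computes (positive: looks ahead)
delta_v = V_obs - V_elig      # volume-illusion term, here negative

# Identity from the text: e_obs - e_true == -rho * Delta_V
assert abs((e_obs - e_true) - (-rho * delta_v)) < 1e-9

# Delta_V < 0 flips the sign of the perceived error: the controller
# believes it is ahead of schedule and slows down, while true
# participation (7.6%) is actually below the 8% target.
assert e_true < 0 < e_obs
```

Note that the sign flip happens well before the magnitudes look alarming: a 15% shortfall in the observed denominator is enough to turn "behind" into "ahead."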
Cost decomposition
A useful decomposition is:
[ C_{total} = C_{base} + C_{pace} + C_{catchup} + C_{filter} + C_{rewrite} ]
Where:
- (C_{base}): unavoidable execution cost under current liquidity,
- (C_{pace}): cost from wrong real-time participation estimate,
- (C_{catchup}): extra impact from denominator-repair bursts,
- (C_{filter}): cost from counting the wrong kinds of prints,
- (C_{rewrite}): TCA / control error caused by corrections, cancels, or late reclassification.
A practical approximation:
[ C_{pace} \approx \kappa_1 |e^*(t)| + \kappa_2 \max(0, -e^*(T_{deadline})) ]
and
[ C_{catchup} \approx \kappa_3 \sum_t |\Delta a_t| \cdot \sigma_t \cdot \text{spread}_t ]
where (\Delta a_t) is the change in aggression caused by denominator shocks.
This matters because the controller often looks acceptable in average participation terms while quietly paying a convex catch-up tax.
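The catch-up term can be sketched directly from the approximation above. All values here are illustrative placeholders (kappa_3, the per-interval aggression path, volatility, and spread are not calibrated numbers):

```python
# Hedged sketch of C_catchup ~ kappa_3 * sum |Δa_t| * sigma_t * spread_t.
kappa_3 = 0.5
aggression = [0.10, 0.10, 0.35, 0.20, 0.20]   # per-interval aggression a_t
sigma =      [1.0,  1.0,  1.8,  1.5,  1.2]    # per-interval volatility
spread =     [0.01, 0.01, 0.02, 0.015, 0.01]  # per-interval spread

c_catchup = kappa_3 * sum(
    abs(a1 - a0) * s * sp
    for a0, a1, s, sp in zip(aggression, aggression[1:], sigma[1:], spread[1:])
)
# The denominator shock at t=2 (0.10 -> 0.35) dominates the sum: the big
# |Δa| lands exactly when sigma and spread are elevated, which is the
# convex catch-up tax the text describes.
```

The point of the sketch is that the cost is driven by *changes* in aggression coinciding with stressed intervals, not by the average aggression level.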
Grounding observations from public market-structure docs
A few public facts make this problem very real:
- Public FINRA / SEC materials state that OTC equity trades must be reported as soon as practicable and generally no later than 10 seconds after execution, which means “not instantaneous” is built into the reporting regime.
- Public FINRA guidance also makes clear that some reported trades can later be canceled or reversed to remove them from the tape.
- Nasdaq public technical notes explain that odd-lot executions are disseminated with a dedicated modifier and are included in volume statistical calculations, even though they do not play the same role as round-lot displayed liquidity.
That combination alone is enough to break naive POV logic:
- reported volume can arrive late,
- reported volume can be revised,
- reported volume can be statistically relevant but not equally decision-relevant.
This is not a theoretical edge case. It is a denominator-quality problem built into the data path.
Feature set for modeling
A. Reporting-delay features
- report_delay_ms_p50/p95/p99 by venue / reporting path / symbol
- late_report_ratio_1m
- burst_report_count_100ms/1s
- interarrival_cv of reported prints
- report_delay_skew by trade-size bucket
- delay_tail_share for top 1% of print delays
B. Eligibility / condition-mix features
- odd_lot_share
- off_exchange_share
- condition_mix_entropy
- eligible_volume_ratio under current pacing rules
- corrected_or_canceled_print_ratio
- volume_added_after_delay_ratio
C. Market-state features
- spread, microprice, queue imbalance
- top-of-book depth and refill rate
- realized volatility over 100ms / 1s / 5s
- open / midday / close / halt / reopen phase
- symbol ADV bucket and trade-size granularity
D. Controller-state features
- current measured participation
- estimated true participation
- residual parent quantity
- time-to-deadline
- recent aggression changes
- last denominator jump size
- catch-up mode dwell time
The key is that denominator-quality features belong in the slippage model, not only in post-trade diagnostics.
Operational metrics
1. VAS — Volume As-of Skew
[ VAS(t) = V_{obs}(t) - V^*_{elig}(t) ]
The signed real-time denominator error.
2. RPE — Real-time Participation Error
[ RPE(t) = \frac{Q(t)}{V_{obs}(t)} - \frac{Q(t)}{V^*_{elig}(t)} ]
How wrong your self-measured participation is.
3. DRB — Denominator Repair Burst
[ DRB = \max_{\Delta t} \frac{\Delta V_{obs}(\Delta t) - \Delta V^*_{elig}(\Delta t)}{ADV} ]
Captures bursty late-volume repairs.
4. CAT — Catch-up Aggression Tax
[ CAT = \frac{\text{cost incurred while catching up after denominator shocks}}{\text{executed notional}} ]
This is often the business KPI that matters most.
5. EVM — Eligible Volume Mismatch
[ EVM = 1 - \frac{\sum_t \min(\Delta V_{obs}^{used}(t), \Delta V^*_{elig}(t))}{\sum_t \max(\Delta V_{obs}^{used}(t), \Delta V^*_{elig}(t))} ]
A useful summary statistic for replay evaluation.
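These metrics fall straight out of the two per-interval volume curves. A minimal sketch with hypothetical increments (treating the observed series as the "used" denominator volume):

```python
# VAS, RPE, and EVM from paired per-interval volume increments.
dV_obs  = [100, 0,   50, 400, 150]   # observed reported volume per interval
dV_elig = [180, 160, 90, 120, 150]   # event-time eligible volume per interval
Q = 40                               # shares executed so far

V_obs = sum(dV_obs)
V_elig = sum(dV_elig)

vas = V_obs - V_elig                 # signed denominator error
rpe = Q / V_obs - Q / V_elig         # participation self-measurement error
evm = 1 - (sum(min(o, e) for o, e in zip(dV_obs, dV_elig))
           / sum(max(o, e) for o, e in zip(dV_obs, dV_elig)))

# Here the totals agree (VAS == 0, RPE == 0) yet EVM ~ 0.57: the volume
# arrived at the wrong times, which end-of-day aggregates cannot see.
assert vas == 0 and evm > 0.5
```

This is why EVM belongs in replay evaluation: a controller can end the day with a clean aggregate participation number while having paced against a badly mistimed denominator throughout.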
State machine and controls
CALIBRATED
- Normal POV pacing.
- Standard denominator based on nowcasted eligible volume.
- Small control adjustments allowed.
UNDER_OBSERVED
Triggered when VAS is materially negative.
- Downweight measured participation.
- Slow the rate at which the controller decelerates.
- Prefer patience over premature “I’m already ahead” conclusions.
FALSE_SURGE
Triggered when late / bursty prints inflate observed denominator.
- Clip denominator jumps per control interval.
- Smooth catch-up target over multiple intervals.
- Prevent one burst of repaired volume from causing one burst of impact.
FILTER_DRIFT
Triggered when condition mix shifts and eligible-volume ratio deteriorates.
- Tighten print eligibility rules.
- Switch to venue-aware denominator estimates.
- Reduce trust in naive consolidated-volume counts.
SAFE_HYBRID_PACING
Triggered when denominator quality is poor for too long.
- Blend POV with a time-based schedule.
- Cap incremental aggression changes.
- Use deadline-aware residual control instead of raw denominator chasing.
This fail-safe matters because a bad denominator should degrade into a slower, safer controller — not a panicky one.
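The transitions above can be sketched as a simple priority-ordered classifier. All thresholds here are illustrative assumptions, not tuned values, and the metric names follow the definitions in the previous section:

```python
# Minimal sketch of the denominator-quality state machine.
# Thresholds are placeholders; in practice they would be calibrated
# per symbol / liquidity bucket from the Layer-1 replay data.
def pacing_state(vas_over_adv, drb, evm, bad_quality_secs):
    """Map denominator-quality metrics to a pacing state."""
    if bad_quality_secs > 300:      # quality poor for too long -> fail-safe
        return "SAFE_HYBRID_PACING"
    if evm > 0.4:                   # eligibility / condition mix drifting
        return "FILTER_DRIFT"
    if drb > 0.01:                  # bursty late-volume repair vs ADV
        return "FALSE_SURGE"
    if vas_over_adv < -0.005:       # materially under-observed volume
        return "UNDER_OBSERVED"
    return "CALIBRATED"

print(pacing_state(0.0, 0.0, 0.1, 0))     # CALIBRATED
print(pacing_state(-0.02, 0.0, 0.1, 0))   # UNDER_OBSERVED
```

The ordering matters: the persistent-degradation fail-safe outranks every per-metric trigger, which is exactly the "degrade into a slower, safer controller" property described above.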
Modeling blueprint
Layer 1: event-time eligible-volume reconstruction
Build a replayable table with at least:
- trade_event_ts
- report_arrival_ts
- condition_code
- venue_or_reporting_path
- is_eligible_for_pacing
- was_corrected
- was_canceled
- shares
Then reconstruct two cumulative curves:
- event-time eligible volume,
- as-of observable volume.
Never evaluate a POV controller on only one clock.
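A minimal sketch of the two-clock reconstruction, with hypothetical trade records carrying both timestamps from the table above:

```python
# Two cumulative views of the same trades: event-time truth vs. as-of
# observability. Records are (event_ts, arrival_ts, shares, eligible).
trades = [
    (1.0, 1.1, 100, True),
    (2.0, 5.5, 300, True),    # late report: traded at t=2, visible at t=5.5
    (3.0, 3.1, 200, False),   # special-condition print: not pacing-eligible
    (4.0, 4.2, 150, True),
]

def eligible_volume_by_event_time(t):
    """Event-time truth: everything eligible that actually traded by t."""
    return sum(sh for ev, _, sh, el in trades if el and ev <= t)

def observed_volume_as_of(t):
    """As-of view: only eligible volume the controller could have seen by t."""
    return sum(sh for _, arr, sh, el in trades if el and arr <= t)

# At t=4.5 the late report has not landed yet, so the controller's
# denominator is short by 300 shares even though that volume is real.
assert eligible_volume_by_event_time(4.5) == 550
assert observed_volume_as_of(4.5) == 250
```

Replaying the controller against the first curve answers "was it actually on target"; replaying against the second answers "did it behave sensibly given what it knew." Both questions need an answer.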
Layer 2: pending-volume nowcast
Estimate the volume that likely already happened but has not yet become visible:
[ \hat{V}_{pending}(t) = E[V^*_{elig}(t) - V_{obs}(t) \mid x_t] ]
Then set:
[ \hat{V}_{elig}(t) = V_{obs}^{eligible}(t) + \hat{V}_{pending}(t) ]
This becomes the denominator for live control.
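One simple form of the nowcast scales recent observed volume by a historically estimated late-report ratio. A hedged sketch, where the ratio is a placeholder that would come from the Layer-1 replay table in practice:

```python
# Pending-volume nowcast: V_hat_elig = observed eligible volume plus an
# estimate of volume that already traded but has not yet been reported.
late_ratio = 0.18   # historical share of eligible volume arriving late
                    # (assumed value; estimated per symbol / time-of-day)

def nowcast_eligible(v_obs_eligible, recent_window_volume):
    """Observed eligible volume plus estimated still-pending volume."""
    v_pending_hat = late_ratio * recent_window_volume
    return v_obs_eligible + v_pending_hat

v_hat = nowcast_eligible(v_obs_eligible=500_000, recent_window_volume=60_000)
assert v_hat == 500_000 + late_ratio * 60_000
```

A richer conditional model (on delay features, venue mix, session phase) can replace the flat ratio, but even this crude correction removes the systematic "slow down early, sprint late" bias.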
Layer 3: delay-aware participation target
Use:
[ Q_{target}(t) = \rho \hat{V}_{elig}(t) ]
not raw observed tape volume.
Layer 4: shock-absorbing control law
Instead of chasing the denominator instantly, update aggression with something like:
[ a_{t+1} = a_t + \alpha \cdot \text{clip}(Q_{target}(t)-Q(t), -b, b) ]
where:
- (\alpha) is responsiveness,
- (b) is a cap on control-step size.
That single clip often saves more slippage than adding more predictive complexity.
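The clipped update is a few lines. A minimal sketch, with illustrative values for the responsiveness (\alpha) and the step cap (b):

```python
# Shock-absorbing control law: a_{t+1} = a_t + alpha * clip(Q_target - Q, -b, b).
def clip(x, lo, hi):
    return max(lo, min(hi, x))

def update_aggression(a_t, q_target, q_done, alpha=1e-5, b=5_000):
    """One control step; alpha and b are illustrative, not tuned."""
    return a_t + alpha * clip(q_target - q_done, -b, b)

# A 40,000-share denominator shock is clipped to 5,000 shares of shortfall
# before it can move aggression, so one burst of repaired volume cannot
# cause one burst of impact.
a_next = update_aggression(a_t=0.10, q_target=90_000, q_done=50_000)
assert abs(a_next - 0.15) < 1e-9
```

Without the clip the same shock would move aggression by 0.40 in a single step; with it, the repair is spread across multiple intervals by construction.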
Layer 5: denominator-quality-conditioned slippage model
Predict slippage conditional on both market state and denominator quality:
[ E[C \mid x_t, a_t, VAS_t, DRB_t, EVM_t] ]
This tells you whether current observed volume is trustworthy enough to support true POV behavior.
Practical policy rules
Rule 1: separate visible volume from usable volume
Do not let the same field power:
- tape display,
- TCA reports,
- live POV pacing,
- benchmark audit.
Each needs its own semantics.
Rule 2: clip denominator jumps
Late repaired volume should rarely cause one-shot participation repair.
Better:
- amortize catch-up over several control intervals,
- preserve a maximum aggression delta per second,
- tolerate temporary participation miss if the alternative is impact convexity.
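Amortization of a repaired burst can be sketched in a few lines; the interval count is an assumption that would be tuned per liquidity bucket:

```python
# Spread a late-volume denominator jump evenly over the next N control
# intervals instead of repairing participation in one shot.
def amortize_jump(jump_shares, n_intervals=5):
    """Evenly amortize a denominator jump; n_intervals is illustrative."""
    per_interval = jump_shares / n_intervals
    return [per_interval] * n_intervals

# A 250k-share burst of late prints becomes five 50k increments, so the
# pacing target drifts up smoothly rather than demanding an instant lunge.
assert amortize_jump(250_000) == [50_000.0] * 5
```

Smarter schedules (front-loaded, liquidity-weighted) are possible, but the even split already enforces the maximum-aggression-delta-per-second property described above.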
Rule 3: use event-time replays for evaluation
A backtest that only uses cleaned final tape will flatter the controller.
You need:
- as-of replay for what the controller actually knew,
- event-time truth for what actually happened.
Without both, you will underestimate denominator-induced slippage.
Rule 4: make eligibility explicit
Every strategy should document which prints count toward its pacing denominator.
If the answer is “all consolidated prints,” you probably have not thought hard enough.
Rule 5: degrade gracefully near deadlines
When denominator quality is bad and deadline pressure is rising, switch objective:
- from exact participation tracking,
- to bounded residual-risk minimization.
That is often the difference between a small benchmark miss and an ugly late-day sprint.
30-day rollout plan
Week 1 — Instrument the clocks
- Store both event-time and arrival-time for all trade reports used by pacing.
- Add condition-code and correction-state fields.
- Baseline VAS / RPE / DRB by symbol and time of day.
Week 2 — Shadow denominator nowcast
- Build (\hat{V}_{elig}(t)) in parallel.
- Compare measured vs true participation in replay.
- Rank symbols / venues by denominator instability.
Week 3 — Conservative control activation
- Clip denominator jumps.
- Introduce SAFE_HYBRID_PACING for worst offenders.
- Activate venue / condition-aware eligibility filtering.
Week 4 — Tail optimization
- Optimize CAT, not just mean slippage.
- Tune catch-up smoothing by liquidity bucket.
- Add incident alerts for abnormal delay bursts and correction spikes.
Common anti-patterns
- Treating all reported volume as equally decision-relevant.
- Evaluating POV quality only on final cleaned tape.
- Letting a denominator burst cause an aggression burst.
- Ignoring corrections because “they are rare.”
- Using the same pacing rules for open, midday, and close.
- Assuming odd-lot / off-exchange volume has the same control meaning as displayed round-lot flow.
- Measuring participation perfectly in hindsight and calling the live controller “on target.”
What good looks like
A production-grade POV stack should be able to answer:
- What volume did the controller see at each decision instant?
- What volume was actually eligible by event time?
- How much of today’s slippage came from denominator error rather than spread / impact alone?
- Which venues or condition mixes create the worst volume-illusion tails?
- When denominator quality degrades, does the controller become safer or more frantic?
If you cannot answer those, your POV algo is pacing against a ghost denominator.
And ghost denominators are expensive.
Selected public references
- FINRA Rule 6622 / Trade Reporting FAQ — OTC equity trades must be reported "as soon as practicable," generally no later than 10 seconds after execution, which makes report latency a structural part of the data path.
- FINRA trade-cancel / reversal guidance — previously reported trades may later be canceled or reversed, which means the tape can be rewritten after the controller already acted.
- Nasdaq odd-lot dissemination note — odd-lot trade messages carry a dedicated modifier and are included in volume statistics, even though their control meaning for displayed-liquidity pacing is not identical to round-lot lit flow.
Bottom line
POV slippage is not only a market-impact problem.
It is also a measurement-timing problem.
When reported volume arrives late, arrives in bursts, or arrives with the wrong control semantics, the controller misreads its own participation rate and creates avoidable urgency. The right fix is not “more reactive pacing.” It is:
- delay-aware denominator reconstruction,
- explicit print eligibility rules,
- clipped denominator shock handling,
- as-of replay for honest evaluation.
In short:
before you optimize participation, make sure you know what market volume means at decision time.