Benchmarks
Benchmark tables coming soon
The benchmark page is being rebuilt so that results are broken out by model regime, ground-truth track, coverage, and held-out evaluation split. Final public benchmark tables will be available shortly.
Model regimes
Ground-truth tracks
Coverage / held-out splits