XY-982 qmd debug ergonomics Dreaming retest #209

Merged: yvette-carlisle merged 2 commits into main from y/elf-xy-982 on Jun 19, 2026

@yvette-carlisle (Member)

Summary

  • Add the June 19 qmd debug-ergonomics Dreaming retest report and machine snapshot.
  • Confirm that the qmd debug edge is unchanged: qmd keeps its edge on default top-k and short CLI replay, while ELF keeps its narrow wins on operator-debug trace and stage visibility.
  • Update README/index routing and add a focused snapshot regression test for the claim boundaries.

Benchmark

  • cargo make real-world-job-operator-ux-live-adapters: pass; ELF 6 pass/0 wrong_result, qmd 0 pass/6 wrong_result in the narrow operator-debug live slice.

Validation

  • jq empty apps/elf-eval/fixtures/report_snapshots/2026-06-19-qmd-debug-ergonomics-dreaming-retest-report.json
  • cargo test -p elf-eval --test real_world_job_benchmark qmd_debug_ergonomics_dreaming_retest_report_preserves_qmd_edge -- --test-threads=1
  • cargo make fmt-check
  • cargo make check-docs
  • decodex docs check
  • cargo make lint
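
The validation steps above can be chained into one fail-fast script. This is a minimal sketch, not something shipped in this PR: the function names `validation_steps` and `run_validation` and the `DRY_RUN` switch are hypothetical, and the commands themselves are copied verbatim from the list. It assumes the script is run from the repository root, since the snapshot path is relative.

```shell
#!/bin/sh
set -eu

# Print the validation commands, one per line, verbatim from the PR description.
validation_steps() {
  cat <<'EOF'
jq empty apps/elf-eval/fixtures/report_snapshots/2026-06-19-qmd-debug-ergonomics-dreaming-retest-report.json
cargo test -p elf-eval --test real_world_job_benchmark qmd_debug_ergonomics_dreaming_retest_report_preserves_qmd_edge -- --test-threads=1
cargo make fmt-check
cargo make check-docs
decodex docs check
cargo make lint
EOF
}

# Echo each step, then run it; stop at the first failing step.
# DRY_RUN is a hypothetical switch: DRY_RUN=1 prints the steps without running them.
run_validation() {
  validation_steps | while IFS= read -r step; do
    echo "+ $step"
    if [ "${DRY_RUN:-0}" != "1" ]; then
      # Detach stdin so a step cannot consume the remaining list from the pipe.
      sh -c "$step" </dev/null || exit 1
    fi
  done
}
```

Setting `DRY_RUN=1` before calling `run_validation` lists the six steps without executing anything, which is a cheap way to check the list before a real run.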

Boundaries

  • Does not claim broad ELF-over-qmd superiority.
  • Does not treat qmd live operator-debug wrong_result rows as evidence that the qmd default top-k/replay edge is gone.
  • Leaves expansion, dense/sparse contribution, fusion, and rerank parity unproven until comparable artifacts are emitted.

@yvette-carlisle merged commit 5444642 into main on Jun 19, 2026 (12 checks passed)
@yvette-carlisle deleted the y/elf-xy-982 branch on June 19, 2026 05:08