Commit 1adc6d2
pillar1(R8): isolate long-seq bug to batched prefill path (partial)
Extended TQ_DUMP_HIDDEN to support TQ_DUMP_POS={0|N|all} for per-
position dumps (infrastructure for the next-step per-layer diff).
A/B on Qwen3-0.6B with 144-token synthetic input:
batched prefill: UTF-8 garbage (definitively broken)
per-token prefill: ASCII but still wrong (secondary issue)
KV fp32 (both): same patterns — KV compression NOT the cause
Primary follow-on target: tq_forward_batch non-MoE path. Per-token
path has a separate, milder issue (likely RoPE/attention at larger
positions). Both need the HF per-layer diff to pinpoint.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent db4d87a commit 1adc6d2
2 files changed
Lines changed: 36 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
56 | 56 | | |
57 | 57 | | |
58 | 58 | | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
59 | 81 | | |
60 | 82 | | |
61 | 83 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2489 | 2489 | | |
2490 | 2490 | | |
2491 | 2491 | | |
2492 | | - | |
2493 | 2492 | | |
2494 | 2493 | | |
| 2494 | + | |
| 2495 | + | |
| 2496 | + | |
| 2497 | + | |
| 2498 | + | |
| 2499 | + | |
| 2500 | + | |
| 2501 | + | |
| 2502 | + | |
| 2503 | + | |
2495 | 2504 | | |
2496 | | - | |
| 2505 | + | |
| 2506 | + | |
| 2507 | + | |
| 2508 | + | |
2497 | 2509 | | |
2498 | 2510 | | |
2499 | 2511 | | |
| |||
0 commit comments