You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The E2E tests for calculator_multiply and calculator_divide fail all 10 retry attempts due to non-deterministic receiver method instantiation patterns generated by qwen2.5-coder:0.5b.
Details
Even with temperature=0 and seed=42, the LLM randomly chooses between two valid receiver instantiation patterns:
Problem
The E2E tests for
calculator_multiplyandcalculator_dividefail all 10 retry attempts due to non-deterministic receiver method instantiation patterns generated by qwen2.5-coder:0.5b.Details
Even with
temperature=0andseed=42, the LLM randomly chooses between two valid receiver instantiation patterns:Pattern 1 (in golden files):
Pattern 2 (sometimes generated):
Both patterns are syntactically valid but produce different output strings, causing E2E test failures.
Current Status
calculator_multiplyandcalculator_divideE2E tests in internal/ai/e2e_test.goPossible Solutions
References