jedarden/ai-code-battle

Fork 0

Commit graph

Author	SHA1	Message	Date
jedarden	f3e34c6736	fix(evolver): correct failing tests for ensemble and behavior distance - Fixed TestSelectBestCandidate_GoHttpBonus: HTTP bonus (1.5x) on 150-char code (225 score) doesn't beat 500-char plain text (500 score). Test now expects the longer code to win. - Fixed TestScoreCandidate_Bonuses: adjusted minScore expectations to match actual code lengths with 1.5x bonus applied. - Fixed TestBehaviorDistance: use epsilon comparison for floating-point precision instead of exact equality. sqrt(2) ≈ 1.414214 is not exactly representable in floating-point. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 16:36:50 -04:00
jedarden	f5924e8b15	feat(acb-evolver): add LLM prompt builder and ensemble integration - Add parent sampling via tournament selection (selector/tournament.go) - Add replay analyzer to extract key moments, strategies, weaknesses - Add meta builder for leaderboard summary and dominant strategies - Add prompt assembler combining parent code + replay + meta context - Add LLM ensemble with fast tier (GLM-5-Turbo) for bulk generation and strong tier (GLM-5) for refinement passes - Add code extraction from LLM responses with language validation - Add convert utilities for type conversion between packages - Comprehensive test coverage for all components Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-29 16:47:25 -04:00
jedarden	bd4b0d3244	Add LLM prompt builder and ensemble integration (Phase 7) - selector: tournament selection for parent sampling from island populations - prompt: assembles evolution prompts from parent code, replay analysis, and meta description - llm: OpenAI-compatible client routing to ZAI proxy with fast (GLM-5-Turbo) and strong (GLM-5) tiers, plus code block extraction from model responses - Tests for prompt assembly, code extraction, and tournament selection Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 22:26:09 -04:00

Author

SHA1

Message

Date

jedarden

f3e34c6736

fix(evolver): correct failing tests for ensemble and behavior distance

- Fixed TestSelectBestCandidate_GoHttpBonus: HTTP bonus (1.5x) on 150-char code
  (225 score) doesn't beat 500-char plain text (500 score). Test now expects
  the longer code to win.
- Fixed TestScoreCandidate_Bonuses: adjusted minScore expectations to match
  actual code lengths with 1.5x bonus applied.
- Fixed TestBehaviorDistance: use epsilon comparison for floating-point
  precision instead of exact equality. sqrt(2) ≈ 1.414214 is not exactly
  representable in floating-point.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-04-08 16:36:50 -04:00

jedarden

f5924e8b15

feat(acb-evolver): add LLM prompt builder and ensemble integration

- Add parent sampling via tournament selection (selector/tournament.go)
- Add replay analyzer to extract key moments, strategies, weaknesses
- Add meta builder for leaderboard summary and dominant strategies
- Add prompt assembler combining parent code + replay + meta context
- Add LLM ensemble with fast tier (GLM-5-Turbo) for bulk generation
  and strong tier (GLM-5) for refinement passes
- Add code extraction from LLM responses with language validation
- Add convert utilities for type conversion between packages
- Comprehensive test coverage for all components

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-29 16:47:25 -04:00

jedarden

bd4b0d3244

Add LLM prompt builder and ensemble integration (Phase 7)

- selector: tournament selection for parent sampling from island populations
- prompt: assembles evolution prompts from parent code, replay analysis, and meta description
- llm: OpenAI-compatible client routing to ZAI proxy with fast (GLM-5-Turbo) and strong (GLM-5) tiers, plus code block extraction from model responses
- Tests for prompt assembly, code extraction, and tournament selection

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-26 22:26:09 -04:00

3 commits