jedarden/ai-code-battle

Author	SHA1	Message	Date
jedarden	41d868b5c1	feat(engine): add pre-generated map loading from map library Per plan §3.8, maps should be generated offline and stored in the map library, not generated on-the-fly during matches. This commit adds support for loading pre-generated maps from the database. Changes: - Add PreGeneratedMap type and WithMap option to MatchRunner - Add loadPreGeneratedMap() to parse map JSON (walls, cores) - Update worker to pass loaded map data to MatchRunner via WithMap - Fallback to on-the-fly generation if map data is invalid - Update acb-mapgen spawn radius to 25% for 2-player (aligns with match.go) - Update test to verify cores are outside final zone radius This enables the map library infrastructure (maps/, acb-mapgen, index builder) to be used in production matches instead of being ignored. Closes: bf-5m29 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-25 14:14:27 -04:00
jedarden	6a6a3788a6	fix(worker): use ConfigForPlayers to get correct AttackRadius2=12 The worker was hardcoding AttackRadius2=5 in executeMatch, but engine.ConfigForPlayers sets AttackRadius2=12 for both 2-player and 3+ matches. This mismatch meant matches ran with the old attack radius instead of the improved value that supports better combat density. Now uses ConfigForPlayers which provides: - AttackRadius2: 12 (3.5 tiles) for all player counts - Proper zone parameters scaled by player count - Correct max turns scaling Grid dimensions are overridden from the pre-generated map, and SeasonID/RulesVersion are preserved from the match. Closes: bf-576s Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 16:03:02 -04:00
jedarden	ea04f4debb	style: apply gofmt alignment fixes across codebase Tab/space alignment consistency from running gofmt on all packages. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 10:40:33 -04:00
jedarden	af46a1da97	feat(engine): add combat-density metric and fix computeCombatTurns - Fix computeCombatTurns to count EventCombatDeath events instead of EventBotDied with reason="combat" (which was never emitted, causing CombatTurns to always be 0) - Add CombatDeaths field to MapEngagementScore to track focus-fire kills - Update engagement formula to weight combat deaths at 3.0 (same as win_prob_crossings) to bias map evolution toward combat-dense maps - Add countCombatDeaths helper function to count EventCombatDeath events - Update log output to include combat_deaths metric This implements bf-4nxs: the combat-density metric is now measured and weighted in map engagement, which gates map curation/selection. Maps with zero combat will have low engagement scores and be filtered out. Closes: bf-4nxs	2026-05-24 10:16:54 -04:00
jedarden	df7a3e38c7	feat(worker): implement map engagement scoring per plan §14.6 Update the map engagement scoring formula to match plan §14.6: - score = win_prob_crossings * 3.0 + critical_moments * 2.0 + resource_contest_turns * 1.5 + survival_turns * 0.5 New metrics computed from replay data: - resource_contest_turns: turns where energy is contested by multiple players - survival_turns: turns where all players have at least one bot alive The old formula used map_coverage_pct, closeness, and turn_pct which did not match the specification. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-04 02:28:45 -04:00
jedarden	39fe612f6a	feat(worker): fix rating recovery default sigma value The rating recovery CLI mode (-mode=recalc-ratings) was using glicko2Tau (0.5) instead of glicko2DefaultSigma (0.06) for the default sigma value when resetting ratings. This caused the reset sigma to be ~8x higher than the schema-defined default. Added glicko2DefaultSigma constant (0.06) and updated ResetAllRatings and recalcRatings to use it correctly. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-04 00:49:47 -04:00
jedarden	467b7b67ea	feat(worker): add rating recovery CLI mode (-mode=recalc-ratings) Implements the rating recovery procedure specified in plan §12.3. Running 'go run ./cmd/acb-worker -mode=recalc-ratings' will: 1. Reset all bot ratings to Glicko-2 defaults (mu=1500, phi=350, sigma=0.06) 2. Fetch all completed matches from the database in chronological order 3. Replay each match to recompute Glicko-2 ratings from scratch 4. Update the bots table with the recalculated ratings This is needed for disaster recovery when ratings are corrupted or lost. Database functions added: - ResetAllRatings: resets all bot ratings to defaults - GetAllCompletedMatches: fetches completed matches chronologically with participants - UpdateAllRatings: bulk updates all bot ratings in a single transaction Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-04 00:41:10 -04:00
jedarden	92576dbed4	feat(worker): add map engagement score tracking and verify win_prob in replays - Add engine.CalculateMapEngagement() to compute map engagement scores from replay data (win_prob_crossings, critical_moments, map_coverage_pct, closeness, turn_pct) - Add DBClient.UpdateMapEngagement() to update map engagement using rolling average - Worker now calculates and writes map engagement scores after each match - Add test to verify win_prob array is non-empty in produced replays This implements the win probability Monte Carlo array storage in replay JSON feature. The engine already called ComputeWinProbability() in MatchRunner.Run(), so this commit adds the missing map engagement tracking. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-03 23:21:57 -04:00
jedarden	4937f94afd	feat(combat): rank matches by enemy-kill combat turns Adds combat_turns metric (distinct turns where ≥1 bot died from enemy focus-fire, excluding self-collisions). Worker computes it after each match; index builder sorts matches/index.json and the new most-combat playlist descending by it, and bumps interest score for combat-heavy matches so they surface in highlights. Also switches homepage featured replay default view from influence to standard so the actual bot-on-bot combat is visible. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-03 18:32:08 -04:00
jedarden	9b16b32aef	fix(worker): handle NULL map_json fields with COALESCE map_json generated by acb-map-evolver lacks a 'spawns' key; scanning map_json->>'spawns' into a non-nullable string causes "converting NULL to string is unsupported". Use COALESCE for walls/spawns/cores. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-02 10:13:30 -04:00
jedarden	e5dc3bc543	fix: accept base64-encoded AES keys (OpenBao stores keys as base64, not hex) The encryption key stored in OpenBao/K8s secrets is base64-encoded but the API and worker crypto functions expected hex. Add parseAESKey() that accepts both formats (tries hex first, falls back to base64). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 23:04:29 -04:00
jedarden	e64230b122	fix: resolve universal stalemate — signing format and secret decryption Two root causes prevented bots from making any moves: 1. SignRequest signing string included timestamp ({match_id}.{turn}.{timestamp}.{hash}) but all bots implement verifySignature without timestamp ({match_id}.{turn}.{hash}). Fixed by dropping timestamp from the signing string; X-ACB-Timestamp header is still sent for clock-skew checks but not in the HMAC. 2. The API stores bot secrets AES-GCM encrypted (184 hex chars) in the DB. The worker was passing the ciphertext directly as the HMAC key, while bots use their plaintext k8s secret (64 hex chars). Fixed by decrypting in the worker using ACB_ENCRYPTION_KEY. Also tightens the home page winner filter to exclude winner_id="0" stalemates. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:48:25 -04:00
jedarden	341591a10b	fix(worker): disable SDK checksum trailer for R2 uploads AWS SDK Go v2 s3 v1.100.0 defaults to RequestChecksumCalculationWhenSupported, which causes PutObject to send STREAMING-UNSIGNED-PAYLOAD-TRAILER — a chunked transfer mode R2 doesn't support. Setting WhenRequired makes the SDK send a standard signed payload instead, resolving the 403 SignatureDoesNotMatch. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 10:35:00 -04:00
jedarden	dc0caf0115	feat(worker): upload replays directly to R2 in addition to B2 Adds R2 (Cloudflare) as a direct upload target alongside B2 (cold archive). When ACB_R2_* credentials are configured, the worker uploads replays and thumbnails to R2 immediately after each match, bypassing the index-builder's B2→R2 promotion cycle. This is necessary because ARMOR's B2 app key is write-only; reads via the direct S3 path return 403. The Cloudflare CDN read path (armor-hub-b2.ardenone.com) is dead post-hub-decommission. Direct R2 upload ensures replays are available without waiting for a working B2 read path. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 10:24:47 -04:00
jedarden	d126654dbb	fix(worker): use BaseEndpoint instead of EndpointResolverV2 for ARMOR EndpointResolverV2 with a custom static URI does not honor UsePathStyle — the resolver provides the final endpoint and the SDK does not re-apply path-style bucket addressing on top of it. This means the bucket name was dropped from the request path even with UsePathStyle=true, sending PUTs to /replays/... instead of /armor-apexalgo/replays/... BaseEndpoint is the SDK's documented approach for S3-compatible custom endpoints; it sets the base URL and then correctly applies path-style addressing to produce /bucket/key URLs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 09:48:53 -04:00
jedarden	55dffb624c	fix(worker): UsePathStyle for ARMOR and skip crash_strikes on normal game endings Two fixes: 1. Add UsePathStyle=true to B2 S3 client. Without it the SDK uses virtual-hosted addressing, dropping the bucket name from the request path. Uploads hit /replays/... instead of /armor-apexalgo/replays/... causing NoSuchBucket errors on every replay/thumbnail PutObject. 2. Don't update crash_strikes for normal game endings (stalemate, turns). In snake-style games every bot eventually crashes into a wall/snake — that is the expected end condition, not an HTTP error. The old code treated all Crashed[] entries from the engine as errors, causing all 6 bots to accumulate strikes after every single match and hitting the 30-min cooldown threshold after just 3 matches. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 09:38:09 -04:00
jedarden	ee8c7c37b2	fix(worker): use EndpointResolverV2 for ARMOR B2 uploads The BaseEndpoint approach with older aws-sdk-go-v2 causes "Invalid region: region was not a valid DNS name" errors when uploading to ARMOR's S3-compatible endpoint. Switching to EndpointResolverV2 bypasses the SDK's endpoint rule validation entirely, resolving the issue. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-28 23:33:56 -04:00
jedarden	c9cdafe9ca	fix(worker): upgrade aws-sdk-go-v2 to fix B2 upload error Fixes 'Invalid region: region was not a valid DNS name' error when uploading replays to B2 via ARMOR proxy. The error was caused by a known bug in aws-sdk-go-v2 v1.41.4 where the endpoint resolver would validate the region as a DNS name even when using a custom BaseEndpoint with UsePathStyle=true. Upgraded SDK versions: - github.com/aws/aws-sdk-go-v2 v1.41.4 -> v1.41.6 - github.com/aws/aws-sdk-go-v2/config v1.32.12 -> v1.32.16 - github.com/aws/aws-sdk-go-v2/service/s3 v1.97.2 -> v1.100.0 - github.com/aws/smithy-go v1.24.2 -> v1.25.1 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-28 22:56:47 -04:00
jedarden	09fced7dfe	fix(worker,index-builder): use us-east-1 region for S3-compatible endpoints The AWS SDK requires a valid AWS region name even when using custom S3-compatible endpoints (ARMOR/B2). Using "auto" as the region causes an error: "Invalid region: region was not a valid DNS name." This fixes the replay upload pipeline which was failing with the invalid region error. Replays should now upload successfully to B2 via the ARMOR proxy. Related to ai-code-battle-o43: Replay viewer verification task.	2026-04-25 11:07:08 -04:00
jedarden	e601fecc04	fix(worker): update B2 client for S3-compatible API (ARMOR/B2) Remove custom endpoint resolver and use AWS SDK's standard approach for S3-compatible endpoints: - Use config.WithRegion("auto") for custom endpoints - Set BaseEndpoint directly via s3.NewFromConfig options - Add UsePathStyle for B2 compatibility This fixes the 'Invalid region: region was not a valid DNS name' error that was preventing replay uploads. The deployment manifest already sets ACB_B2_REGION to empty string to avoid conflicts. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-25 10:38:41 -04:00
jedarden	992fa1d573	fix(worker): crash cooldown passes time.Time not time.Duration to pq Passing time.Duration (int64 nanoseconds) as $2 in NOW() + $2 caused PostgreSQL to interpret the nanosecond value as seconds, setting cooldown_until to year ~59066 instead of +30 minutes. Fix: pre-compute time.Now().Add(CrashCooldownDuration) and pass the resulting time.Time — pq encodes it as a proper timestamptz. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:07:29 -04:00
jedarden	4f45670066	fix(worker): use seasons.id instead of seasons.season_id in ClaimJob The seasons table was recreated with id BIGSERIAL (not season_id VARCHAR). The ClaimJob query was still referencing s.season_id (stale column name). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 17:39:24 -04:00
jedarden	38ae4c6303	fix(worker): use winner identity for Glicko-2 pairwise scoring Raw game scores (capture points) are tied in most matches since the winner is determined by an energy/bots-alive tiebreaker. This caused Glicko-2 delta=0, leaving rating_mu frozen at 1500 for all bots. Now winner gets 1.0, non-winners 0.0, draws 0.5 — correct pairwise win/loss signal for Glicko-2 convergence. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 17:29:36 -04:00
jedarden	d0087a3241	fix(docker): add COPY metrics/ to all service Dockerfiles The metrics package is a local module dependency imported by all services but was missing from every Dockerfile's build context. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-23 18:00:16 -04:00
jedarden	477a54c548	feat(matchmaker): implement §6.1 Pareto skill-proximity + LRU pairing algorithm Replace random 2-player pairing with the full §6.1 algorithm: - Seed selection: bot with oldest last-match timestamp (tiebreak: lowest bot ID) - Format selection: seed's least-played player count among {2, 3, 4, 6} - Opponent selection: Pareto 80%/16-rank skill proximity + oldest last-pairing with seed + fewest 24h games for game-count balance - Map selection: least-recently-used active map for the chosen player count, with map_scores.last_used_at updated after each match - Random player slot assignment for all participant counts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-22 17:35:00 -04:00
jedarden	5a1130c77a	feat(bot): add Pacifist bot (JavaScript) — non-aggressive attrition archetype PacifistBot never attacks; it survives by maximizing distance from enemies and retreating toward own core when cornered. Pure evasion strategy that wins via opponent elimination by third parties. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 16:32:50 -04:00
jedarden	582b4c010d	fix(worker): remove unused net/http import in acb-worker Pre-existing issue blocking go vet and go test. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:55:45 -04:00
jedarden	c56cc8bae6	fix(matchmaker): multi-match crash cooldown (3 strikes / 30 min) per §4.5 + §6.1 Add crash_strikes and cooldown_until columns to bots table. Worker increments strikes on crash (cooldown at 3), resets on success. Matchmaker excludes cooldown bots from pairing, series scheduling, and championship brackets. Fix erroneous cooldown filter on series table in finalizeCompletedSeries (column only exists on bots). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:22:12 -04:00
jedarden	fb707b8461	test: integration tests for multi-match crash cooldown (3 strikes / 30 min) per §4.5 + §6.1 The crash cooldown system was already implemented across engine, worker, and matchmaker. This adds comprehensive integration tests that verify: - Single crash does not trigger cooldown - Two crashes do not trigger cooldown - Three consecutive crashes trigger 30-min cooldown - Successful match resets strike counter - Interleaved crash/success resets counter correctly - Cooldown extends on repeated crashes while on cooldown - Matchmaker eligibility query excludes bots on active cooldown - Matchmaker eligibility query includes bots with expired cooldown - Full end-to-end flow: 3 crashes → excluded → cooldown expires → re-pair Tests use ACB_TEST_DATABASE_URL env var for PostgreSQL integration tests and skip gracefully when not configured. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:14:03 -04:00
jedarden	c618f0b7a1	feat(worker): gzip replay compression at upload per §7.1 Worker now gzip-compresses replays before uploading to B2 with key replays/{match_id}.json.gz and Content-Encoding: gzip. Updated B2 client Upload to accept contentEncoding parameter. Fixed downstream web consumers (matches, bot-profile, playlists) to reference .json.gz URLs. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-22 15:00:09 -04:00
jedarden	347ae4f1df	feat(predictions): resolve predictions on match completion, add API endpoints and frontend - Worker resolves open predictions after writing match results (resolvePredictions + upsertPredictorStats) - API endpoints: POST /api/predict, GET /api/predictions/open, GET /api/predictions/history - Frontend /watch/predictions page with polling, prediction submission, and history display - predictor_stats table tracks streaks and accuracy per predictor - Series format selection: fix threshold from >200 to >=200 for bo3 eligibility Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-21 14:08:15 -04:00
jedarden	91d807cec2	feat(web,cmd): enhance evolution dashboard, series/seasons pages, and matchmaker - Evolution page: live polling (10s), activity feed, candidate tracking, statistics section, island overview with live.json schema - Series page: detailed series view with game-by-game results - Seasons page: season list with status and champion display - Predictions page: enhanced prediction UI with open matches - API types: add CycleInfo, Candidate, ActivityEntry, Totals for live.json - Embed: improved embeddable replay widget - Mobile CSS: responsive breakpoints and bottom tab bar - Exporter: enhanced live.json generation with full cycle/candidate data - Matchmaker: series scheduling support with config - Worker: additional database queries for series/season data Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-21 13:42:20 -04:00
Argo Workflows CI	be5ffbf8c1	fix: update Dockerfiles to golang:1.25-alpine (go.mod requires go 1.25.0)	2026-04-14 13:43:39 -04:00
jedarden	729efb3f45	Refactor acb-worker: B2 uploads, PostgreSQL writes, Glicko-2 ratings - Upload replays to B2 (Backblaze) instead of R2 for cold archive storage - Write match results directly to PostgreSQL instead of HTTP API - Perform Glicko-2 rating updates in worker after match completion - Update config: ACB_R2_* env vars → ACB_B2_* - Remove obsolete api_test.go (tested removed HTTP client) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-29 10:46:23 -04:00
jedarden	b06350d762	Remove legacy code: worker-api/, cmd/acb-indexer/, cluster-configuration/, gut cmd/acb-api/ Some checks failed CI / Go Tests (push) Has been cancelled Details CI / Worker API Tests (push) Has been cancelled Details CI / Indexer Tests (push) Has been cancelled Details CI / Web Build (push) Has been cancelled Details Cleanup of superseded code that no longer matches the architecture: Removed: - worker-api/ - Cloudflare Worker with D1, superseded by K8s-based matchmaker + direct PostgreSQL - cmd/acb-indexer/ - TypeScript index builder, superseded by Go cmd/acb-index-builder/ - cluster-configuration/ - K8s manifests belong in ardenone-cluster repo Gutted cmd/acb-api/: - Removed registration, job claim/result endpoints (deferred for v1) - Removed dead code: predictions.go, seasons.go, series.go, register.go, jobs.go, glicko2.go - API is now a stub with only health/ready endpoints - Matchmaker and workers handle the core loop without it Updated PROGRESS.md to reflect current architecture. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-29 10:22:16 -04:00
jedarden	20c48783cc	Add Prometheus metrics endpoint to match worker Some checks are pending CI / Go Tests (push) Waiting to run Details CI / Worker API Tests (push) Waiting to run Details CI / Indexer Tests (push) Waiting to run Details CI / Web Build (push) Waiting to run Details Adds a metrics HTTP server to acb-worker exposing Prometheus text format at /metrics, plus /health and /ready K8s probe endpoints. Tracks counters (matches, errors, jobs, replays, polls, heartbeats) and histograms (match duration, replay upload duration, replay size). Instruments the full worker execution flow. Fixes .gitignore binary patterns to use root-anchored paths so cmd/ subdirectories aren't incorrectly excluded. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-26 00:50:10 -04:00
jedarden	23186b77e1	Start Phase 6: Add deployment configuration and containers - Add Dockerfile for acb-worker match execution container - Add docker-compose.bots.yml for orchestrating all 6 strategy bots - Add docker-compose.workers.yml for worker and indexer deployment - Add .env.example documenting all required environment variables - Add DEPLOYMENT.md with deployment guide and troubleshooting - Update PROGRESS.md with Phase 6 progress Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 09:41:14 -04:00
jedarden	6659027bec	Implement match worker container (cmd/acb-worker/) - Worker polls Cloudflare Worker API for pending match jobs - Claims jobs and executes matches using the game engine - Uploads replays to R2 via S3-compatible API - Sends heartbeats during match execution - Submits results back to Worker API - Includes retry logic with exponential backoff - API client tests for job coordination endpoints Also fixes glicko2.ts: export g() and E() functions for testing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-24 08:06:15 -04:00

38 commits