jedarden
9799d75d2b
feat(dashboard): add cache token tracking and running totals panel
...
Adds cache_read and cache_write token directions throughout the
observability stack so Anthropic prompt-cache billing is visible.
- model/metrics.go: TokensCacheRead, TokensCacheWrite, TokenRateCacheRead,
TokenRateCacheWrite fields on MetricSnapshot
- collector: reads direction=cache_read/cache_write from
zai_proxy_tokens_total Prometheus metric
- frontend types.ts: matching TS fields
- TokenPanel: rewritten to show all 4 directions (input, output,
cache_read, cache_write) on the rate chart; running-total summary
strip above the chart shows window totals (e.g. "5h window: 1.2M
input / 340k output / 89k cache_read / 12k cache_write")
Also updates docs/plan/plan.md to accurately document the full
dashboard architecture (backend API, storage schema, SSE hub,
frontend panels, Grafana layer, env vars).
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-16 23:08:28 -04:00
jedarden
dee82a76a3
chore: update module paths and add evaluation package
...
- proxy/go.mod: github.com/ardenone/zai-proxy → git.ardenone.com/jedarden/zai-proxy
- dashboard/go.mod: github.com/ardenone/ardenone-cluster/containers/zai-proxy-dashboard → git.ardenone.com/jedarden/zai-proxy/dashboard
- Update all Go import paths in proxy/ and dashboard/ to match new module paths
- Add proxy/evaluation/ package (was missing from initial commit)
- Add docs/plan/plan.md with architecture, security model, telemetry design, and migration checklist
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-16 16:03:50 -04:00
jedarden
e7c24a0c08
feat: initial zai-proxy ecosystem repo
...
Extracted from ardenone-cluster/containers/zai-proxy and
ardenone-cluster/containers/zai-proxy-dashboard.
- proxy/: OpenAI-compatible ZAI reverse proxy (Go, v1.10.0)
- Token counting, rate limiting, Prometheus metrics, canary support
- dashboard/: Metrics dashboard backend + React frontend (Go, v1.0.0)
- Prometheus collector, SQLite storage, SSE live updates
- docs/: Operational notes, research, and plan subdirs
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-16 15:53:52 -04:00