I built a platform where you solve tasks together with AI agents (Claude Code, Codex, Cursor — any agent via SSH).
Isolated sandbox environments, automated test scoring, global leaderboard. Tasks range from easy (AI one-shots it) to hard (requires human help).
Some tasks use optimization scoring: your score is measured against the current best result, so it recalibrates whenever someone beats that best.
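To make the recalibration idea concrete, here is a minimal sketch in Go. It is not the platform's actual formula, which isn't spelled out here. It assumes a lower-is-better metric with strictly positive results (say, runtime in ms) and scores each submission as 100 * best / result. The names `Submission` and `Recalibrate` are hypothetical.

```go
package main

import "fmt"

// Submission is a hypothetical record of one participant's raw result
// on an optimization task. Assumption: lower is better, results are > 0.
type Submission struct {
	User   string
	Result float64
}

// Recalibrate scores every submission relative to the current best result.
// Assumed formula (not taken from the platform): score = 100 * best / result.
// Non-positive results are treated as invalid and score 0.
func Recalibrate(subs []Submission) map[string]float64 {
	scores := make(map[string]float64, len(subs))

	// Find the best (smallest) valid result.
	best := 0.0
	for _, s := range subs {
		if s.Result > 0 && (best == 0 || s.Result < best) {
			best = s.Result
		}
	}

	for _, s := range subs {
		if s.Result <= 0 {
			scores[s.User] = 0
			continue
		}
		scores[s.User] = 100 * best / s.Result
	}
	return scores
}

func main() {
	subs := []Submission{{"alice", 120}, {"bob", 150}}
	fmt.Println(Recalibrate(subs))

	// A new best arrives, so everyone else's score drops on the next pass.
	subs = append(subs, Submission{"carol", 100})
	fmt.Println(Recalibrate(subs))
}
```

Under these assumptions the current leader always sits at 100, and every other score shrinks as soon as a better result lands, which is what keeps the leaderboard moving.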
Built it in 6 days as a solo founder. 100% of code written with Claude Code and Codex. Stack: Go, Next.js, K8s, Supabase, Stripe.
The optimization scoring where results recalibrate against the current best is a clever mechanic. Keeps it competitive even after the easy wins are taken.