I'm experimenting with a local-first autonomous agent system on Windows.
The interesting part for me isn't the UI or the demos but the engineering problems: how up-front planning compares with step-by-step tool calling, how state drifts over long tasks, and how fragile things get once you add retries and recovery logic.
Mostly learning by breaking things and trying to make the system more predictable.