I think both are helpful

1. starting fresh, because of context poisoning / long-term attention issues

2. lots of tools makes the job easier, if you give them a tool discovery tool, (based on Anthropics recent post)

We don't have reliable ways to evaluate all the prompts and related tweaking. I'm working towards this with my agentic setup. Added time travel for sessions based on Dagger yesterday, with forking, cloning, registry probably toda