the "containers manually vs compose" framing is exactly right -- that's the coordination layer that's missing from raw worktrees.
one thing i'm curious about with chatml: how's the experience on windows and linux? you mentioned it's running but with bugs. i built pane specifically because conductor and most of the other tools in this space were mac-first (or mac-only), and a huge chunk of the multi-agent dev community is on windows or linux.
pane is the same app, same shortcuts, same UI across all three. not "cross-platform eventually" -- it ships on all three now. it's built on xterm.js (the same terminal emulator vs code uses), so terminal compatibility isn't something you have to debug.
if anyone on windows or linux is hitting walls with chatml or conductor, pane might be worth a look: github.com/Dcouple-Inc/Pane