For ultra low-power embedded, wouldn't a mutex approach work just fine? You're running on a single core anyway.

I'm not sure about the single-core scenario, but would love to learn if someone else wants to add something

In reality multiple threads for single core doesn't make much sense right?

> In reality multiple threads for single core doesn't make much sense right?

Not necessarily, I think -- depends what you're doing.