Have you tried using them as macro assemblers in the way described in the paper?

No, I haven't. I have used them with shallow embedding techniques that are relatively similar, but not in this way.

That sounds interesting!