I beg to differ I've done this already. This is a harness issue not a model issue.

It won't be identical, but as long as the A->B test loop can be closed I've had 100% success rate.