The issue is misrepresenting the situation.

One of those operations makes a row-major array, the other makes a col-major array. Downstream functions will have different performance based on which is passed.