Is it slow because of the inherent design or because it's recent and not as optimised as x86 or arm ?