Faster than the 0.2 tok/s this approach manages.