At 8-bit quantization (q8_0) I get 20 tokens per second on a Radeon R9700.