To be fair, the author does mention the huge difference between Gemma 3 and Gemma 4 on Tau function calling benchmark.