Hah, that's because the prompt itself was only about 30 tokens. We need a much bigger prompt to properly test PP.