This is just a benchmark of how much of a sycophant an LLM is. Anything that scores > 50% on this test should be punted into the bin.
This is just a benchmark of how much of a sycophant an LLM is. Anything that scores > 50% on this test should be punted into the bin.