The issue i think is that to model sycophancy you'd need another model that can address signs of sycophancy - it's turtles all the way down
The issue i think is that to model sycophancy you'd need another model that can address signs of sycophancy - it's turtles all the way down