> But that's not what they were testing for. It passes the test for prompt injection, and then usability would be a different set of tests
That's like claiming that a database has 10x faster write speed than any other database on the market[1], and the read speed wasn't measured because that's a different metric.
------------------
[1] By writing all data to /dev/null