What's silly about it? It can accurately identify when the concept is injected vs. when it is not, across a statistically significant sample. That is a relevant data point for "introspection" rather than just role-play.

I think what clinched it for me is that they said they had 0 false positives. That is pretty significant.