Is there research showing if and under what conditions LLM output is detected accurately. What are the false positive and false negative rates?