How are these agents stress tested today? Are there tools that are typically being used for QA and/or security?