Having agents create their own skills works well if you also give it a layer of verifiability.

Eg. Ask the agent to write a skill then get it to prompt a subagent to use the skill, then iterate until it verifies the task was completed correctly

Yes, I wrote a forge skill to do this via a/b testing and third agent to judge the result.

https://github.com/bjcoombs/ai-native-toolkit/blob/main/skil...

It hardens a skill through judge-panel refinement rounds, it’s a quality gate that runs after authoring, not an authoring tool.

This is a pretty neat, I suspect that eventually every skill will have some sort of validation/verification loop like this