I see what you mean, but I disagree. I expect that Claude Code is backed by a separate post-train of Claude base which has been trained using the Claude Code harness and toolset.

It is possible, of course, but I see no reason to believe it.

fwiw, other models seem to (or at least are reported to) struggle much more with Claude Code than with codex / opencode / pi etc.

that being said, there are other potential explanations.