other way around. it's trained to generate long CoT to reason through problems (and does it well!) but has ~no tool-calling capability, and ~no ability to handle conversations longer than 1-2 messages.
see the warning at the top of https://huggingface.co/WeiboAI/VibeThinker-3B