I could be wrong but I believe this is a non-vision model. Please weigh in to correct me bc I would love to be wrong

GLM 5.2 is text only, not multi modal. And Opus is multi modal.