Is V4 still not a multi-modal model?

Not yet... Which is a shame.