I still can't believe that LLM encoders aren't unsupervised learned.

So much left on the table

They are using Qwen, so this is decoder only.

Yes, my comment was kind of a non sequitur