Look at VLM mechanistic interpretability papers vs just pca on JEPA trained weights.
JEPA gives you interpretability for free.
I have not personally inspected them and my view is maybe a more exaggerated/dramatic claim of those working in the JEPA sphere
Sounds interesting, any links?
JEPA in chess leads to interpretable chess boards:
https://arxiv.org/abs/2606.11860
JEPA in image classification leads to interpretable image latents
https://arxiv.org/abs/2508.10104
Easy intro to JEPA, demonstrating that interpretability is as easy as running a PCA on latents
https://youtu.be/kYkIdXwW2AE?is=CfCBcy1jLt-FfI2E