Pretty cool! It reminded me of this work from NVIDIA Research - https://nvidia-ai-iot.github.io/remembr where they used VLMs and RAG on top of a real robot to navigate the Voyager campus in Santa Clara. You also might like the new OpenAI o3 models and how well they can play GeoGuessr ;)

https://simonwillison.net/2025/Apr/26/o3-photo-locations, https://news.ycombinator.com/item?id=43835044, https://www.astralcodexten.com/p/testing-ais-geoguessr-geniu...