The motion tracking system used by ARKit/ARCore is called SLAM (simultaneous localization and mapping): a combination of motion-sensor data and camera data, if available.

Sure, but my point was that using the same inputs and getting a similar output doesn't mean the processing is the same. This is just localization, which is still impressive, but I don't see any mapping going on.
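
To make the distinction concrete, here's a toy 1-D sketch. It is not ARKit/ARCore code and says nothing about what those frameworks do internally; the class names (`LocalizationOnly`, `ToySlam`) and the observation format (landmark name mapped to its offset from the device) are made up for illustration. Both trackers take the same inputs and both output a position, but only one of them builds and reuses a map.

```python
from dataclasses import dataclass, field


@dataclass
class LocalizationOnly:
    """Hypothetical tracker: integrates motion, keeps no model of the world."""
    x: float = 0.0

    def step(self, odometry: float, observations: dict) -> None:
        # Observations are ignored; position is pure dead reckoning.
        self.x += odometry


@dataclass
class ToySlam:
    """Hypothetical tracker: estimates position AND a landmark map."""
    x: float = 0.0
    landmarks: dict = field(default_factory=dict)  # name -> estimated position

    def step(self, odometry: float, observations: dict) -> None:
        # Predict position from motion, same as the tracker above.
        self.x += odometry

        # Localization: correct the prediction using landmarks already mapped.
        errors = [
            self.landmarks[name] - offset - self.x
            for name, offset in observations.items()
            if name in self.landmarks
        ]
        if errors:
            self.x += sum(errors) / len(errors)

        # Mapping: add landmarks seen for the first time.
        for name, offset in observations.items():
            if name not in self.landmarks:
                self.landmarks[name] = self.x + offset


# Same inputs for both: a landmark "A" really sits at 5.0.
# Step 1: accurate odometry. Step 2: odometry over-reports by 0.5.
inputs = [(1.0, {"A": 4.0}), (1.5, {"A": 3.0})]

loc, slam = LocalizationOnly(), ToySlam()
for odometry, observations in inputs:
    loc.step(odometry, observations)
    slam.step(odometry, observations)

print(loc.x)           # drifted estimate
print(slam.x)          # estimate corrected against the map
print(slam.landmarks)  # the map itself; LocalizationOnly has nothing like it
```

The practical test is whether the system keeps a persistent model of the environment and re-localizes against it. If it does, that is the "mapping" half; if the position only ever comes from integrating motion, it is localization alone.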