Part of the challenge is that machines are significantly different. The radiologist’s statement that an object measured from two different machines is the same and has not changed in size is in large part judgement. Building a model which can replicate this judgement likely involves building a model which can solve all common computer vision tasks, has the full medical knowledge of an expert radiologist, and has been painstakingly calibrated against thousands of real radiologists in hospital conditions.