Perennially checking if local models stack up to Claude 3.