our training stack doesn't make strong assumptions about data integrity, it's chill