I think you need play around with some of the early codegen models so you can get a better intuition for how LLMs work/fail.