This model is more comparable to GPT-2 than anything we use now.