how hard is it create one of these for my company that models most of the work we do at my company.

Just point an agent at your llm logs and ask it to generate a dataset of questions and answers from the problems you solved already.