This kind of thing could make a great LLM benchmark: Opus basically screwed it up and produced a monstrosity of a solution on its first try.
Interesting! It did well for a first try. This was my prompt:
Lets play elevator saga! Here's the initial implementation:
and documentation attached in the PDF.