Cool, yes, benchmark results are great to show.

Also, are you using this tool as part of another project? It’d be interesting to see what the main applications of CoT prompting are (the examples are great, but a little basic).

I'm not using it in a larger project at the moment. The examples are mainly there to help people get started quickly. As for the applications, they are somewhat context-dependent, but I might add one or two larger examples later if I have the time.

I guess CoT prompting could be used for the ARC Prize (https://arcprize.org).