Hacker News

1970-01-01 8 hours ago

Creating a single real-world object and declaring it a benchmark? No, it doesn't work that way for a robust tool. You need to do something like Iron Chef, with a Greek architecture theme and a panel or judge that declares the winner. This is just seeing which tool subjectively makes the best-looking Pantheon.

Eridrus 8 hours ago

Yeah, this is less of a benchmark and more "I like this one, guys!"

The grading criteria are totally subjective, applied to a single poorly defined example, with no end use case in mind to guide how to even do the evaluation.

davej 3 hours ago

It's still interesting in a similar way to Simon Willison's Pelicans on a bicycle.

Eridrus 3 hours ago

The Pelicans are mostly just entertainment.