Experiment: "We got AI to do things and it did weird stuff sometimes".
Brilliant! Amazing! I'm glad ~4 years down the line we're still re-discovering Ha Ha Funny Output.
Experiment: "We got AI to do things and it did weird stuff sometimes".
Brilliant! Amazing! I'm glad ~4 years down the line we're still re-discovering Ha Ha Funny Output.
At this point I think many of us are similarly exhausted by this sort of trite exercise. I really don't need some VC backed startup to show me this sort of output any more, especially when the output in question is obviously boring and substandard.
> I'm glad ~4 years down the line we're still re-discovering Ha Ha Funny Output
Four years or forty millennia? So a certain extent, all whimsical art is “haha funny” result.
Yea what are they trying to test? Where is the hypothesis?
We're generally trying to test if/when AIs can run companies. Not many people know this, but Vending-Bench (our other project where AIs run vending machines) is intended as a datapoint for measuring whether AIs can acquire resources by themselves, which is a prerequisite to AIs taking over. This is similar, but now instead of a retail business, it's a media business.
They're trying to test if it's good enough to replaced the few remaining paid radio/streaming DJs yet.
I am reminded of how not even 2 weeks ago we had an “experiment” of rewriting Bun in Rust.