In the article the author says they are doing reinforcement learning with LLMs.
It seems like they just want to play PewDiePie now that they have made tons of money from their salaried job and have a bunch of spare time.