In general, most researchers already incorporate LLM into their workflows, as it is quite good at context search. However, the relevant training data is based on the collective works of the field of experts. Collecting current data on that work is what makes the LLM sound relevant, and any improvement of the LLM model requires frequent new data from both researchers and the chat bot users themselves. LLM are not real "AI", and anyone that says otherwise is selling people something.
To phrase this differently, LLM companies conduct unauthorized targeted intelligence gathering on peoples work, codify that act of plagiarism or theft as MoE documentation, and sell unaccountable token output to other users.
There is a reason output becomes more nonsensical as "AI" companies try to use dynamic weight granularity and conceptual compaction. It is not necessarily "AI" hallucinations, but rather people fooling themselves into believing smart people are no longer needed if they willingly become a hapless exploited data source caste. This simply isn't true, as people will leave the field for awhile.
The LLM business model regularly requires copyright theft and plagiarism to persist. It will not magically become sentient/AGI/less-stupid, as these algorithms have been operating for over 40 years. What has changed is the scale of the deployment, data pool size, and the energy consumed.
Scientists are still necessary, as they create the world models LLM try to guess at by statistical inference. Hype and FUD ahead of an IPO for a highly dubious revenue company is expected. We look forward to the low cost liquidated GPU hardware in the near future. =3
Reading this invoked the image of ouroboros in my mind.
The Ouroboros in western mythology is a cautionary tale about the uselessness of the first perfect immortal being, and why humans should suffer our imperfections with insightful grace. The concept also made a great Red Dwarf episode.
LLM are more like the Mechanical Turk trick, but the persons inside the machine running the con is unaware of how their actions affect the confounded observers.
Have a wonderful day =3
Thank you, human being! Have you looked at the price of a RTX 5090? I can get a used car for that.
Indeed, just paid $3k more for the same workstation we purchased last year at this time. Just the DDR5 sticks and NVMe drive cost more than most parts right now including a rtx 5070 Ti 16G card. For h265 hardware encoding, the performance differences on higher-end cards benchmarks was negligible.
Building systems based on application specific benchmarks rather than general what-if use-case scenarios will sometimes show you something interesting. ymmv. =3