Hacker News

The best Chinese models are DeepSeek (general purpose) and GLM (coding), and they are both open weight and share lots of their tooling.

There are lots of AI companies and it doesn’t seem that they all have the same funding fountain or share monetization goals. I wouldn’t read much into what each one of them is doing.

Barbing 12 hours ago

Had seen weeks back that the top two non-Western models on ArtificialAnalysis were both closed: https://artificialanalysis.ai/#intelligence-category-tabs

How much stock we should put in that graph, though, I'm not sure.

sameersri2004 11 hours ago

Even if the Chinese labs' models stay open source or open weights after they reach, let's say, mythos-level intelligence, inference and the optimization needed to serve those models at speeds of 1000 tokens/sec are still not in the hands of the general public. These models have more than a trillion parameters and can't be run on commonly available hardware. So being open source doesn't fix the problem: the general public will still pay a company for inference.

state_less 9 hours ago

I'm pretty sure these large models are run on Nvidia GPUs, not some unobtainable piece of secret kit. You could go down the street and buy from AMD or a number of other vendors to push out FLOPs if you wanted or needed to, but you'll need a thick wallet to shell out for a cluster of GPUs to run these models. The reason people don't run the big Chinese models at home is that they can't afford the hardware, not that it isn't publicly available. This tech is essentially a large number of matrix multiplications, after all.

I think the larger problem is that restricting US AI companies gives the Chinese a leg up: they now have a window in which they can become the source of the most powerful models available because of government restrictions rather than on technical merit. All Anthropic customers just got a downgrade last evening, for example. While the Chinese are able to serve the world, or whoever they like, US corporations will be limited to the US market, or to whatever the powers that be will allow. This restrictiveness could turn out to be a disadvantage for American companies, since people will migrate to wherever they can get the most powerful models.

rcxdude 10 hours ago

If it's open source, there will be many potential providers, though.

tmpz22 2 hours ago

If only we had established means of pooling community resources for the public good

corimaith 9 hours ago

You know a statement like this just makes Chinese big tech look bad, right?

tw1984 12 hours ago

> The best Chinese models are DeepSeek (general purpose)

DeepSeek is developed by the largest Chinese hedge fund. The models they use to make money on the stock market are very profitable, and they have never released anything about those models.

Somehow you are claiming that this same group of people is going to totally change its very consistent long-term behaviour and start promoting openness once it is in the leading position in AI globally?

hurtigioll 12 hours ago

Selling LLMs is much more profitable than trading, and comes with much less risk.

fn-mote 10 hours ago

> much more profitable

I think you made this up.

Right now, I don’t believe any LLM company is profitable at all.

Unless you meant “more profitable” to mean “not-as-badly-negative profit”.