Hacker News

libraryofbabel 16 hours ago [ - ]

I am saying this probably is "silly behavior by a government" and it is a milestone that points towards what the future may look like. Why can't it be both?

It's easy to wave this aside as the current administration playing political games. But I don't think there is any reason to assume that the current era of open availability of models is going to continue indefinitely. Do you think that Chinese labs will continue to release open models forever, even why they get to the level that Mythos is at now, and beyond? And do you think that a competent US government would have no interest in regulating and restricting model access in 2 years time, assuming that model capabilities continue to improve? I think we bias towards thinking the status quo is the norm and will continue, but this news invites us to question that assumption and think about different ways the future could go.

gpm 16 hours ago [ - ]

> Do you think that Chinese labs will continue to release open models forever

Yes.

I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

And simultaneously that the only way they can actually get everyone to use their models is if it's possible for us to run them on our own hardware.

(This isn't exactly a utopian view of the future)

jychang 16 hours ago [ - ]

This is going to age very poorly when the best Chinese labs ALREADY just started not open sourcing their models.

Qwen 3.7 is not open source; previous Qwen versions would have open source releases, but Qwen 3.7 plus does not. The second best Chinese model, Minimax M3, is testing the waters by taking longer and longer between “model release” and open sourcing it. This time, they spent 2 weeks after release before open sourcing it. There’s also a lot of rumors of GLM and Deepseek not open sourcing future models.

It’s pretty obvious that you cannot take Chinese models as open source for granted, they’ll be closed source soon.

roenxi 8 hours ago [ - ]

If we're measuring progress in hours and days then yes. But if we're measuring progress in months then OSS models are doing fine. You can get a state-of-the art performance in an open model if you pretend it is January 2026 instead of June.

There is no evidence here that the cutting edge labs have any durable advantage. Extrapolating current trends it seems likely that even the Europeans will be capable of meeting any given performance measure with enough time. In fact the evidence suggests that the capital required to run the models is where a moat will develop. Knowing the weights won't help much.

chabes 4 hours ago [ - ]

Qwen does closed and open. This is not new.

rupx 5 hours ago [ - ]

Kimi 2.7 and GLM 5.2 released today and are open source.

c0rruptbytes 5 hours ago [ - ]

Minimax M3 too, and huawei claims to be releasing non-nvidia dependent training software too. openPangu 2.0 could be a shake-up if it holds up as a good model

China may not care about open source, but they know they will personally fund AI through government investments while US relies on private investments, best way to scare private investments is a free capable alternative for everyone

Add on the fact that they actually invested in energy infrastructure and can offer AI very cheap to their citizens and you can get a population well versed in AI to reduce menial tasks and focus on more productive things (if we're to believe the claims of the technology)

csomar 13 hours ago [ - ]

The best chinese models are deepseek (general purpose) and glm (coding) and they are both open weight and share lots of their tooling.

There are lots of AI companies and it doesn’t seem that they all have the same funding fountain or share monetization goals. I wouldn’t read much into what each one of them is doing.

Barbing 12 hours ago [ - ]

Had seen weeks back that the top two non-Western models on ArtificialAnalysis were both closed: https://artificialanalysis.ai/#intelligence-category-tabs

How much stock should we put into that graph, though, I'm not sure.

sameersri2004 11 hours ago [ - ]

Even if the models by the Chinese labs are open source or open weights even after they get to mythos level intelligence lets say, still inference and the optimization of those models to be accessed at speeds of 1000 tokens/sec in not in the hands of general public as these models have parameters more than a trillion and they can't be run on some publicly available hardware, So even after being open source it does'nt fix the problem as the general public will still pay the company for inference.

state_less 9 hours ago [ - ]

I'm pretty sure these large models are run on Nvidia GPUs, not some unobtainable piece of secret kit. You could go down the street and buy from AMD or a number of other vendors to push out FLOPs if you wanted or needed, but you'll need a thick wallet to shell out for a cluster of GPUs to run these models. The reason people don't run the big Chinese models at home is that they can't afford the hardware, not that it isn't publicly available. This tech is essentially a large amount of matrix multiplications afterall.

I think the larger problem is that restricting US AI companies gives the Chinese a leg up because they now have a window open where they can become the source of the most powerful models available due to government restrictions rather than on technical merits. All Anthropic customers just got a downgrade last evening, for example. While the Chinese are able to serve the world or whoever, the US corporations will be limited to the US market, or whatever the powers that be will allow. This restrictiveness could turn out to be disadvantageous to American companies since people will migrate to wherever they can get the most powerful models.

rcxdude 10 hours ago [ - ]

if it's open source there will be many potential providers, though.

tmpz22 2 hours ago [ - ]

If only we had established means of pooling community resources for the public good

corimaith 9 hours ago [ - ]

You know a statement like this just makes Chinese big tech look bad right?

tw1984 12 hours ago [ - ]

> The best chinese models are deepseek (general purpose)

DeepSeek is developed by the largest Chinese hedge fund, their models used to make them $ on the share market are very profitable, they've never ever released anything on those models.

Somehow you are claiming that those same group of people are going to totally change their very consistent long term behaviour and start promoting openness when they are in the global leading position in AI?

hurtigioll 12 hours ago [ - ]

selling LLMs is much more profitable than trading, and with much less risk

fn-mote 10 hours ago [ - ]

> much more profitable

I think you made this up.

Right now, I don’t believe any LLM company is profitable at all.

Unless you meant “more profitable” to mean “not-as-badly-negative profit”.

ls612 15 hours ago [ - ]

The main reason the Chinese labs are releasing models as open weights is because they don't have the compute necessary to provide all of the inference. For the US frontier models something like 80-90% of the lifetime compute required for the model is inference rather than training. China wants to shepherd as much of their limited compute as possible towards training to keep up in the race.

Slartie 13 hours ago [ - ]

I think the main reason is to minimize the market for closed-source models from US companies.

China knows that doing what Anthropic/OpenAI/Google/... are doing is impossible for them. No one outside of China in any sane condition will send their data to compute farms IN CHINA like people currently do with US-based frontier models. Even if they could muster the inference power.

Hence they do the second-best thing possible to attack the dominance of the US-based corporations: reduce their moat by open-sourcing models that are not fully equal, but practically useful and good enough for easily 90% of typical tasks people use agents for in their daily lives. But way cheaper to run.

As long as this arms race in AI continues, China as "number two" will have some incentive to continue open-sourcing models. But of course the US government might force a change if they continue to enforce limited public access to new frontier models - there is no market to minimize if a model is not allowed to be publicly available.

Al-Khwarizmi 12 hours ago [ - ]

I'm European and I don't see sending my data to China as more risky than sending it to the US. Rather the opposite.

I think your vision of how the rest of the US sees the world is tinted by a massive bias.

wongarsu 12 hours ago [ - ]

As a private citizen, yes.

But at work the calculus is entirely different. There is already lots of exposure to US companies (guess where our emails and tickets life), so the increase in espionage risk from adding another American company is small. Not zero, and trust towards AI companies is limited. But adding the first Chinese company to send data to would be a major risk. One nobody would sign off on, given the general reputation of the Chinese economy for widespread espionage, disregard for copyright and producing copies of successful products using insider information

dofm 7 hours ago [ - ]

Not sure why anyone in the EU thinks the US is not a significant espionage risk. Adding any major US supplier would have been a significant espionage risk until really recently.

Before the EU cleaned up Europe's act pretty considerably on corruption, US companies used corporate but also state-level espionage actors to level the playing field against a culture of bribes and they were fairly open about it. They absolutely needed to do it, because of the potential penalties back home if they engaged in bribery abroad.

The tables have turned, now. The EU runs much more cleanly than decisionmaking in DC, which is clearly corrupted and lubricated with cash and opportunities for failsons and faildaughters; it has accelerated radically quite recently but it was heading that way from the first Bush era.

But I'd bet the corporate-state merger of industrial espionage is in full flow.

stickfigure 6 hours ago [ - ]

This would require active participation by people inside Anthropic and OpenAI. Given how generally ideological the people working in these companies are, I'd be willing to bet that we would already be reading Snowden-style leaks if it were true.

I have zero expectation that a similar culture exists inside Chinese companies. If you think these corporate and national cultures are the same, you need to adjust your priors.

dofm 6 hours ago [ - ]

> This would require active participation by people inside Anthropic and OpenAI.

Not necessarily of the companies themselves, though; just embedded people at the right hiring level.

> Given how generally ideological the people working in these companies are

History has many examples of truly surprising spies, over the long term. Including in highly ideological environments such as animal rights and eco-campaigning groups. The embedded police spying scandals in the UK make this clear.

It is naïve to think that there are no CIA or NSA employees in some functional role at these two businesses, just as it is naïve to think that they don't have intelligence industry contacts playing them because they are naïve. You only have to look at how the NSA weakened open cryptography to see that two companies staffed by young, absurdly rich people barely out of college with wobbly moral e/acc compasses might be getting played by homegrown spooks.

> I have zero expectation that a similar culture exists inside Chinese companies. If you think these corporate and national cultures are the same, you need to adjust your priors.

I suggested absolutely nothing of the sort — I flatly was not talking about China at all.

FWIW it cuts both ways: in the dim and distant past of the early dot-com era, I remember encountering someone who wafted inexplicably between US and UK multinational companies who I thought was possibly British intelligence. An odd duck for sure.

rvnx 11 hours ago [ - ]

> given the general reputation of the Chinese economy for widespread espionage, disregard for copyright and producing copies of successful products using insider information

Quite funny because if you use that phrase verbatim except swapping China with the US it could actually fit.

Good governments try to do things that are in the interest of their population, and yes it could mean opposite interests to your/someone else governments.

No reason to blame US, Israel, China, Russia, etc. They just defend their piece of cake.

59nadir 11 hours ago [ - ]

Anthropic and OpenAI are not just "another American company", their entire business (and industry) was created based on stealing data and using it for profit. You make this point about "another company" so casually that you'd think you added a SaaS bill for generating thumbnails or whatever. The exact same point you make about China can be made much more confidently and with stronger evidence for the entire modern LLM lab industry.

Again I have to echo the previous poster's point: Most people outside of the US really do not see the US as some much better alternative than China. If anything, in the specific area of LLMs, China are the ones doing work benefitting the everyman whereas almost everything the US labs do does not.

wongarsu 11 hours ago [ - ]

That's why I added "Not zero, and trust towards AI companies is limited". Reaching the decision that adding one single US-based LLM provider had more benefits than risks took months. And we were selective about who that would be (hint: not OpenAI). And I know companies who are not willing to go that step, using open-weight models on their own infra instead. But outsourcing inference to China was never even a serious suggestion. The notion is absurd to us

That said, I imagine e.g. South Americans thinking very differently on this front

tripzilch 4 hours ago [ - ]

> disregard for copyright

what did you think US-based AI is trained on

I'm pretty sure the US just jumped to the front of the list with their biggest IP heist in humankind history

crote 8 hours ago [ - ]

I'm not sure I agree.

China indeed has a general reputation for widespread espionage, so any Chinese company wanting to expand into the European market has to prove it isn't spying on its potential customers. US companies have traditionally been seen as friendly, so their platforms are essentially built around "trust me bro" guarantees.

In a world where both China and the US are now seen as hostile-by-default, this might actually leave some Chinese companies with an advantage in their ability to demonstrate trustworthiness.

dofm 6 hours ago [ - ]

The blurring of US state and corporate espionage in the EU is the stuff of legend. They have always spied, and you can easily make the case that in late 1980s/early 1990s Europe they had good reason to, because European businesses were corrupt.

rvnx 12 hours ago [ - ]

Totally agree, though it is an unpopular opinion here.

It’s the same paradox as people claiming: “we are European, our data is safer in Europe” when actually your privacy is higher when your data is stored in China (or Russia) you are safer because it is out of reach from your local government.

The only thing I dislike, and that’s no matter the service, is that my data or information usage is shared with third-party.

For example, Anthropic conveniently forgets to mention Datadog has tons and tons of information about Claude users, or that your data transits through machines they don’t operate.

crote 7 hours ago [ - ]

Safety has more than one definition. Being able to sue the company in small claims court when it threatens to delete your account is also part of that, and so is being able to pay for the service when Russian companies are once again put on a sanctions list.

WarmWash 9 hours ago [ - ]

China wants everyday people data because some of those people will get power one day, and China wants to be able to leverage knowledge of you, perhaps even "deep dark secret" data, if they need to.

viking123 4 hours ago [ - ]

Israel already does this through Epstein information from all the cameras and microphones that were listening and filming all the powerful people who visited the Island and the houses. They probably have a new Epstein already.

dariosalvi78 12 hours ago [ - ]

was going to say this.. open sourcing Chinese models will enforce Chinese dominance instead of reducing it. When an open Chinese model becomes the best alternative to inaccessible closed US models guess what everybody will start to use. And that same open model may embed certain narratives and values that please the Chinese government.

nxm 10 hours ago [ - ]

Doubtful that’s happen

FpUser 6 hours ago [ - ]

This sounds like a really strong argument

Barbing 12 hours ago [ - ]

Ya. You know enough about China to know: would they be willing to sell users outside of China models that aren't fully pro-China (and won't deflect on tough questions)? Or would that be dirty money that they wouldn't want anyone to make?

Like if they could release Ch-ythos 6 tomorrow BUT it had Western ideals, would they take the fame, clout, attention, & profit, or stick to the party line?

(hope the monolithic brush is appropriate, considering, I mean it's an impressive system/country even if I have my own strong preferences - also I've taken as true reporting about their models deflecting etc. on sensitive topics)

rvnx 12 hours ago [ - ]

Sounds perfect, sell it to me.

I use LLMs for health, design and programming.

If you want to make a political or religious pamphlet it’s not a single LLM that you should base yourself on. No matter where it comes from.

throwa356262 8 hours ago [ - ]

Serious question: why would sending data to China be worse then the US?

londons_explore 15 hours ago [ - ]

With nearly everyone using inference accelerators, the pool of hardware is no longer shared between training and use.

SubiculumCode 13 hours ago [ - ]

No, they are open sourcing them because they don't have another play, being second/3rd tier lans

zardinality 15 hours ago [ - ]

[dead]

nine_k 15 hours ago [ - ]

The US administration restricting the use of US-trained models is one of the best gifts it could make to the Chinese LLM producers, and to the PRC government.

dozerly 15 hours ago [ - ]

This entire administration is a gift to everybody but the US. It’s either in service of Russia, China or whoever is willing to pay Trump the most.

rjzzleep 14 hours ago [ - ]

Chinese have a nickname for Trump. 川建国. Trump the nation builder(meaning China). But Biden actually continued most of Trumps policies.

Der_Einzige 8 hours ago [ - ]

I won’t forgive Biden for not reversing more of trumps policies, especially immigration

Between RBJ refusing to step down, Biden not reversing immigration policy, and Biden refusing to step down in the primary until too late, he’s going to go down as a poor president in the history books - even if he wasn’t a bad dude or even bad in terms of policy.

FpUser 6 hours ago [ - ]

He was getting senile. What did you expect. There must be age limit for rulers

Der_Einzige 4 hours ago [ - ]

Trump was also getting senile before they attempted to assassinate him. Hatred of his enemies gave him another 5 years of energy. Very frustrating, because he absolutly was doing word salad nonsense like this regularly before someone tried to shoot him:

"Look, having nuclear — my uncle was a great professor and scientist and engineer, Dr. John Trump at MIT; good genes, very good genes, OK, very smart, the Wharton School of Finance, very good, very smart — you know, if you’re a conservative Republican, if I were a liberal, if, like, OK, if I ran as a liberal Democrat, they would say I'm one of the smartest people anywhere in the world — it’s true! — but when you're a conservative Republican they try — oh, do they do a number — that’s why I always start off: Went to Wharton, was a good student, went there, went there, did this, built a fortune — you know I have to give my like credentials all the time, because we’re a little disadvantaged — but you look at the nuclear deal, the thing that really bothers me — it would have been so easy, and it’s not as important as these lives are — nuclear is so powerful; my uncle explained that to me many, many years ago, the power and that was 35 years ago; he would explain the power of what's going to happen and he was right, who would have thought? — but when you look at what's going on with the four prisoners — now it used to be three, now it’s four — but when it was three and even now, I would have said it's all in the messenger; fellas, and it is fellas because, you know, they don't, they haven’t figured that the women are smarter right now than the men, so, you know, it’s gonna take them about another 150 years — but the Persians are great negotiators, the Iranians are great negotiators, so, and they, they just killed, they just killed us, this is horrible." - Donald Trump, 2016

lyu07282 3 hours ago [ - ]

> even if he wasn’t a bad dude

Technically his material support to a genocide makes him complicit, it would not have been nearly at the scale without US support tens of thousands of women and children were murdered as a direct result of his decisions[1], if international law meant anything we would hang him for that. So no, he was a "bad dude".

[1] https://en.wikipedia.org/wiki/Gaza_genocide

scotty79 12 hours ago [ - ]

It's funny how the acceleration of the downfall of the US (due to trump) is a gift to everyone else. It's almost as if US didn't have as postitive impact on the world as they thought.

gpm 8 hours ago [ - ]

A gift to [every dictatorial regime]. It's not a gift to the common people. The hundreds of thousands of people who got aids, and wouldn't have if not for Trumps withdrawal, didn't benefit. The women of Afghanistan didn't benefit. The countries of the EU... Canada... Korea... Taiwan... Ukraine... really just about any democracy didn't benefit.

The downfall of the US benefiting bad people is not evidence that the US didn't have a positive impact.

rvnx 11 hours ago [ - ]

Downfall sounds exaggerated.

US is a great and respectable country with amazing nature, people tech and military, very very far a collapsed state.

If anything to be worried of, it's the state of Europe. Closer and closer to war, full of insecurity and no innovation.

US is a great country.

vintermann 12 hours ago [ - ]

There's also the Meta motivation, that even if you don't get the control you would like from releasing a model, it may still be worth it to at least deny others that control. I'm sure that matters even more to China vs. the US than it mattered to Facebook vs. Google.

spiralpolitik 4 hours ago [ - ]

There is no moat in the model and by making the them open, it’s hard for one to be established when the free models are “good enough”.

OpenAI and Anthropic are both hamstrung by this. Anthropic does have the better chance of surviving.

close04 13 hours ago [ - ]

You don’t need the cutting edge to influence people’s opinion. “Export LLMs” to the rescue.

tw1984 16 hours ago [ - ]

> I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

that doesn't require the model to be SOTA, it can be just a compact model capable of running on some inexpensive hardware. that is vastly different from SOTA models like Mythos which can potentially disrupt lots of things.

strangegecko 16 hours ago [ - ]

Of course it requires SOTA, people will always choose better models over some compact thing that is obviously more limited. You can't control the truth with models nobody wants to use.

columnarx3 15 hours ago [ - ]

People choose SOTA right now because of the heavily subsidised model subscriptions. People aren't going to pay 20x the price for a model that's maybe 10% better.

ezst 14 hours ago [ - ]

And the fact that "better" is highly subjective and domain/task/vibe-specific

adrianN 14 hours ago [ - ]

Why do I want the model I use for coding to know Shakespeare or vice versa?

Jare 13 hours ago [ - ]

Because you communicate with it using natural language and real-world references and descriptions of what you want, you use emotion and emphasis (especially when re-prompting), you use examples and illustrative stories and common expressions. Understanding and interpreting all of that and replying in kind, to some degree, requires a large body of non-computation, cultural knowledge, or else the prompts are just meaningless words, and the replies will look like compiler output.

adrianN 10 hours ago [ - ]

That sounds intuitively true, but I’m not convinced that it is actually the case. I don’t think we know enough about neural network training to say what training and how many parameters are necessary for what kind of performance on which tasks. To me it looks like we currently guess that more is better and try to throw as much compute and data at the problem as is economically feasible. There is little incentive for companies to invest into small model research since their moat is huge models that require special hardware to run.

Der_Einzige 8 hours ago [ - ]

This is why: https://www.emergent-misalignment.com/

rjzzleep 14 hours ago [ - ]

Small models are the future.

baq 11 hours ago [ - ]

> > Do you think that Chinese labs will continue to release open models forever

> Yes.

holy shit the naivete of HN nowadays.

deanishe 16 hours ago [ - ]

> Why can't it be both?

Is the government going to fund all further development? Hard to imagine investors continuing to throw billions at products they aren't allowed to sell.

CraftingLinks 13 hours ago [ - ]

Why wouldn't they? They see this technology as a military asset now.

VBprogrammer 12 hours ago [ - ]

Honestly, with the caliber of people who currently comprise the US administration; leaving the whole thing to Openclaw and some new fancy model might not be the worst idea.

8 hours ago [ - ]

[deleted]

layer8 9 hours ago [ - ]

Trump and friends are only interested in investments they can personally make money from.

JohnBooty 5 hours ago [ - ]

Yeah, there’s been a lot of debate about this on r/localllama — will there be a steady supply of new free/open models in the future?

And if not, can we simply keep augmenting “stale” models with new knowledge to keep them useful?

I’m on the pessimistic side of things on both questions.

As for the second question, obviously stale models can be augmented to an extent but it’s nowhere near a substitute for new knowledge being fully baked directly into its training.

locknitpicker 16 hours ago [ - ]

> I am saying this probably is "silly behavior by a government" and it is a milestone that points towards what the future may look like. Why can't it be both?

Here is why it's unlikely this is anything other than "silly behavior by a government":

- some benchmarks show GPT-5.5, Gemini 3.1, and even Claude Opus outperforming Claude Fable, and yet it's Fable which is restricted.

- some benchmarks still show the likes of Kimi 2.5 outperforming any Claude model, and DeepSeek is getting equivalent scores (a few tenths of a percent difference)

> Do you think that Chinese labs will continue to release open models forever (...)

That's immaterial to the discussion. Even if China forced Chinese labs to restrict access to all models, the truth of the matter is that Trump's administration to restrict access to US-based models does not prevent others from having access to models that are as capable or even better.

So what's exactly the point of this?

solumunus 15 hours ago [ - ]

You’re completely overrating these benchmarks and it’s landing you at a nonsense opinion. Just actually use the models and you will see that the gap is significant.

irthomasthomas 11 hours ago [ - ]

It should be easy for a company like Anthropic to prove this beyond a doubt. Why don't they? Why don't they have a collection of prompts and side-by-side comparisons with other models showing how far ahead they are?

largbae 10 hours ago [ - ]

I think it's mainly because the difference in models at the frontier isn't "response to prompt X", but rather "coherence with 500K tokens of context and instructions in play"

viking123 4 hours ago [ - ]

Good morning to the Anthropic office good sir

dagss 13 hours ago [ - ]

I got to try using Fable for a day... it was a clear and definite shift in quality and how independent it is.

It was almost like having another human using and shepherding Opus for me, instead of herding Opus directly myself.

rileyphone 15 hours ago [ - ]

All that says is some benchmarks aren’t worth the tokens it takes to evaluate them. Mythos is clearly capable of finding zero days other models can’t, and Fable is close enough to be lumped with it.

mullingitover 14 hours ago [ - ]

> Mythos is clearly capable of finding zero days other models can’t

I'm unconvinced that this is anything more than proof of work and marginal improvement that other models will catch up with, perhaps as early as to next week. Lots of other current-gen models will find vulns that can be chained together if you're willing to burn enough tokens on the task, and Fable is an absolute token incinerator.

kolinko 15 hours ago [ - ]

Did you use the models yourself?

lightbendover 3 hours ago [ - ]

[dead]