Hacker News

Listen - that's the sound of millions of companies and users doubling down on Chinese models.

It might be a national security problem for other nations to have access to these models. But it's equally now a national security problem for any other nation to depend on them. Or US tech in general.

tkgally 19 hours ago [ - ]

As it happens, the current number-two article on HN is about a similar consequence of Chinese export controls--a car manufacturer developing electric motors that do not use rare earths:

https://news.ycombinator.com/item?id=48510010

roenxi 18 hours ago [ - ]

The incentives around OSS become stronger the further down in the list of market leaders a company is. The #1 company has no particular incentive to push open software apart from a belief that the market is going to be come commoditised anyway. But the 2nd or 3rd largest player has actual incentives to break the market up and remove software quality as a consideration. No #10 may as well not bother with a proprietary option since if they make it a software quality battle they're going to lose each customer 9 times anyway.

Just because the Chinese are running export controls in one market doesn't mean that they're going to close of access to AI. They might, but each market should be considered in isolation.

kccqzy 18 hours ago [ - ]

Realpolitik in action. Great powers just impose export controls because they know they can and they think it would be beneficial to the nation.

zmmmmm 18 hours ago [ - ]

And it is nearly always hubris - the people making these decisions are surrounded by yes-men who built their whole career pumping up the egos of their superiors.

epolanski 5 hours ago [ - ]

This has never been a winning move, yet it became increasingly popular in the last decade.

dyauspitr 17 hours ago [ - ]

Yeah because they’re just using electromagnets. Those motors are not better than the rare earth ones.

Aurornis 18 hours ago [ - ]

> Listen - that's the sound of millions of companies and users doubling down on Chinese models.

They’re falling back to Opus 4.8. Most people weren’t using Fable for everything anyway because it’s so expensive.

None of open weights models are even at Opus 4.8 levels. If someone was using Fable they don’t have any second best alternative outside of Anthropic.

itopaloglu83 18 hours ago [ - ]

A sample of one, but I was getting more stuff done despite Fable uses tokens twice as fast as Opus, because it understood the goals so well and worked to achieve them.

hodgehog11 17 hours ago [ - ]

Same experience. Wouldn't waste my tokens on easy stuff for it. It blasted through some of my toughest problems and produced some truly great code.

mk89 13 hours ago [ - ]

Can you give an example of what those "toughest problems/great code" are? I don't need to know the prompt nor the output, but the general idea, what it is about.

hodgehog11 11 hours ago [ - ]

Some very tough computational geometry problems I couldn't solve on my own, nor with the assistance of other AIs or my colleagues. Fable did them all first go. The most impressive built a custom optimizer with a ludicrous number of adaptive switches that absolutely crunched through an error surface with a bunch of nontrivial nullspaces and some wild curvature. That optimizer is of independent interest; it's not totally novel in theory, but the implementation is an impressive piece of engineering.

12 hours ago [ - ]

[deleted]

2001zhaozhao 18 hours ago [ - ]

> more stuff done

More stuff done per dollar or more stuff done for more dollars? Seems to be an important distinction

itopaloglu83 17 hours ago [ - ]

Given the same usage limits, I was able to get more stuff done and not even hit the usage limits, because I wasn't working on constantly fixing what Opus was trying to do, Fable just understands the task correctly and works great with the given context.

pshc 17 hours ago [ - ]

Same, I was actually having interesting thought experiments with Fable.

malshe 17 hours ago [ - ]

I even upgraded my Max plan because Fable was doing so well.

consumer451 18 hours ago [ - ]

Same here, now n=2.

toss1 7 hours ago [ - ]

Same here, not n=3, plus the above 3 reporting, so n=6 and rising

Fable was definitely better for a variety of tasks, even accounting for using 2X the token rate, like the way it used the tokens faster reduced the wasted tokens, as least for the subset of those who already knew at least some optimizations...?

dbish 18 hours ago [ - ]

Yep. I love open source but there isn’t a model that comes close still to the closed source options like Opus 4.8 and that’s obvious from most people I see across the software industry as well. There are at least another few models after Opus from OpenAI and Anthropic most would go down the list using before any of the Chinese models at this point.

sixothree 16 hours ago [ - ]

I could really use something that can just refactor a few classes and create DTOs from entities.

cube00 18 hours ago [ - ]

> Most people weren’t using Fable for everything anyway because it’s so expensive.

Or they were getting silently rerouted and couldn't realise they weren't using Fable

loeg 17 hours ago [ - ]

> If someone was using Fable they don’t have any second best alternative outside of Anthropic.

GPT-5.5 isn't awful.

dyauspitr 17 hours ago [ - ]

Opus 4.8 has taken such a beating over the last couple of days since the release of fable, videos online of people referring to it like the “redheaded stepchild” (is there a better way of saying this, this sounds racist) basically at this point, everyone is going to be seriously disappointed to fall back to that.

nozzlegear 17 hours ago [ - ]

> is there a better way of saying this, this sounds racist

It's not racist or even politically incorrect in the US, it's a common saying.

johndough 13 hours ago [ - ]

As a foreigner, the casual racism against red-haired people in the US and UK always baffled me.

xienze 9 hours ago [ - ]

As long as the person is white no one will blink an eye no matter what you say.

dyauspitr 4 hours ago [ - ]

Except that’s all every MAGA can talk about. This take is from 2010

poszlem 11 hours ago [ - ]

If anything is racist here it's thinking that redheads are a separate race.

johndough 8 hours ago [ - ]

Feel free to replace "racism" with "discrimination" if you prefer. English is not my first language and the minutiae elude me.

dyauspitr 17 hours ago [ - ]

Yes, I am aware. Kind of paints redheads as unwanted though. Seems hurtful.

nozzlegear 16 hours ago [ - ]

Yeah, not sure where the phrase originated but it does sound bad when you put some thought into it. My sister is a redhead and people loved to make fun of her growing up, telling her there's no way two parents with brown hair could have a kid with red hair, so the mailman (who also had red hair) was obviously her dad.

laichzeit0 15 hours ago [ - ]

As a non-US person, I will use whatever is the best and reasonably priced. I could not give one iota about who makes or hosts these models. The origin or political leanings of these models mean nothing in my usage calculus.

nonethewiser 19 hours ago [ - ]

Which models? Im curious what kind of more specific hypothesis you're willing to put forth. Anthropic going to lose 20-30-40-50% of users to Deepseek? What?

bigyabai 19 hours ago [ - ]

I quit paying for Claude Code to buy z.ai's coding plan for use with OpenCode. I'm not a power user, but I don't regret switching away from Claude. OpenCode is generally nicer for my work.

pkulak 19 hours ago [ - ]

Why z.ai and not an ollama pro plan that can use all the open models? Real question, not snark. I've only ever done ollama and wonder what I'm missing.

cube00 18 hours ago [ - ]

> I've only ever done ollama and wonder what I'm missing.

Friends Don't Let Friends Use Ollama https://news.ycombinator.com/item?id=47788385

commanderkeen08 18 hours ago [ - ]

The z.ai was stupid cheap during the great anthropic opencode rugpull.

bigyabai 18 hours ago [ - ]

Because I bought a year's subscription in December, when it was still $6/mo :P

I have decently capable hardware, but stuff like Qwen 3.6 and Gemma 4 still doesn't compare to agentic editing with a frontier model. Right now, OpenCode's $10/mo "Go" plan is what I'd be looking to try once my year expires.

garciasn 18 hours ago [ - ]

I guess if it works for you, great; that’s why competition is a good thing.

Enjoy.

nonethewiser 18 hours ago [ - ]

Have never heard of it, thanks for the info

paulmist 19 hours ago [ - ]

Aren't biggest Qwen 3.7 closed? I don't suspect China's policy here would be anything but ruthless.

girvo 18 hours ago [ - ]

MiniMax M3 is surprisingly powerful, and open weight (or is about to be). There's others in this space too: MiMo v2.5, GLM 5.1. There's quite a few to pick from if you want strong models running on "your" hardware.

johndough 13 hours ago [ - ]

MiniMax M3 weights have already been released: https://huggingface.co/MiniMaxAI/MiniMax-M3

girvo 10 hours ago [ - ]

Open weights like this make me wish I had a bunch more DGX Sparks to cluster so I could fit it!

andrewchambers 19 hours ago [ - ]

deepseek v4 pro is great and open weight.

EchoVoicy 19 hours ago [ - ]

It is, and I love it, but it isn't capable of performing the tasks I've been giving to Opus, let alone Fable.

Don't get me wrong, I use it, it's fast-smart-and affordable. But not suitable for all tasks.

droidjj 18 hours ago [ - ]

What kinds of tasks are you finding deepseek v4 incapable of?

EchoVoicy 16 hours ago [ - ]

For starters, there's a C++ application written with MFC and an absolute ton of inline assembly and threading (yes, in a 1990's C++ application). I'm porting it to MacOS/Linux currently.

Opus 4.6+ is able to make slow progress, but it takes several revisions per workstream. It requires constant supervision as it often creates convoluted solutions that expand the code in bloated ways. It works, but still requires my constant input.

Fable was able to almost one shot most of the big migrations with very few bugs, and was able to fix those bugs with 1 review pass. I almost didn't believe it. I was able to put it on a task (with dangerous permissions) and come back hours later to see it done, working, and clean.

I tried DeepSeek v4 and it wasn't able to make any meaningful progress at all. It kept creating dangling pointers and had trouble understanding the inline assempbly needed to be replaced if we were to compile for 64 bit. It kept getting stuck and looping on the same problem, without making progress.

What I do use DeepSeek for is lots of my automations on my websites. I find DeepSeek is fantastically cheap and fast and effective as summarization, collation, generating reports, finding and reporting issues from logs, etc. But I haven't found a way to get it to effectively port 90's C++ code to modern, cross-platform standards. But I want to be clear- I really like DeepSeek and use it wherever I can.. I mean.. it's so affordable!

noworriesnate 12 hours ago [ - ]

I was building a cli tool that showed a graph of git commits, kind of like git log --graph, and deepseek v4 simply could not figure out a specific ui quirk where things weren't lining up correctly. I spent like $0.10 and 30 minutes trying to figure it out on deepseek.

Then I had deepseek summarize the bug, gave it to Opus, and it solved it in $1.12 and five minutes.

epolanski 5 hours ago [ - ]

Fun fact, I was trying this afternoon Deepseek vs Opus 4.8 high, and I was surprised at how good Deepseek was. It outperformed Opus 4.8 on multiple occasions.

Found just later I was using v4 flash and not pro (for mistakenly setting the model to deepseek-chat and not v4-pro).

There are aspects about Deepseek I don't like though, when pushed against it will eagerly bend instead of reasoning and advocating for his points, something Opus 4.7 and later models started doing a lot (even when wrong).

ac29 17 hours ago [ - ]

All current Qwen 3.7 models are closed though they have said more releases are coming

epolanski 5 hours ago [ - ]

US is well known to impose world wide embargoes on technologies and resources that they pretend to apply to companies beyond its borders.

And it works, because the American market is generally more important than the markets of these countries 9 times out of 10.

epolanski 9 hours ago [ - ]

As an European I'm happy to be convincing my clients from years to move out of any us dependencies.

It's tough, US technologies are everywhere but they are only liabilities. Microsoft is the stickiest of all.

WarmWash 18 hours ago [ - ]

You are drinking the cool aid if you think the CCP is going to let the world get ahead of China using CCP models.

epolanski 5 hours ago [ - ]

They are?

They have pioneered and been very influential in different aspects of LLM engineering and research, while also publishing in the open.

You would not have Fable 5 as good as it is without Chinese work.

operatingthetan 18 hours ago [ - ]

Do you mean Kool-Aid?

aabhay 16 hours ago [ - ]

Banned Aid

ks2048 19 hours ago [ - ]

Wait until it is illegal to download or use Chinese models (only half-joking).

platinumrad 19 hours ago [ - ]

Anthropic is explicitly lobbying for this.

mcast 18 hours ago [ - ]

Is there any SCOTUS precedent for this? It seems like a huge 1A issue for the government to limit self hosted access to a foreign country’s LLM.

wyrdcurt 16 hours ago [ - ]

After what happened to TikTok, I don't think it's a stretch.

mcast 4 hours ago [ - ]

TikTok wasn’t open source and able to be run on your own hardware. Banning Alibaba’s models (or even a personal fork of them) running on your machines seems hard to defend in court.

fosco 18 hours ago [ - ]

Know where I can read about that?

platinumrad 18 hours ago [ - ]

The two main bills I'm aware of are the Decoupling America's AI Capabilities from China Act and No Adversarial AI Act. The former would have made it illegal for any American citizen to simply use DeepSeek. I couldn't find any lobbying data, but the obvious effect is that Americans would be forced to pay for more expensive domestic alternatives.

A House committee also recently probed Cursor and Airbnb for using Chinese models, rather than more expensive American alternatives. A sexagenarian Congressman gave a nonsense quote that he certainly did not come up with himself,[1] which sounds very similar to language Anthropic uses in its marketing materials.[2][3]

[1] https://www.semafor.com/article/04/29/2026/house-committee-p...

[2] https://www.anthropic.com/news/updating-restrictions-of-sale...

[3] https://www.anthropic.com/research/2028-ai-leadership

aesthesia 17 hours ago [ - ]

Moolenaar's quote: "The AI models these companies use are trained by China’s censorship regime and introduce hidden vulnerabilities that put Americans’ data and businesses at risk." That is, Americans using Chinese-trained AI models are exposed to some form of cybersecurity risk.

That's not really a threat model described in either of the Anthropic posts you share, which mainly talk about the risks of allowing authoritarian regimes to use powerful US-trained models, and the geopolitical risks of authoritarian countries developing strong AI before democratic/liberal countries do.

karmasimida 18 hours ago [ - ]

Anthropic hates open weight Chinese models so yes

sh34r 17 hours ago [ - ]

Good thing these corrupt gerontocrats are also all in on cryptocurrency then.

19 hours ago [ - ]

[deleted]

CamperBob2 18 hours ago [ - ]

Nothing funny about it. That's exactly what Amodei asks for, every time he rubs his monkey's paw.

verdverm 18 hours ago [ - ]

They'll have to remove sections like this from their AI Action Plan

> We need to ensure America has leading open models founded on American values. Open- source and open-weight models could become global standards in some areas of business and in academic research worldwide. For that reason, they also have geostrategic value. While the decision of whether and how to release an open or closed model is fundamentally up to the developer, the Federal government should create a supportive environment for open models.

ks2048 18 hours ago [ - ]

Unless they (gasp!) write some statement they don’t believe or don’t follow through with.

utilize1808 7 hours ago [ - ]

US will ban American companies from using Chinese models and also ban them from dealing with companies who use Chinese models. “Code produced by Chinese models may be deliberately introduce backdoors and vulnerabilities” that kind of thing.

hmmmmmmmmmmmmmm 13 hours ago [ - ]

Chinese models are next, the whole reason this is happening is because they don't want China to steal their tech. It is no secret anymore that they have been distilling US models. That's why it is explicitly aimed at foreign nationals.

rw2 18 hours ago [ - ]

Not really, they are not even as good as opus 4.7

anonzzzies 17 hours ago [ - ]

So, a few month difference... Definitely usable as far as we found, especially being so much cheaper.

miyuru 15 hours ago [ - ]

yes, I am using mimo code(free version) for the last 2 days. I gets the job done for me.

If I need to upgrade, the plan start at $6, so its a no brainer.

dyauspitr 17 hours ago [ - ]

To do what? I mean they’re good models, but frankly, they fucking suck (relatively speaking). I’m not looking to going back to a week of back-and-forth with the LLM once I’ve gotten used to all this one shotting.

256BitChris 18 hours ago [ - ]

No one serious is using the open models. Using them is like traveling back 2-2.5 years in time and using ChatGPT.

zmmmmm 18 hours ago [ - ]

DeepSeekv4 Pro is roughly Opus 4.5 - Opus 4.6 in my estimation. That's about 8 months difference, not 2.5 years.

It's definitely not as good. But it's also definitely good enough.

EchoVoicy 16 hours ago [ - ]

Curious- in what tasks? I find Opus 4.5/4.6 too expensive and have tried to migrate to DeepSeek for C++ work, but found it couldn't cut it.

What's your DSv4 setup? What harness? It sounds like I should give it another try!

zmmmmm 13 hours ago [ - ]

works really well with pi for small to medium sized coding tasks for me - C++ is an interesting case since it's probably more challenging just due to the complexity of the syntax. But it works great with Groovy which is another slightly off-mainstream language (these days).

lbreakjai 13 hours ago [ - ]

I use DSv4 through opencode. I use it from deepseek directly, not through a third-party platform.

I mostly do C# and some frontend. I was starting to feel really depressed and unengaged at work because I was starting to use AI far too much like a magic slot machine. I'm now making a conscious effort to go back to using it as a tool used a bit more deliberately.

I'm not even using the pro model. The flash version is fast so I can keep it interactive rather than context switching to reddit while the model is working, and it turns out using my brain means I don't really need the model to be that smart.

I spent about $1.5 this week.

untcarcandy 17 hours ago [ - ]

[dead]