> Browsers and operating systems are increasingly expected to gain access to language models.[0]

Are they?

[0] https://github.com/webmachinelearning/prompt-api/blob/main/R...

I think this is the wrong way. I don’t want my OS or browser to have access to an LLM, but I do want my LLM to have access to a browser or OS (and they already have).

So they should provide an interface to LLMs, disabled by default, enabled when users want it, and that’s it imho.

That also gives me the choice of which LLM provider to use, rather than being locked into whatever LLM Apple decided to put in their OS.

I want to give Claude access to the stuff Apple Intelligence has access to, for example.

(I wrote those words originally.)

Wow. I had no idea that people would misinterpret what I was saying in this way. I was not meaning to imply it was an expectation of users or developers. I meant it as a statement of a growing industry trend among OS and browser vendors of shipping, or preparing to ship, language models.

By now the statement could probably be amended from "expected to gain access to" to "shipping with".

I hope the team maintaining the project now makes such an update, since apparently it's confusing so many people!


Sure. macOS, iOS and Windows have local model APIs for third-party devs. Chrome is trialing it. Firefox uses models to generate alt-text, but no API.

In theory it's useful. If devs can rely on local models, it's more private and decentralized, and they don't need to funnel money to AWS or Anthropic. There are low-stakes use cases that only make sense if they're local (available offline) and free.

But in practice I've seen zero adoption of Apple Foundation Models in native apps. I wonder if any Mac/iOS devs have anything to share on this.
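For concreteness, this is roughly the shape of the interface Chrome is trialing, per the Prompt API explainer linked above — a sketch only, since the API is experimental and the `LanguageModel` global and its method names may still change; the feature-detection and fallback path is what a dev relying on local models would need regardless:

```javascript
// Sketch of the Prompt API shape from the explainer (experimental;
// names like `LanguageModel` are subject to change).
async function summarizeLocally(text) {
  // Feature-detect: the global only exists in browsers shipping the trial.
  if (typeof LanguageModel === "undefined") return null;

  // The explainer describes an availability check, since the local
  // model may be absent or still downloading.
  const availability = await LanguageModel.availability();
  if (availability === "unavailable") return null;

  const session = await LanguageModel.create();
  try {
    return await session.prompt(`Summarize in one sentence: ${text}`);
  } finally {
    session.destroy(); // free the session's resources
  }
}
```

A feature built this way degrades gracefully: callers treat `null` as "no local model here, hide the feature or fall back to a remote API", which fits the low-stakes, offline-friendly use cases described above.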

In practice it’s useful too. The local translation in Firefox is quite good, and I love that I can translate pages entirely on my machine, without the contents going to another server.

As for Apple Foundation Models, I think the issue is more that they’re just not very intelligent or good; maybe WWDC will change that. But if you want to implement LLM functionality, you’re better off either calling an API or shipping a better small on-device model.

Yeah I looked into the Apple Foundation models and was surprised at their limited scope. On reflection it made sense though. They’re giving you the small part of the LLM capability surface that (1) can run with good performance on all their hardware and (2) works reliably.

It’s not enough for a chat-first research agent, but it’s definitely enough to unlock features that rely on natural language understanding. Seems like a small thing compared to Claude/ChatGPT and the general hype, but still magic in its own context.

I don't think this is what was meant. I don't think they were questioning whether OS and browser makers are embedding LLM features, but rather whether people want them.

I find many of them frustrating. I had an iPhone previously, and the LLM summaries of text messages are what drove me to finally drop iOS. I have a family member who is undergoing cancer treatment. I can't explain to you the frustration of seeing a wrong summary when the LLM goes wild hallucinating test results, when the actual text simply said they were taking a test. OS basics and communication should be trustable, not the hallucinations of a small, shitty model.

Those exact words are the positioning statement (the start of the second paragraph) of the document you linked.

What are you trying to say?

Their whole argument rests on this sentence, so I'd expect some rationale. Instead, they provide links to Google, Microsoft and Apple as "examples". The funny thing is that the one by MS is probably the most criticized, with the company partly backpedaling on it. And Apple is often criticized by LLM aficionados for being quite conservative. Google is the one proposing it.

So my question is: are browsers and operating systems really expected to gain access to language models? If so - by whom: the users or LLM vendors like Google?

> What are you trying to say?

GP is clearly asking “Are they?”

That “are expected” is a euphemism for “are shoehorning AI in and trying to shove it down users’ throats”. Whereas the truth is nobody (actual end users, that is) wants it.

I hate having to “dodge” all the AI-enabled controls my phone (iOS) is sprouting - I don’t need that shit, but there’s also no alternative.

It's the typical "cart before the horse" kind of corporate tech talk. It's pretty standard if Silicon Valley wants to sell shit that nobody actually wants: they just assume that people will want it, regardless of whether they actually do. Most of the tech press is too obsessed with retaining their "access" to be critical of this sort of thing, and most of the regular press doesn't care enough to investigate.

We've seen this sort of song and dance before; crypto jumps to mind. Remember when social media sites were suddenly all about those hexagonal avatars? Most of this stuff is really in that same vein.

(To be clear, users don't want this. By pretty much every recent user-feedback metric, AI pushes are largely tiring out users and reek of corporate desperation to sell shit. It's only a very specific subsection of Silicon Valley that wants to stuff AI into everything like this.)

I think the resentment for Copilot is pretty much universal. People like AI, when it’s not forced upon them.

A lot of these products feel driven by an “everything must become AI” FOMO movement, rather than actual thoughtful integration.

Browsers: Chrome (proposed this Prompt API)

Operating Systems: Windows (built-in Copilot), macOS, iOS (Apple Intelligence)

So that covers >90% of desktop browsers and OSes, plus >30% of mobile OSes.

Yes, I think it's very safe to say "browsers and operating systems are increasingly expected to gain access to language models."

These features are enabled by default, and in the case of iOS/macOS, desktop Chrome, and probably Copilot+ PCs as well, they download 4–7 GB local models without properly explaining this to users. That doesn’t demonstrate any demand: if you just don’t use the features and your device doesn’t fill up, you may never notice.

I think this API is probably fine, but only if the user already has a model downloaded and wants these features. Case in point: Chrome quietly downloads Gemini Nano without any opt-out except through group policy. Things like this, and Microsoft’s recent admission that they’ve overindexed on Copilot features in Windows, make it increasingly difficult to trust that users actually want more than a few killer AI features, most of which are just ChatGPT.

Anecdotally, non-technical friends and family members know about ChatGPT and increasingly Gemini, get frustrated by Copilot, and don’t know Apple Intelligence exists.

https://superuser.com/questions/1930445/can-i-delete-the-chr...

The word "expected" is a weasel word in this context, especially given how much backlash MS has received. I'd expect a link to a study where users say "I'd like to have an LLM integrated with my operating system and my browser", and how that changes over time. Then you could seriously argue for "increasingly expected".

You omitted the clause "by shareholders" after "expected".

What this proves is that browsers and operating systems are increasingly integrating language models, not that they are expected to do so.

The only people who expect them to do so are big tech executives. The average user does not expect or want Copilot shoved into every possible corner of Windows, and Microsoft themselves have acknowledged this.