$1k per day, 50 work weeks, 5 days a week → $250k a year. That is, to be worth it, the AI should work as well as an engineer who costs a company $250k. Between taxes, social security, and the cost of office space, that engineer would be paid, say, $170-180k a year, like an average-level senior software engineer in the US.
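The napkin math above can be sketched out; note the overhead multiplier is an assumption (a common rule of thumb), not a figure from this thread:

```python
# Napkin math for the claim above: $1k/day in tokens vs. a senior engineer.
TOKEN_SPEND_PER_DAY = 1_000          # dollars
DAYS_PER_WEEK = 5
WEEKS_PER_YEAR = 50                  # 52 minus roughly two weeks of holidays

annual_token_cost = TOKEN_SPEND_PER_DAY * DAYS_PER_WEEK * WEEKS_PER_YEAR
print(annual_token_cost)             # 250000

# Assumed loaded-cost overhead (taxes, social security, office space):
# ~1.4x of salary is a common rule of thumb, used here for illustration.
OVERHEAD_MULTIPLIER = 1.4
implied_salary = annual_token_cost / OVERHEAD_MULTIPLIER
print(round(implied_salary))         # 178571, i.e. in the $170-180k range
```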
This is not an outrageous amount of money, if the productivity is there. More likely the AI would work like two $90k junior engineers, but without a need to pay for a vacation, office space, social security, etc. If the productivity ends up higher than this, it's pure profit; I suppose this is their bet.
The human engineer would be like a tech lead guiding a team of juniors, only designing plans and checking results above the level of the code proper, dipping below that only in exceptional cases, like when a human engineer looks at the assembly a compiler has produced.
This does sound exaggeratedly optimistic now, but does not sound crazy.
It’s a $90k engineer that sometimes acts like a vandal, who never has thoughts like “this seems to be a bad way to go. Let me ask the boss” or “you know, I was thinking. Shouldn’t we try to extract this code into a reusable component?” The worst developers I’ve worked with have better instincts for what’s valuable. I wish it would stop with “the simplest way to resolve this is X little shortcut” -> boom.
It basically stumbles around generating tokens within the bounds (usually) of your prompt, and rarely stops to think. Goal is token generation, baby. Not careful evaluation. I have to keep forcing it to stop creating magic inline strings and rather use constants or config, even though those instructions are all over my Claude.md and I’m using the top model. It loves to take shortcuts that save GPU but cost me time and money to wrestle back to rational. “These issues weren’t created by me in this chat right now so I’ll ignore them and ship it.” No, fix all the bugs. That’s the job.
Still, I love it. I can hand code the bits I want to, let it fly with the bits I don’t. I can try something new in a separate CLI tab while others are spinning. Cost to experiment drops massively.
Claude Code has those "thoughts" you say it never will. In plan mode, it isn't uncommon for it to ask you: do you want to do this the quick and simple way, or would you prefer to "extract this code into a reusable component"? It will also back out and say "Actually, this is getting messy, 'boss', what do you think?"
I could just be lucky that I work in a field with a thorough specification and numerous reference implementations.
I agree that Claude does this stuff. I also think the Chinese menus of options it provides are weak in imagination. That means that for thoroughly specified problem spaces with reference implementations you're in good shape, but if you want to come up with a novel system, experience is required; otherwise you will end up in design hell. I think the danger is in juniors thinking the Chinese menu of options provided contains "good" options in the first place. Simply because they are coherent does not mean they are good, and the "a little of this, a little of that" game of tradeoffs during design is lost.
This has happened to me too. Claude has stopped and said on occasions "this is a big refactor, and will affect UI as well. Do you want me to do it?"
why stop at 5 days a week?
$250k a year, for now. What's to stop Anthropic from doubling the price if your entire business depends on it? What are you gonna do, close up shop?
Yeah this is just trading largely known & controllable labour management risks for some fun new unknown software ones.
You can negotiate comp with your human engineers; you may not be able to negotiate with as much power against Anthropic et al. (or stop them if they start to change their services for the worse).
What’s to stop them? Competition.
From whom? OpenAI and Google? Who else has the sort of resources to train and run SOTA models at scale?
You just reduced the supply of engineers from millions to just three. If you think it was expensive before ...
> Who else has the sort of resources to train and run SOTA models at scale?
Google, OpenAI, Anthropic, Meta, Amazon, Reka AI, Alibaba (Qwen), 01.AI, Cohere, DeepSeek, Nvidia, Mistral, NexusFlow, Z.ai (GLM), xAI, Ai2, Princeton, Tencent, MiniMax, Moonshot (Kimi), and I've certainly missed some.
All of those organizations have trained what I'd class as a GPT-4+ level model.
Ah but I said "_... and running at scale_"
Of the list I gave you, at a guess:
Google, OpenAI, Anthropic, Meta, Amazon, Alibaba (Qwen), Nvidia, Mistral, xAI - and likely more of the Chinese labs but I don't know much about their size.
I guess where I was leading to is who owns the compute that runs those models. Mistral, for example, lists Microsoft and Google as subprocessors (1). Anthropic is (was?) running on GCP and AWS.
So, we have multiple providers, but for how long? They're all competing for the same hardware and the same energy, and it will naturally converge into an oligopoly. So, if competition doesn't set the floor, what does?
Local models? If you're not running the best model as fast as you can, then you'll be outpaced by someone that does.
1. https://trust.mistral.ai/subprocessors
If there are low switching costs, and if there are multiple highly capable models, and if the hardware is openly purchasable (all of these are true), then the price will converge to a reasonable cash flow return on GPUs deployed net of operating expenses of running these data centers.
If they start showing much higher returns on assets, then one of the many infra providers just builds a data center, fills it with GPUs, and rents it out at 5% lower price. This is the market mechanism.
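That entry mechanism can be sketched as a toy model; every number here is hypothetical, chosen only to show how undercutting converges toward a breakeven floor:

```python
# Toy model of the entry mechanism described above: while the incumbent's
# price implies fat margins, an entrant undercuts by 5%. All numbers are
# hypothetical, not real data-center economics.
def breakeven_price(capex, lifetime_years, annual_opex, annual_units):
    """Minimum annual price per unit of compute that recovers costs."""
    amortized_capex = capex / lifetime_years
    return (amortized_capex + annual_opex) / annual_units

def entrant_price(incumbent_price, floor, undercut=0.05):
    """Entrants shave 5% off the incumbent's price, but never sell below cost."""
    return max(incumbent_price * (1 - undercut), floor)

floor = breakeven_price(capex=1_000_000, lifetime_years=4,
                        annual_opex=150_000, annual_units=100)
price = 2 * floor                    # incumbent enjoying a 100% markup
while price > floor * 1.01:          # entry continues until margins are thin
    price = entrant_price(price, floor)
print(price, floor)                  # price converges to the breakeven floor
```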
Looking at who owns the compute is barking up the wrong tree, because it has little moat. Maybe GPU manufacturers would be a better place to look, but then the argument is that you're beholden to NVIDIA's pricing to the hyperscalers. There's some truth to that, but you already see that market position eroding because of TPUs and belatedly AMD. All of these giant companies are looking to degrade Jensen's moat, and they're starting to succeed.
Is the argument here that somehow all the hyperscalers are going to merge to one and there will be only one supplier of compute? How do you defend the idea that nobody else could get compute?
The starting point was that competition would prevent AI providers from doubling the price of tokens, because there's lots of models running on lots of providers.
This is in the context of the article, which paints a world where it would be unreasonable not to spend $250k per head per year on tokens.
My argument is that the current situation is temporary, and _if_ LLMs provide that much value, the market will consolidate into a handful of providers that will be largely free to dictate their prices.
> If they start showing much higher returns on assets, then one of the many infra providers just builds a data center, fills it with GPUs, and rents it out at 5% lower price. This is the market mechanism.
Except when the GPUs, memory, and power are in short supply. When demand outstrips supply, prices go up, and whoever has the deeper pockets, usually the bigger and more established party, wins.
A triopoly can still provide competitive pressure. The Chinese models aren't terrible either. Kimi K2.5 is pretty capable, although noticeably behind Claude Opus. But its existence still helps. The existence of a better product doesn't require you to purchase it at any price.
> The existence of a better product doesn’t require you to purchase it at any price
It does if it means someone using a better model can outpace you. Not spending as much as you can means you don't have a business anymore.
It's all meaningless, ultimately. You're not building anything for anyone if no one has a job.
Because in all of this change we can't be bothered to imagine a world where people have money without jobs? Do you think billionaires are just going to want to stop making more money?
The best bull case for us reaching luxury gay space communism is that people not working, yet having near-infinite capital to buy whatever they want, is the only way the billionaires get to see their pot grow forever.
Have they stopped making a loss yet? They'll all need to raise prices or they'll all go out of business, and now it's a game of chicken.
that worked real well for cloud computing
aws and gcp's margins are legendarily poor
oh, wait
gcp was net negative until last year.
A big part of why clouds are expensive is not necessarily the hardware, but all the software infrastructure and the complexity of all the services.
Maybe not worth using, then. If your product costs 5x and delivers 0.2x of a competing product in the adjacent product class (traditional server/VPS), why use it?
All the big clouds are still in market share acquisition mode. Give it about 5 more years, when they're all in market consolidation and extraction mode.
Cloud providers could indeed abuse vendor lock-in, but LLMs are not that easy to vendor-lock.
By then perhaps it will be possible to continue with local LLMs
>> $170-180k a year, like an average-level senior software engineer in the US.
I hear things like this all the time, but outside of a few major centers it's just not the norm. And no companies are spending anything like $1k / month on remote work environments.
I mean, it's at best an average-level senior engineer salary, not some exorbitant L6 Googler salary.
Median salary for a software engineer in the US is ~$133k:
https://www.bls.gov/ooh/computer-and-information-technology/...
I question their data if their p90 value is $211k
I recognize that not everyone makes big tech money, but that's somewhere between entry and mid level at anywhere that can conceivably be called big tech
Define “senior engineer” though..
I think that is easy to understand for a lot of people but I will spell it out.
This looks like AI-company marketing, something along the lines of "buy 3 for the price of 2".
Money you don't spend on tokens is the only money saved, period.
With employees you have to pay them anyway; you can't just say "these requirements make no sense, park it for two days until I get them right".
You would have to be damn sure that you are doing the right thing to burn $1k a day on tokens.
With humans I can see many reasons why you would pay anyway, and it is on you to provide sensible requirements to build and to make use of employees' time.
OK, but who is saying that to the llm? Another llm?
We got feedback in this thread from someone who supposedly knows Rust about common anti-patterns, and someone from the company came back with 'yeah, that's a problem, we'll have agents fix it.'[0]
Agents are obviously still too stupid to have the metacognition needed to decide when to refactor, even at $1,000 per day per person. So we still need the butts in seats. So we're back at the idea of centaurs. Then you have to make the case that paying an AI more than a programmer is worth it.[1]
[0] which has been my exact experience with multi-agent code bases I've burned money on.
[1] which in my experience isn't when you know how to edit text and send API requests from your text editor.
That nobody wants to actually do it is already a problem, but a basically true thing is that somebody has to pay those $90k junior engineers for a couple of years to turn them into senior engineers.
There seem to be plenty of people willing to pay the AI to do that junior-engineer-level work, so wouldn't it make sense to defect and just wait until the AI has gained enough experience to do the senior engineer work?
> 50 work weeks
What dystopia is this?
This is a simplification to make the calculation more straightforward. But a typical US workplace honors about 11 to 13 federal holidays, roughly two weeks' worth of workdays, which is where the 50 comes from. I assume that an AI does not need a vacation, but can't work 2 days straight autonomously while its human handlers are enjoying a weekend.
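The rounding works out as follows (holiday count taken from the comment above):

```python
# How 52 calendar weeks round down to ~50 working weeks.
WEEKS_PER_YEAR = 52
FEDERAL_HOLIDAYS = 11                # low end of the 11-13 honored by US workplaces
WORKDAYS_PER_WEEK = 5

working_weeks = WEEKS_PER_YEAR - FEDERAL_HOLIDAYS / WORKDAYS_PER_WEEK
print(round(working_weeks, 1))       # 49.8, i.e. roughly 50
```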
There are no human handlers. From the opening paragraph (emphasis mine):
> We built a Software Factory: non-interactive development where specs + scenarios drive agents that write code, run harnesses, and converge without human review.
[Edit] I don't know why I'm being downvoted for quoting the linked article. I didn't say it was a good idea.
I took it as napkin rounding of 365/7 ≈ 52 weeks, because that's the floor you pay an employee regardless of vacation time (in places like my country you'd add an extra month, plus the prorated amount based on how many vacation days the employee has). So it's not that people work 50 weeks per year; it's just a reasonable approximation of what it costs the hiring company.
Looks like standard USA?
It doesn't say 1k per day. Not saying I agree with the statement per se, but it's a much weaker statement than that.
"If you haven't spent at least $1,000 on tokens today per human engineer, your software factory has room for improvement" - how exactly is that a weaker statement?
I would love to see setups where $1000/day is productive right now.
I am one of the most pro-vibe-coding^H^H^H^H-engineering people I know, and I am like "one Claude Code Max at $200/mo and one Codex at $200/mo will keep you super stressed out keeping them busy" (at least before the new generation of models I would hit limits on one but never both; my human inefficiency in tech-leading these AIs was the limit).
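For scale, comparing those two flat-rate subscriptions with the article's $1k/day figure (the working-days-per-month count is an assumption):

```python
# Two max-tier subscriptions (figures from the comment) vs. the article's
# $1k/day token target, over an assumed ~21 working days per month.
CLAUDE_CODE_MAX = 200                # dollars per month
CODEX = 200                          # dollars per month
TOKEN_SPEND_PER_DAY = 1_000          # the article's target
WORKDAYS_PER_MONTH = 21              # assumption

subscriptions = CLAUDE_CODE_MAX + CODEX
api_spend = TOKEN_SPEND_PER_DAY * WORKDAYS_PER_MONTH
print(subscriptions, api_spend)      # 400 vs 21000: a ~50x gap
```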
My read of it was "by today", aka cumulative. But you're right that it can also be read as "just today". The latter is an absurdly strong statement, I agree.
It sounds exaggeratedly crazy.