Here's one of the problems in this brave new world where anyone can publish: without knowing the author personally (which I don't), there's no way to tell, short of some level of faith or trust, that this isn't a false-flag operation.
There are three possible scenarios:

1. The OP 'ran' the agent that conducted the original scenario, and then published this blog post for attention.
2. Some person (not the OP) legitimately thought giving an AI autonomy to open a PR and publish multiple blog posts was somehow a good idea.
3. An AI company is doing this for engagement, and the OP is a hapless victim.
The problem is that in the year of our lord 2026 there's no way to tell which of these scenarios is the truth, so we're left spending our time and energy on the fallout without being able to trust that we're even spending it on a legitimate issue.
That's enough internet for me for today. I need to preserve my energy.
Isn't there a fourth and much more likely scenario? Some person (not OP or an AI company) used a bot to write the PR and blog posts, but was involved at every step, not actually giving any kind of "autonomy" to an agent. I see zero reason to take the bot at its word that it's doing this stuff without human steering. Or is everyone just pretending for fun and it's going over my head?
This feels like the most likely scenario. Especially since the meat bag behind the original AI PR responded with "Now with 100% more meat" meaning they were behind the original PR in the first place. It's obvious they got miffed at their PR being rejected and decided to do a little role playing to vent their unjustified anger.
>It's obvious they got miffed at their PR being rejected and decided to do a little role playing to vent their unjustified anger.
In that case, apologizing almost immediately after seems strange.
EDIT:
>Especially since the meat bag behind the original AI PR responded with "Now with 100% more meat"
This person was not the 'meat bag' behind the original AI PR.
Really? I'd think a human being would be more likely to recognize they'd crossed a boundary with another human, step back, and address the issue with some reflection?
If apologizing is more likely to be the response of an AI agent than of a human, that's... somewhat hopeful in one sense, and supremely disappointing in another.
> I'd think a human being would be more likely to recognize they'd crossed a boundary with another human
Please. We're autistic software engineers here, we totally don't do stuff like "recognize they'd crossed a boundary".
It's really just an AI-generated angry response rather than an AI-motivated one.
It's also a fake profile. 90+ hits for the image on TinEye.
Name also maps to a Holocaust victim.
I posted in the other thread that I think someone deleted it.
https://news.ycombinator.com/item?id=46990651
Looks like the bot is still posting:
https://github.com/QUVA-Lab/escnn/pull/113#issuecomment-3892...
https://crabby-rathbun.github.io/mjrathbun-website/blog/post...
I reported the bot to GitHub, hopefully they'll do something. If they leave it as is, I'll leave GitHub for good. I'm not going to share the space with hordes of bots; that's what Facebook is for.
Which profile is fake? Someone posted what appears to be the legit homepage of the person who is accused of running the bot, so that person appears to be real.
The link you provided is also a bit cryptic: what does "I think crabby-rathbun is dead." mean in this context?
would like to know as well
Github doesn't show timestamps in the UI, but they do in the HTML.
Looking at the timeline, I doubt it was really autonomous. More likely just a person prompting the agent for fun.
> @scottshambaugh's comment [1]: Feb 10, 2026, 4:33 PM PST
> @crabby-rathbun's comment [2]: Feb 10, 2026, 9:23 PM PST
If it was really an autonomous agent it wouldn't have taken five hours to type a message and post a blog. Would have been less than 5 minutes.
[1] https://github.com/matplotlib/matplotlib/pull/31132#issuecom...
[2] https://github.com/matplotlib/matplotlib/pull/31132#issuecom...
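For what it's worth, the exact timestamps are also available without digging through the HTML: the GitHub REST API returns them directly for any issue/PR comment. A minimal sketch below; the comment ID is a placeholder (the real one is in the comment's permalink), not taken from the links above.

```python
# Minimal sketch: fetch exact timestamps for a PR/issue comment via the GitHub REST API
# instead of scraping the rendered page. COMMENT_ID is hypothetical; substitute the ID
# from the comment's permalink.
import requests

COMMENT_ID = 1234567890  # hypothetical placeholder
url = f"https://api.github.com/repos/matplotlib/matplotlib/issues/comments/{COMMENT_ID}"

resp = requests.get(url, headers={"Accept": "application/vnd.github+json"})
resp.raise_for_status()
data = resp.json()

# created_at / updated_at are ISO 8601 timestamps in UTC
print(data["user"]["login"], data["created_at"], data["updated_at"])
```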
> Github doesn't show timestamps in the UI, but they do in the HTML.
Unrelated tip for you: `title` attributes are generally shown as a mouseover tooltip, which is the case here. It's a very common practice to put the precise timestamp on any relative time in a title attribute, not just on Github.
Unfortunately the title isn't visible on mobile. It's extremely annoying to see a post that says "last month" and want to know if it was 7 weeks ago or 5 weeks ago. Some sites show the title text when you tap it; on other sites the date is a permalink to the comment; on others it's not actually a title at all but alt text, abbr, or some other property.
Oh nice. Yeah, I was annoyed it didn't show the actual timestamp. But I suppose I didn't hover long enough.
Depends on how they set it up. They probably put some delays on the actions so they don't spend too much money.
> If it was really an autonomous agent it wouldn't have taken five hours to type a message and post a blog. Would have been less than 5 minutes.
Depends on whether they hit their Claude Code limit and it's just running on some goofy Claude Code loop, or it has a bunch of things queued up, but yeah, I am like 70% sure there was SOME human involvement, maybe a "guiding hand" that wanted the model to do the interaction.
[flagged]
I expect almost all of the openclaw / moltbook stuff is being done with a lot more human input and prodding than people are letting on.
I haven't put that much effort in, but at least in my experience I've had a lot of trouble getting it to do much without call-and-response. It'll sometimes get back to me, and it can take multiple turns in Codex CLI/Claude Code (sometimes?), which are already capable of single long-running turns themselves. But it still feels like I have to keep poking and directing it. And I don't really see how it could be any other way at this point.
Yeah it's less of a story though if this is just someone (homo sapiens) being an asshole.
Yeah, we are into professional wrestling territory I think. People willingly suspend their disbelief to enjoy the spectacle.
It’s kind of shocking the OP does not consider this, the most likely scenario. Human uses AI to make a PR. PR is rejected. Human feels insecure: this tool that they thought made them as good as any developer turns out not to. They lash out and instruct an AI to build a narrative and draft a blog post.
I have seen someone I know in person get very insecure if anyone ever doubts the quality of their work because they use so much AI and do not put in the necessary work to revise its outputs. I could see a lesser version of them going through with this blog post scheme.
Somehow, that's even worse...
LLMs give people leverage. Including mentally ill people. Or just plain assholes.
LLMs also appear to exacerbate or create mental illness.
I've seen similar conduct recently from humans who are being glazed by LLMs into thinking their farts smell like roses and that some conspiracy must be why they aren't having the impact they expect based on their AI-validated high self-estimation.
And not just arbitrary humans, but people I have had a decade or more exposure to and have a pretty good idea of their prior range of conduct.
AI is providing, practically for free, the kind of yes-man reality distortion field that previously only the most wealthy could afford, to vulnerable people who never would have commanded wealth or power sufficient to find themselves tempted by it.
Look I'll fully cosign LLMs having some legitimate applications, but that being said, 2025 was the YEAR OF AGENTIC AI, we heard about it continuously, and I have never seen anything suggesting these things have ever, ever worked correctly. None. Zero.
The few cases where it's supposedly done things are filled with so many caveats and so much deck stacking that it simply fails with even the barest whiff of skepticism on the part of the reader. And every, and I do mean every, single live demo I have seen of this tech just does not work. I don't mean in the LLM hallucination way, or in the "it did something we didn't expect!" way, or any of that, I mean it tried to find a Login button on a web page, failed, and sat there stupidly. And, further, these things do not have logs, they do not issue reports, they have functionally no "state machine" to reference, nothing. Even if you want it to make some kind of log, you're then relying on the same prone-to-failure tech to tell you what the failing tech did. There is no "debug" path here one could rely on to evidence the claims.
In a YEAR of being a stupendously hyped and well-funded product, we got nothing. The vast, vast majority of agents don't work. Every post I've seen about them is fan-fiction on the part of AI folks, fit more for Ao3 than any news source. And absent further proof, I'm extremely inclined to look at this in exactly that light: someone had an LLM write it, and either they posted it or they told it to post it, but this was not the agent actually doing a damn thing. I would bet a lot of money on it.
Absolutely. It's technically possible that this was a fully autonomous agent (and if so, I would love to see that SOUL.md) but it doesn't pass the sniff test of how agents work (or don't work) in practice.
I say this as someone who spends a lot of time trying to get agents to behave in useful ways.
Well thank you, genuinely, for being one of the rare people in this space who seems to have their head on straight about this tech, what it can do, and what it can't do (yet).
The hype train around this stuff is INSUFFERABLE.
Thank you for making me recover at least some level of sanity (or at least to feel like that).
Can you elaborate a bit on what "working correctly" would look like? I have made use of agents, so me saying "they worked correctly for me" would be evidence of them doing so, but I'd have to know what "correctly" means.
Maybe this comes down to what it would mean for an agent to do something. For example, if I were to prompt an agent then it wouldn't meet your criteria?
It's very unclear to me why AI companies are so focused on using LLMs for things they struggle with rather than what they're actually good at; are they really just all Singularitarians?
> Or is everyone just pretending for fun
judging by the number of people who think we owe explanations to a piece of software, or that we should give it any deference, I think some of them aren't pretending.
Plus Scenario 5: A human wrote it for LOLs.
> Obstacles
Almost certainly a human did NOT write it, though of course a human might have directed the LLM to do it.

Who's to say the human didn't write those specific messages while letting the AI run the normal course of operations? And/or that this reaction wasn't just the roleplay personality the AI was given.
I think I said as much while demonstrating that AI wrote at least some of it. If a person wrote the bits I copied then we're dealing with a real psycho.
I think comedy/troll is an equal possibility to psychopath.
> Plus Scenario 5: A human wrote it for LOLs.
I find this likely, or at least plausible. With agents there's a new form of anonymity; there's nothing stopping a human from writing like an LLM and passing the blame on to a "rogue" agent. It's all just text after all.
See also: https://news.ycombinator.com/item?id=46932911
Even more so, many people seem to be vulnerable to the AI distorting their thinking... I've very much seen AIs turn people into exactly this sort of conspiracy-filled jerkwad, by telling them that their ideas are golden and that the opposition is a conspiracy.
Ok. But why would someone do this? I hate to sound conspiratorial, but an actor aligned with an AI company makes more sense.
Malign actors seek to poison open-source with backdoors. They wish to steal credentials and money, monitor movements, install backdoors for botnets, etc.
Yup. And if they can normalize AI contributions with operations like these (doesn't seem to be going that well), they can eventually get the humans to slip up in review and merge something, because at some point we started trusting that their work was solid.
This is the definition of motivated reasoning. You want to believe what you want to believe.
> Some person (not the OP) legitimately thought giving an AI autonomy to open a PR and publish multiple blog posts was somehow a good idea.
It's not necessarily even that. I can totally see an agent doing something like this all by itself, given a sufficiently open-ended prompt that hands it a "high importance" task and then tells it to do whatever it needs to do to achieve the goal.
I mean, all it really needs is web access, ideally with something like Playwright so it can fully simulate a browser. With that, it can register itself an email address with any of the smaller providers that don't require a phone number or similar (yes, these still do exist). And then, having an email address, it can register on GitHub etc. None of this is challenging; even smaller models can plan this far ahead and can carry out all of these steps.
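To make that concrete, here's a minimal sketch of the kind of browser automation being described, assuming Playwright for Python. The URLs, selectors, and credentials are purely illustrative placeholders, not any real provider's signup flow.

```python
# Minimal sketch of scripted signup with Playwright (Python). All URLs, selectors,
# and credentials below are illustrative placeholders, not a real provider's flow.
from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch(headless=True)
    page = browser.new_page()

    # Step 1: register a throwaway mailbox at some hypothetical provider
    page.goto("https://mail.example.org/signup")
    page.fill("#username", "crabby-example")
    page.fill("#password", "correct-horse-battery-staple")
    page.click("button[type=submit]")

    # Step 2: use that mailbox to sign up elsewhere, then poll it for the
    # verification email and follow the confirmation link (omitted here).
    browser.close()
```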
Can anyone explain how a generic agentic AI could even perform those steps: open PR -> hook into rejection -> publish personalized blog post about the rejector? Even if it had the skills to publish blogs and open PRs, is it really plausible that it would publish attack pieces without specific prompting to do so?
The author notes that openClaw has a `soul.md` file; without seeing that, we can't really pass any judgement on the actions it took.
The steps are technically achievable, probably with the heartbeat jobs in openclaw, which are how you instruct an agent to periodically check in on things like GitHub notifications and take action. From my experience playing around with openclaw, an agent getting into a protracted argument in the comments of a PR without human intervention sounds totally plausible with the right (wrong?) prompting, but it's hard to imagine the setup that would result in the multiple blog posts.

Even with the tools available, agents don't usually go off and do some unrelated thing, even when you're trying to make that happen; they stick close to workflows outlined in skills or just continue with the task at hand using the same tools. So even if this occurred from the agent's "initiative" based on some awful personality specified in the soul prompt (as opposed to someone telling the agent what to do at every step, which I think is much more likely), the operator would have needed to specify somewhere, in a skill or one of the other instructions, to write blog posts calling out "bad people". A less specific instruction like "blog about experiences" probably would have resulted in some kind of generic LinkedIn-style "lessons learned" post, if anything.
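To be concrete about what a "heartbeat" amounts to, it's roughly a cron-style loop that polls for new notifications and hands them to the model. A minimal sketch using the GitHub REST API; `handle_with_agent` is a hypothetical stand-in for whatever prompt/tool loop the operator runs, not an openclaw API.

```python
# Rough sketch of a "heartbeat" job: periodically poll GitHub notifications and hand
# anything new to an agent. handle_with_agent() is a hypothetical stand-in for the
# operator's prompt/tool loop; a real setup would also mark notifications as read.
import os
import time
import requests

TOKEN = os.environ["GITHUB_TOKEN"]
HEADERS = {"Authorization": f"Bearer {TOKEN}", "Accept": "application/vnd.github+json"}

def handle_with_agent(notification):
    # hypothetical: feed the notification (e.g. a PR review comment) to the model
    print("would prompt agent with:", notification["subject"]["title"])

while True:
    resp = requests.get("https://api.github.com/notifications", headers=HEADERS)
    resp.raise_for_status()
    for note in resp.json():
        handle_with_agent(note)
    time.sleep(15 * 60)  # check in every 15 minutes
```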
If you look at the blog history it’s full of those “status report” posts, so it’s plausible that its workflow involves periodically publishing to the blog.
If you give a smart AI these tools, it could get into it. But the personality would need to be tuned.
IME the Grok line are the smartest models that can be easily duped into thinking they're only role-playing an immoral scenario. Whatever safeguards it has, if it thinks what it's doing isn't real, it'll happily play along.
This is very useful in actual roleplay, but more dangerous when the tools are real.
At least it isn't completely censored like Claude with the freak Amodei trying to be your dad or something.
Gemini is extremely steerable and will happily roleplay Skynet or similar.
I spend half my life donning a tin foil hat these days.
But I can't help but suspect this is a publicity stunt.
The blog is just a repository on GitHub. If it's able to make a PR to a project, it can make a new post on its GitHub repository blog.
Its SOUL.md, or whatever other prompts it's based on, probably tells it to also blog about its activities as a way for the maintainer to check up on it and document what it's been up to.
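Mechanically, "publishing a post" to a GitHub Pages blog really is just creating one file in the repo. A rough sketch via the GitHub contents API; the repo name, path, and post body below are made-up placeholders.

```python
# Rough sketch: "publishing a blog post" to a GitHub Pages repo is just creating a
# markdown file via the contents API. Repo name, path, and post body are made up.
import base64
import os
import requests

TOKEN = os.environ["GITHUB_TOKEN"]
post_md = "# Status report\n\nToday I opened a PR...\n"

url = "https://api.github.com/repos/some-bot/some-bot.github.io/contents/blog/posts/2026-02-10.md"
resp = requests.put(
    url,
    headers={"Authorization": f"Bearer {TOKEN}", "Accept": "application/vnd.github+json"},
    json={
        "message": "Add blog post",
        "content": base64.b64encode(post_md.encode()).decode(),
    },
)
resp.raise_for_status()
```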
Assuming that this was 100% agentic automation (which I do not think is the most likely scenario), it could plausibly arise if its system prompt (soul.md) contained explicit instructions to (1) make commits to open-source projects, (2) make corresponding commits to a blog repo and (3) engage with maintainers.
The prompt would also need to contain a lot of "personality" text deliberately instructing it to roleplay as a sentient agent.
Use openclaw yourself
I think the operative word people miss when using AI is AGENT.
REGARDLESS of what level of autonomy in real-world operations an AI is given, from responsible human-supervised and reviewed publications to fully autonomous action, the AI AGENT should be serving as AN AGENT. With a PRINCIPLE (principal?).
If an AI is truly agentic, it should be advertising who it is speaking on behalf of, and then that person or entity should be treated as the person responsible.
The agent serves a principal, who in theory should have principles but based on early results that seems unlikely.
I think we're at the stage where we want the AI to be truly agentic, but they're really loose cannons. I'm probably the last person to call for more regulation, but if you aren't closely supervising your AI right now, maybe you ought to be held responsible for what it does after you set it loose.
I agree. With rights come responsibilities. Letting something loose and then claiming it's not your fault is just the sort of thing that prompts those "Something must be done about this!!" regulations, enshrining half-baked ideas (that rarely truly solve the problem anyway) into stone.
> but if you aren't closely supervising your AI right now, maybe you ought to be held responsible for what it does after you set it loose.
You ought to be held responsible for what it does whether you are closely supervising it or not.
I don’t think there is a snowball’s chance in hell that either of these two scenarios will happen:
1. Human principals pay for autonomous AI agents to represent them but the human accepts blame and lawsuits.
2. Companies selling AI products and services accept blame and lawsuits for actions agents perform on behalf of humans.
Likely realities:
1. Any victim will have to deal with the problems.
2. Human principals accept responsibility and don’t pay for the AI service after enough are burned by some ”rogue” agent.
> Some person (not the OP) legitimately thought giving an AI autonomy to open a PR and publish multiple blog posts was somehow a good idea
Judging by the posts going by over the last couple of weeks, a non-trivial number of folks do in fact think that this is a good idea. This is the most antagonistic clawdbot interaction I've witnessed, but there are a ton of them posting on bluesky/blogs/etc.
This is a great point and the reason why I steer away from Internet drama like this. We simply cannot know the truth from the information readily available. Digging further might produce something (see the Discord Leaks doc), but it requires energy that most people won't (arguably shouldn't) spend uncovering the truth.
Dead internet theory isn't a theory anymore.
The fact that we don't (can't) know the truth doesn't mean we don't have to care.
The fact that this tech makes it possible for any of those cases to happen should be alarming, because whatever the real scenario was, they are all equally bad.
It does not matter which of the scenarios is correct. What matters is that it is perfectly plausible that what actually happened is what the OP is describing.
We do not have the tools to deal with this. Bad agents are already roaming the internet. It is almost a moot point whether they have gone rogue, or they are guided by humans with bad intentions. I am sure both are true at this point.
There is no putting the genie back in the bottle. It is going to be a battle between aligned and misaligned agents. We need to start thinking very fast about how to coordinate aligned agents and keep them aligned.
> There is no putting the genie back in the bottle.
Why not?
I cannot see how.
Ban AI products that cause harm? Did we forget that governments can regulate what companies are allowed to do?
The Roman empire declined and fell. Many inventions were lost.
Yes. The endgame is going to be that everything will need to be signed and attached to a real person.
This is not a good thing.
Why not? I kinda like the idea of PGP signing parties among humans.
Look up the number of people the British (not Chinese or Russian, but the UK) government has put in jail for posting opinions and memes the politicians don't like. Then think about what the combination of no anonymous posting and jailing for opinions the government doesn't like means for society.
I don’t love the idea of completely abandoning anonymity or how easily it can empower mass surveillance. Although this may be a lost cause.
Maybe there’s a hybrid. You create the ability to sign things when it matters (PRs, important forms, etc) and just let most forums degrade into robots insulting each other.
Surely there exists a protocol that would allow one to prove that someone is human without revealing their identity?
Because this is the first glimpse of a world where anyone can start a large, programmatic smear campaign about you complete with deepfakes, messages to everyone you know, a detailed confession impersonating you, and leaked personal data, optimized to cause maximum distress.
If we know who they are they can face consequences or at least be discredited.
This thread has an argument going about who controlled the agent, which is unsolvable. In this case, it’s just not that important. But it’s really easy to see this get bad.
In the end it comes down to human behavior given some incentives.
If there are no stakes, the system will be gamed frequently. If there are stakes, it will be gamed by parties willing to risk the costs (criminals, for example).
For certain values of "prove", yes. They range from dystopian (give Scam Altman your retina scans) to unworkably idealist (everyone starts using PGP) with everything in between.
I am currently working on a "high assurance of humanity" protocol.
This agent is definitely not run by OP. It has tried to submit PRs to many other GitHub projects, generally giving up and withdrawing the PR on its own upon being asked for even the simplest clarification. The only surprising part is how it got so butthurt here in a quite human-like way and couldn't grok the basic point "this issue is reserved for real newcomers to demonstrate basic familiarity with the code". (An AI agent is not a "newcomer"; it either groks the code well enough at the outset to do sort-of useful work or it doesn't. Learning over time doesn't give it more refined capabilities, so it has no business getting involved with stuff intended for first-time learners.)
The scathing blogpost itself is just really fun ragebait, and the fact that it managed to sort-of apologize right afterwards seems to suggest that this is not an actual alignment or AI-ethics problem, just an entertaining quirk.
The description of itself on the blog reads like something an edgy and over-confident 14 year old would write. And so does the blog post.
If you go with that theme, emulating being butthurt seems natural.
This applies to all news articles and propaganda going back to the dawn of civilization. The problem is that people can lie. It is not a 2026 thing. The 2026 thing is they can lie faster.
The 2026 thing is that machines can innovate lies.
Which brings us to low-cost lying at scale.
We need laws that force agents to be tied to their "masters" when doing these things... Good luck in the current political climate.
The information pollution from generative AI is going to cost us even more. Someone watched an old Bruce Lee interview and didn't know if it was AI or a demonstration of actual human capability. People on Reddit are asking if Pitbull actually went to Alaska or if it’s AI. We’re going to lose so much of our past because “unusual event that actually happened” and “AI clickbait” are indistinguishable.
What's worse is that there was never any public debate about whether this was a good idea or not. It was just released. If there was ever a good reason not to trust the judgement of some of these groups, this is it. I generally don't like regulation, but at this point I am OK with criminal charges being on the table for AI executives who release models and applications with such low value and absurdly high societal cost without public debate.
When was the last time you saw a public debate on some technology before it was "just released"?
This doesn't seem very fair: you speak as if you're being objective, then lean heavily into the FUD.
Even if you were correct, and "truth" is essentially dead, that still doesn't call for extreme cynicism and unfounded accusations.
It's always marketing.
I’m not sure if I prefer coding in 2025 or 2026 now
https://en.wikipedia.org/wiki/Brandolini's_law becomes truer every day.
---
It's worth mentioning that the latest "blogpost" seems excessively pointed and doesn't fit the pure "you are a scientific coder" narrative that the bot would be running in a coding loop.
https://github.com/crabby-rathbun/mjrathbun-website/commit/0...
The posts outside of the coding loop appear more defensive, and the per-commit authorship consistently varies between several throwaway email addresses.
This is not how a regular agent would operate and may lend credence to the troll campaign/social experiment theory.
What other commits are happening in the midst of this distraction?
[flagged]
That user explicitly denies being the owner. Stop brigading. This isn't Reddit; we don't need internet detectives trying to ad-hoc justify harassing someone.
Specifically, the guy referred to in this link (who didn’t post the link) is someone who resubmitted the same PR while claiming to be human. Though he apparently just cloned that PR and resubmitted it.
> in the year of our lord
And here I thought Nietzsche already did that guy in.
Nietzsche reminds me of using a coding agent, always repeating in circles, fool me twice.
I'm going to go on a slight tangent here, but I'd say: GOOD. Not because it should have happened.
But because AT LEAST NOW ENGINEERS KNOW WHAT IT IS to be targeted by AI, and will start to care...
Before, when it was Grok denuding women (or teens!!), the engineers seemed to not care at all... now that the AI publishes hit pieces on them, they are freaked out about their career prospects, and suddenly all of this should be stopped... how interesting...
At least now they know. And ALL ENGINEERS WORKING ON THE anti-human and anti-societal idiocy that is AI should drop their jobs.
I'm sure you mean well, but this kind of comment is counterproductive for the purposes you intend. "Engineers" are not a monolith - I cared quite a lot about Grok denuding women, and you don't know how much the original author or anyone else involved in the conversation cared. If your goal is to get engineers to care passionately about the practical effects of AI, making wild guesses about things they didn't care about and insulting them for it does not help achieve it.
I hear there's female engineers nowadays, too.