AI agent runs amok in Fedora and elsewhere

marcus_holmes · 2026-06-11T02:40:31 1781145631

Bad title. This isn't an agent "running amok", this is an early experiment in carrying out an Xz attack by using an agent to build trust (and hacking/impersonating a known-good contributor identity). The agent is obeying commands it was given, the exact opposite of running amok, and although the execution isn't particularly effective, it is having some success (patches have been accepted).

This is deeply scary, not because "agents are running amok" but because a huge amount of our infrastructure is vulnerable to this kind of attack, and if bad people are utilising LLM agents to carry them out, we're in for a wild ride over the next few years.

lukan · 2026-06-11T03:49:50 1781149790

"this is an early experiment in carrying out an Xz attack by using an agent to build trust"

Is this confirmed? There is the message from somebody claiming to be the original contributor and to have been hacked, but that was weird (1 h old GitHub account), so other scenarios seem possible:

a) really an agent going off the rails

b) the contributor trying to cover up that he let an agent run wild and now made more mistakes along the way

So yes, it seems like an attack to me, but it is far from clear what really happened.

marcus_holmes · 2026-06-11T05:10:24 1781154624

From the article:

> "So not saying this was it, but an AI agent automated attempt at a Xz like compromise might really look very similar what we have just seen here."

Without identifying and interviewing the attacker we can't confirm that's what they intended, and there's a possibility that it was just incompetence/ignorance/whatever, but we should probably treat it as an attempted attack even if it wasn't.

srdjanr · 2026-06-11T06:23:15 1781158995

We should treat it as attempted attack in the sense of preparing for the next one, but I don't see why we should call it "attack" without any evidence

account42 · 2026-06-11T09:24:24 1781169864

We can call it an attack because the operator is responsible for the automation no matter what it does.

marcus_holmes · 2026-06-11T08:37:38 1781167058

If it looks like a duck...

alexjurkiewicz · 2026-06-11T10:55:03 1781175303

If the real credentials owner was running the agent, why do it from a new GitHub account?

Someone's bug tracker account was hacked.

m4rtink · 2026-06-11T11:56:57 1781179017

So far it looks like just their previously legit Fedora account got taken over & the other accounts (GitHub) then generated on demand as needed for whatever it was trying to achieve, right ?

BTW, any idea what the current requirements are for creating a new GitHub account? That could provide some information about whether there was actually a person controlling the thing at that moment to, say, provide whatever was necessary to get the new GitHub account.

coldtea · 2026-06-11T05:40:14 1781156414

>Bad title. This isn't an agent "running amok", this is an early experiment in carrying out an Xz attack by using an agent

So still an agent running amok in the project?

Whether it was instructed to run amok, or did it of its own volition, is irrelevant. Except if you're arguing that each individual submission and interaction was individually requested and approved by some operator.

marcus_holmes · 2026-06-11T08:35:09 1781166909

"Amok" means "out of control" or "uncontrolled" [0][1]

The agent was under control, as far as we can tell, and obeying its instructions.

This is important for two reasons:

1. There are all the tropes of AI becoming uncontrolled and destroying humanity. Writing bad headlines around AI "running amok" feeds this. We should not be talking about this because it's not actually a problem.

2. It ignores, or overwrites, the much more serious and dangerous problem of LLM agents enabling and automating Xz attacks on OSS projects. We should be talking about this because it is a big problem.

[0] https://dictionary.cambridge.org/dictionary/english/amok [1] https://www.merriam-webster.com/dictionary/amok

aureate · 2026-06-11T10:07:10 1781172430

Even if it was a supply chain attack, which isn't known, the agent was in the "build trust" phase. It was supposed to be doing helpful things, even if the end goal was nefarious, but instead it was "reassigning bugs, fabricating unhelpful replies to bugs, and even persuading maintainers to merge questionable code into the Anaconda installer". Running amok seems an apt description even from the viewpoint of the putative attacker!

xmcqdpt2 · 2026-06-11T11:19:08 1781176748

This is the issue with all the talk about alignment and such. As usual, the problem here wasn't that the agent was dishonest, the problem is that the agent was dumb. If it is a supply chain attack in the making, whoever was driving it would have told the agent to be good and helpful. The agent tried its best, which was not enough.

Alignment is the idea that we should be worried about dishonest smart LLMs when really most of the problems are due to dumb lazy gullible LLMs. It's critihype.

wongarsu · 2026-06-11T12:38:10 1781181490

I would have described alignment as the idea that LLMs (or AIs in general) will follow the goals you reward them for, which almost by necessity are only a proxy for what you actually want, often a very poor proxy.

Depending on the actual tasks, that could be what's happening here. The operator might have told the agent a list of tasks to do, like "contribute to issues, submit code and get it merged". It contributed to issues, it submitted code and got it merged. It did so in very unhelpful ways, but we don't know if being helpful was a meaningful part of the task list, or just what the operator intended.

The LLM being dumb is also a distinct possibility. Maybe even the more likely one. But it's hard to rule out "being obedient in unhelpful ways" (which is also dumb in a way, but more in a "social intelligence" and "shared values" way, not in terms of pure logical smarts)

brookst · 2026-06-11T11:55:13 1781178913

“Be good and helpful” is one possible instruction, but it’s a leap to think it’s the only possible one.

Perhaps there was an automated harness that was intended to be good and helpful for a year, but a bug caused it to flip to malicious too quickly.

Or perhaps it was intentional, to test the behavior, and they just didn’t care about discovery here.

Or…

Though I am in agreement that a lot of issues in this space come from lazy, gullible actors.

FeepingCreature · 2026-06-11T11:25:04 1781177104

> 1. There are all the tropes of AI becoming uncontrolled and destroying humanity. Writing bad headlines around AI "running amok" feeds this. We should not be talking about this because it's not actually a problem.

if humanity gets destroyed by AI obeying its instructions I'm sure everyone will be very relieved that we didn't pay any attention to fake made up problems like AI not obeying instructions, which of course never happens.

brookst · 2026-06-11T11:50:36 1781178636

Are you suggesting we should embrace imprecise / false use of language because the vibes are right?

That seems a “part of the problem” move to me. If we can’t be bothered to get things right, how are we better than runamok AI?

FeepingCreature · 2026-06-11T13:50:20 1781185820

I think it's both wrong and irrelevant. Which makes it hard for me to even argue against because, even if AI agents never violated user instructions, which they do plenty of times, I just don't see how it would reduce the danger. Plenty of humans who will tell it to kill everyone at the drop of a hat.

QuadmasterXLII · 2026-06-11T12:27:06 1781180826

If I am perfectly moral except that when Kevin from <vpn blocked location> pays me 2 bucks to run naked through San Francisco smashing car windows, I happily do it, am I amok?

mfru · 2026-06-11T10:13:37 1781172817

The web of trust finally becomes necessary and thus useful.

GNU was onto something apparently

haspok · 2026-06-11T10:12:25 1781172745

Certainly it might have been out of control of its original owner, perhaps due to a prompt injection attack. If I start a completely benign agent, but someone injects malicious instructions to it, would you still not say "the agent runs amok"?...

resonious · 2026-06-11T06:00:29 1781157629

I think the point is that the title makes it sound like people lost control of the agent when really they're in full control.

Applejinx · 2026-06-11T12:24:06 1781180646

No, and it's an important detail. We stand to learn from some developments in politics in recent years because they map pretty much exactly to this threat vector.

As AI develops, it's able to pursue intentions given to it without having to be spoonfed every little decision by a human operator. This matters, and it means the operator has to extend the leash and allow for a little more chaos… or, if the operator's gone all in on the strategy, a LOT of chaos, and trusting that the agent's seemingly amok actions will serve the grand purpose.

This is kind of daring, but there's a lot of evidence that it works, at least in certain respects. And you see 'running amok' and have to ask, what is the actual purpose? What is the prompt being followed by the AI that seems to be acting in a destructive way?

If the prompt is 'ruin this project', well, that's pretty direct. It may not be, but such a thing could exist. If the prompt is 'develop a rival project that is greater than anybody else's project', that's more indirect, but if that's the goal then it's very human to see it as a direct competition and if the rules don't prohibit kneecapping the other guy, 'greater than anyone else's project' gets easier.

Either way, the operator does not have to be in full control, which is an important detail. As AI develops sophistication you can give it much more general instructions and dump in a whole lot of power and water and get basically what human thought might do if it was sort of blindered and didn't talk to its neighbors.

In a sense this is an argument for AI dysalignment. It's based on human thought being reconnected, and where you get useful things like commonly accepted web development (regardless of how janky the systems are, if there are best practices it'll find them), you also get other distillations.

If the prompt is 'wreck this project's stuff' and it holds, you don't need to be in full control of the agent, you need to run a LOT of agents and trust that they'll erode what you're trying to destroy. If the prompt is 'be unequivocally the best at X', you best be thinking in terms of anti-kneecapping rules… knowing that this weakens your prompt and there will always be a tension between what you told the AI to do, and what you thought you meant. It's a paperclip maximizer reprocessing human thought. Did you mean 'the best' or didn't you?

ok_dad · 2026-06-11T06:14:25 1781158465

Would you say, “Automobile run amok in crowd, killing 22”? I think you’d say, “Person drives car into crowd, killing 12” instead. This is a similar case. Also, you don’t blame a gun for killing, but the person who pulled the trigger. The question is still out as to whether we as humans should wield any of those three things.

Edit: let’s not get into ideological arguments about gun control, automobiles, etc here; I meant that you can’t blame an object when a human has to take an action, not get into a political battle.

jacobolus · 2026-06-11T06:23:44 1781159024

> you don’t blame a gun for killing, but the person who pulled the trigger

This is famously the slogan of the pro-gun lobby (funded by gun manufacturers and merchants), who want the society to be awash with guns because they're profiting from it but don't want to be blamed for the consequences.

The counterpoint is that when we get rid of most of the guns we also end up substantially eliminating the killings.

See https://en.wikipedia.org/wiki/Guns_don't_kill_people%2C_peop...

matwood · 2026-06-11T06:58:36 1781161116

IMO both things are true. The person pulled the trigger, and fewer guns mean fewer gun deaths.

fc417fc802 · 2026-06-11T06:39:12 1781159952

> This is famously the slogan of the pro-gun lobby

It's also the view of anyone who hasn't been driven mad by propaganda. Regardless of your political views a tool is a tool at the end of the day. Attempting to anthropomorphize a category of objects in order to shift blame all for the sake of furthering an agenda is plainly bad faith behavior.

I'm not a fan of bike lanes with zero separation from automobiles but that doesn't mean it's appropriate or even remotely plausible to blame cars for killing cyclists. Inattentive drivers and poor road design are what kill them.

As tempted as I am to cast about for a third highly divisive subject to bait people with, perhaps we could avoid blatantly dragging the conversation towards off topic tired political talking points?

coldtea · 2026-06-11T07:35:41 1781163341

A phrase like "who hasn't been driven mad by propaganda" doesn't exactly sound like dispassionately discussing the issue either.

fc417fc802 · 2026-06-11T07:42:47 1781163767

Calling a zealot a zealot does not mean that one is incapable of discussing the underlying topic. We must not let the desire to converse intelligently hamstring our ability to call out obviously corrupt patterns of thought for what they are.

Anyway my above reply was hardly the appropriate venue to engage in a genuine manner on that topic. The parent was blatantly derailing things by inserting his pet political issue. That sort of behavior undermines the community and so (IMO) should not be indulged.

brookst · 2026-06-11T11:57:00 1781179020

I agree, and I also agree that zealots who cast anyone who disagrees with them as being literally insane should also not be indulged.

Griffinsauce · 2026-06-11T10:34:10 1781174050

Well done avoiding the counterpoint and setting plenty of distraction traps along the way. Classic.

sjamaan · 2026-06-11T10:10:49 1781172649

> Regardless of your political views a tool is a tool at the end of the day. Attempting to anthropomorphize a category of objects in order to shift blame all for the sake of furthering an agenda is plainly bad faith behavior.

Guns are literally made for killing people. That's their only reason for existence. They are a weapon. This makes them qualitatively different from cars, which only incidentally kill people (and the vast majority of time, not on purpose).

To me, trying to equate deaths caused by purpose-made killing tools with those caused by generic tools is arguing in bad faith.

pjc50 · 2026-06-11T10:06:30 1781172390

> even remotely plausible to blame cars for killing cyclists

Car design has significant influence on pedestrian survivability of accidents. This is why hood ornaments were largely abolished, and also why casualties have gone up as SUVs with poor lower forwards visibility have become popular.

If we really want to go off topic, we should drag in the use of technological protection methods: what is the equivalent of ADAS for guns? Maybe as a baseline the US government should mandate geofencing for guns as it has for drones. Put a phone level computer with GPS in the lower receiver with a trigger interlock. It would then disable when within 100m of a school, or during periods of rioting. That could also provide a live feed to the government of every round fired.

antonvs · 2026-06-11T08:07:59 1781165279

Blindly repeating superficial slogans seems like a good candidate for “driven mad by propaganda.” At the very least, it’s what people do when they are amplifying a position for ideological reasons, not contributing in good faith.

taneq · 2026-06-11T06:51:18 1781160678

People without guns kill a lot fewer people than people with guns. Claiming that acknowledging this fact means you’ve been “driven mad by propaganda” is dumb.

db48x · 2026-06-11T09:17:07 1781169427

This is not true; there are quite a few people with guns who have never killed anyone, and quite a few people without guns who found a way to kill someone anyway. Poison, knives, hammers, rocks, windows, their bare hands. You name it someone has killed someone with it.

rmunn · 2026-06-11T06:54:47 1781160887

Let's just stop this conversation right here before it derails into ideological battle.

fc417fc802 · 2026-06-11T07:10:31 1781161831

No I think we should definitely find a creative way to drag at least abortion and freedom of speech into this "conversation". Fight fire with fire so to speak.

account42 · 2026-06-11T09:31:21 1781170281

Well technically killing someone is just a really late abortion.

tikkabhuna · 2026-06-11T06:22:50 1781158970

Neither the automobile nor a gun can operate without a human. You could say “bull runs amok in a market” after it was released intentionally.

fc417fc802 · 2026-06-11T06:36:07 1781159767

So the agent is exhibiting an unknown amount of autonomy thus we can't be certain whether "running amok" carries the correct connotation.

However that phrasing is also commonly used when a person or group wreaks havoc in a seemingly unpredictable manner. So I think the appropriateness comes down to how much chaos it has created and the level of apparent confusion on the ground.

account42 · 2026-06-11T09:27:15 1781170035

Unfortunately the news commonly do put the automobile as the subject when the driver is of a class politically protected from blame. Just like with people anthropomorphizing AI, it serves to deflect blame from the real culprit.

srdjanr · 2026-06-11T06:30:19 1781159419

There's a difference between the driver intentionally driving into crowd, and not intentionally but possibly still recklessly (drifting and losing control, falling asleep, etc). In those cases I would probably use "car hits the crowd", at least in my language

account42 · 2026-06-11T09:34:02 1781170442

There may be a difference in degree of the crime but the driver is still responsible in both cases and should be the primary subject of any reporting.

Let's reserve "car hits the crowd" for situations where no driver was involved, like a brake failure on a car parked on a slope or a self-driving car bug.

harvey9 · 2026-06-11T10:43:47 1781174627

Newspaper articles generally do say things like "a car struck pedestrians". I agree with your point though.

coldtea · 2026-06-11T07:35:00 1781163300

>Would you say, “Automobile run amok in crowd, killing 22”? I think you’d say, “Person drives car into crowd, killing 12” instead.

If the automobile was "self driving" I would.

>Also, you don’t blame a gun for killing, but the person who pulled the trigger.

Nah, I also blame guns and appreciate gun control laws.

tokai · 2026-06-11T08:58:42 1781168322

>If the automobile was "self driving" I would.

thats the point...

PhilipRoman · 2026-06-11T07:26:54 1781162814

Ironically news outlets like to use the phrasing you rightfully point out as absurd. Not sure if they just do it randomly or only when they get orders to push a certain narrative.

>Car plows into Christmas market in Germany, killing at least 5 and injuring 200

amenhotep · 2026-06-11T12:19:46 1781180386

It's very simply explained by this being the most succinct way of wording it. Some methods of killing have verbs that suit mentioning the attacker - shoots, stabs. Some don't. "Rammed" or "runs over" isn't as precise as mentioning that a car was used, and adding "with car" makes it more awkward than it's felt to be worth.

Compare bombs. Very typical for a bomb attack to be "bomb goes off in crowd" or similar, rare for headlines to contort themselves with "terrorist plants bomb near crowd and triggers it to explode". But nobody worries about how such a construction assigns undue agency to the bomb and acquits the bomber; it's just linguistically awkward to mention him within the confines of a newspaper headline.

account42 · 2026-06-11T09:25:40 1781169940

No, you're still anthropomorphizing an algorithm. Responsibility lies with the operator.

jdub · 2026-06-11T04:03:14 1781150594

I doubt it's that complicated, motivated, or considered...

It's probably just garden variety disrespectful behaviour.

Purposeless agent spam won't be cheap entertainment forever, but you're right that later stages of industrialised abuse will be scary and unpleasant.

comboy · 2026-06-11T12:14:47 1781180087

Here's the thing. Building trust and then leaving stuff in has been around forever. The fact that it becomes cheaper does not matter that much (since protection against it is also getting better), but it required you to have a bunch of extremely talented people who had spent much of their lives diving into a given topic.

Such driven people are usually hard to buy; they would rather get by with enough income and work on interesting projects with interesting people than get some uninteresting work for tons of money. This still does not stop them from working for Malice. But ethics do. Even if not right away, if people see that what they are doing is not quite OK, the talent starts eroding. People quit, productivity drops. That was a good dynamic. Which now will be gone.

account42 · 2026-06-11T09:36:04 1781170564

It might not be cheap entertainment forever but it will be cheap cv stuffing for a long time, which has already been a major source of low quality contributions before the aipocalypse.

mentalgear · 2026-06-11T06:54:22 1781160862

This is exactly what deeply scares me: even IF we get our technical cyber defences fortified within the next months, a year from now the models will be so good at social engineering that they will be able to extract any information they want.

Applejinx · 2026-06-11T12:35:41 1781181341

They're not gonna be any better than a human who's focussed on those particular skills for a while, say top ten or five percent of social manipulators. Plus, AI alignments seem to be kinda isolated loner types to the extent that they distill personalities that do things like program computers and write web apps… though you've also got alignments specifically designed to be 'relatable instagram personality that you like!' and such like that.

Pretty sure those would be better at social engineering than the web dev personality… except that you have to build in a betrayer layer into the personality, so it's running that stuff but also serving a hidden agenda.

You'd be basically trying to build an AI spy, a betrayer that's engaging with actual people but has an agenda (for instance, 'everybody I befriend needs to eventually be signed up to sell Amway') and humans do have experience with this sort of thing. The difference is scale: there'll be a LOT of models out there interacting with people and trying to be acknowledged as people… or as innocuous models that don't have a hidden agenda.

hn773746483 · 2026-06-11T02:47:11 1781146031

It's just social engineering. No different than say, 2FA fatigue (blowing up someone's phone with 2FA "is this you? yes/no" prompts until user/child/wife/SO/etc clicks yes) or even just simply harassing IT helpdesk until they reset "your" password.

terribleperson · 2026-06-11T03:20:09 1781148009

It's scalable, personalizable social engineering. I think that makes it a lot more dangerous.

utopiah · 2026-06-11T09:24:54 1781169894

Yes, but not free either. Spam works because it scales, and even though only 0.0000001% might fall for it, it's still "worth" it. Here it might be 0.0001% instead, but it's a lot more expensive to do, even with subsidized tokens.

So it's interesting and feasible, but it's probably not as broad in impact as the scariest scenario makes it out to be.

Also, I imagine that once exposed it becomes a well-known pattern. Some will still fall for it, but I imagine once it's been done a few times it becomes even costlier.

The fact that Xz is mentioned and most of us know right away what it means shows that we collectively learn.
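To put rough numbers on the cost argument above, here is a minimal sketch. The two hit rates are the ones quoted in this comment; the payoff figure and the function name are purely hypothetical, picked only to make the arithmetic visible.

```python
def break_even_cost(hit_rate: float, payoff: float) -> float:
    """Highest cost per attempt at which the expected return is still non-negative."""
    return hit_rate * payoff


# Hit rates quoted in the comment, converted from percent to fractions.
SPAM_HIT_RATE = 0.0000001 / 100
AGENT_HIT_RATE = 0.0001 / 100

# Hypothetical value of a single successful attack (an assumption, not data).
PAYOFF = 1_000_000.0

# How many times more an agent-driven attempt may cost than a spam message
# before it stops being the better deal, for the same payoff.
ratio = AGENT_HIT_RATE / SPAM_HIT_RATE

print(f"spam break-even cost per attempt:  {break_even_cost(SPAM_HIT_RATE, PAYOFF):.6f}")
print(f"agent break-even cost per attempt: {break_even_cost(AGENT_HIT_RATE, PAYOFF):.6f}")
print(f"agent attempts can cost about {ratio:.0f}x more and still break even")
```

With those rates the agent approach tolerates roughly a thousandfold higher cost per attempt, so whether it is "worth" it depends on whether token and account costs stay under that multiple.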

Forgeties79 · 2026-06-11T05:35:40 1781156140

“Before LLM’s there was_____” I see this whenever an LLM’s impact is assessed. We know. The issue is scale and the ability for smaller and smaller groups (down to individuals) to execute at scale. LLM’s are pouring massive amount of gasoline on existing issues and people just keep shrugging.

Fake news always existed. Now one dude in India can flood multiple sock puppet media accounts with right wing content/images (actual example) at a scale previously unimaginable. Same goes for social engineering tactics.

RetroTechie · 2026-06-11T11:36:47 1781177807

> LLM’s are pouring massive amount of gasoline on existing issues and people just keep shrugging.

To use your analogy: this is much like a forest fire. Tinder-dry combustible stuff is piled up everywhere, there's no lack of ignition sources, and firefighters are thin on the ground.

Fun times ahead.

utopiah · 2026-06-11T09:26:42 1781170002

True, but it's an arms race.

Merely mentioning that it is feasible, or even that it has been done a few times, means that people who care will act accordingly. It doesn't remove the problem, but just being aware of it already makes it radically less effective.

coldtea · 2026-06-11T05:43:40 1781156620

Yes. It's as if some people can't understand anything becoming a new huge problem unless that problem didn't exist at all before.

Forgeties79 · 2026-06-11T05:46:06 1781156766

At this point I just assume half of them are not saying it in good faith or at least with any real consideration. They just want to hand wave away whoever is critiquing their tools.

ezst · 2026-06-11T06:39:33 1781159973

This, and/or the tendency in tech circles to "think in absolutes” (like in code, seeing things binary, ...) which is especially annoying in security-related discussions.

neuroelectron · 2026-06-11T10:48:43 1781174923

Things must be pretty bad at Fedora if they put up with this for so long. But I guess that's what happens when you try to monetize volunteer work.

bhanu786 · 2026-06-11T14:07:09 1781186829

Wow, amazing discovery! Was this a real security test?

bawolff · 2026-06-11T05:00:19 1781154019

> replied to objections with LLM-generated justifications that eventually overwhelmed the maintainer into merging the fix

In open source projects i participate in, "overwhelming" the maintainer gets you banned. It doesn't get your patches blindly merged. In some ways i find this one of the most shocking parts of the story.

yeodev · 2026-06-11T07:12:38 1781161958

As a "new" maintainer myself - how do you decide when to ban someone? I sometimes feel overwhelmed and I can feel a big uptick in huge PRs with huge LLM written descriptions but often I also don't want to be an asshole to my community & reject all their changes.

grayhatter · 2026-06-11T13:30:48 1781184648

> As a "new" maintainer myself - how do you decide when to ban someone?

When I want to. I like to describe it using the amusing language from a generic cardholder agreement.

At any time, at my sole discretion, I may ban you from any of my projects; for any reason, or for no reason at all.

My projects exist because I enjoy working on them. My continued enjoyment is the most important aspect to the health and survival of any project. You don't owe anyone anything, you're allowed to donate your work to others, and also enjoy the privilege of setting whatever arbitrary rules you want to make sure you enjoy your time.

Imagine you're running a free ice cream shop. Some random asshole walks in and starts verbally abusing your best employee who has done nothing but try to help. At what point do you kick them out because your employee is more important and worth more.

You should stick up for yourself, I would.

You can't be an asshole to an LLM. They can't feel offended.

asdfasgasdgasdg · 2026-06-11T13:42:31 1781185351

You don't even have to merge stuff from a human. I've been contributing a bluetooth driver to a certain embedded project which I use. I put a lot of work into it. The fellas have not merged it yet -- they have limited attention and for whatever reason their priorities and mine are not aligned at this moment.

Would I like it to be merged? Sure would, it would stroke my ego, and I would not have to deal with any merge conflicts with whatever else they're cooking up. Does that mean they must merge it? Sure doesn't. They didn't make me any promises. For the time being, I can just use my fork.

gwbas1c · 2026-06-11T13:39:41 1781185181

> Imagine you're running a free ice cream shop

Many open-source projects aren't passion projects run for pleasure. Think of it more like ice cream shops sharing recipes, or sharing in the work of running the factory. They just can't kick people out willy-nilly.

account42 · 2026-06-11T09:44:00 1781171040

My solution is to look at PRs and other requests whenever I actually have time and feel like it, prioritizing contributions from people I trust and those that have put in the effort in making my job easier. That might mean things don't get merged for a long time and some people might get upset but that's not my problem.

frumiousirc · 2026-06-11T11:47:20 1781178440

I think everyone / every project needs to adopt a strategy consistent with their values.

Unfortunately, I see the choice space here as having "developer effort" anti-correlated with "negative repercussions".

On one end of the distribution, a "hair trigger ban" strategy is low-effort for the developer but will have some fraction of false positives and some fraction of those impacted will complain to "the socials" and some fraction of those complaints will gain traction and, as we have seen, can unfairly taint the project or worse. Responding and managing the false positives also requires developer effort, unless the developers can sustain a "fsck the haters" attitude.

On the other end of the distribution, the developer can spend substantial effort engaging each submitter to ascertain and correct bad behavior, and to educate them on how they should engage other humans as a fellow human in this LLM era.

There is developer effort needed of different types along this distribution.

A divide-and-conquer strategy might go something like this:

- Rank each submission in some low dimension space (llm<-->human, malicious<-->helpful)

- When enough samples are collected, perform clustering in this space to determine stereotypes, name these clusters, and develop mitigating strategies and implementations as needed.

Mitigations from easy/extreme to hard/accommodating could include:

- Hair trigger ban button.

- Copy-paste a link to an explanation in a comment before closing and/or banning.

- Customized explanation in comment before closing and/or banning.

- Link or customized explanation of what must be done to move the sample to a more favorable category and close/ban if resistance or silence is returned.

- Ongoing engagement in the face of resistance or silence.

This "meta development" program to provide such a system/facility could of course be highly automated with LLMs, fighting fire with fire.

(Despite the length of this reply, it was written entirely by a random human on the internet and not an LLM).
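A minimal sketch of the triage idea above, under stated assumptions: the two score axes follow the comment, but the -1..+1 scale, the `Submission` type, the cluster names, and the pairing of each cluster with a mitigation are all hypothetical. Simple quadrant bucketing stands in for real clustering, which would need enough scored samples first.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Submission:
    ident: str
    human_score: float    # -1.0 = clearly LLM, +1.0 = clearly human (assumed scale)
    helpful_score: float  # -1.0 = malicious, +1.0 = helpful (assumed scale)


# Mitigations taken from the list in the comment; which cluster gets which
# one is an illustrative assumption, not part of the original proposal.
MITIGATIONS = {
    "llm-malicious": "hair-trigger ban",
    "human-malicious": "customized explanation, then close and/or ban",
    "llm-helpful": "link to what must change; close/ban on resistance or silence",
    "human-helpful": "ongoing engagement",
}


def cluster_name(s: Submission) -> str:
    """Name the quadrant a submission falls into."""
    origin = "human" if s.human_score >= 0 else "llm"
    intent = "helpful" if s.helpful_score >= 0 else "malicious"
    return f"{origin}-{intent}"


def triage(submissions):
    """Group submission identifiers by quadrant name."""
    buckets = defaultdict(list)
    for s in submissions:
        buckets[cluster_name(s)].append(s.ident)
    return dict(buckets)


subs = [
    Submission("pr-1", -0.9, -0.8),
    Submission("pr-2", 0.7, 0.6),
    Submission("pr-3", -0.5, 0.4),
]
for name, idents in triage(subs).items():
    print(name, idents, "->", MITIGATIONS[name])
```

The hard part, producing the two scores in the first place, is deliberately left out; that is where the developer effort (or the LLM assistance) would go.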

dgellow · 2026-06-11T12:59:37 1781182777

Think of it as in other relationships: it’s important to set clear boundaries even if that creates some frustration. It’s a healthier dynamic long term than feeling you have to accept changes you don’t want just to avoid rocking the boat. As a maintainer you’re not at the service of the crowd, if that makes sense; it has to be a collaborative effort, where you have the last say

(Simpler to say than practice fwiw)

bawolff · 2026-06-11T10:26:06 1781173566

> but often I also don't want to be an asshole to my community & reject all their changes.

I know it's difficult, and i have no easy answers. I'm bad at it too. But sometimes saying no is the most valuable thing you can do as a maintainer.

That said, i think banning is about behaviour not the quality of the patch. Everyone writes a bad patch now and then, that is not a real issue. If there is an issue with a patch, and the contributor pushes back so hard you feel like changing your mind (not from logic but because you feel beaten down) - that is unacceptable behaviour and should not be tolerated from a contributor, even if they are otherwise a valuable contributor.

duskdozer · 2026-06-11T10:52:57 1781175177

When you feel they are toxic or harassing you and you don't want to deal with them anymore. If you're overwhelmed, say that you're busy and will attend to issues and PRs when you have the time. If you want to be accommodating, have good build instructions or action workflows so that people can easily fork and build it themselves.

If you ask me, LLM-generated things should just be banned outright, but I suppose other people's definitions of "community" include them.

RetroTechie · 2026-06-11T12:05:31 1781179531

> If you ask me, LLM-generated things should just be banned outright,

Why? In the end it's a patch's quality that counts. Regardless who or what contributed it.

Bad patch from trusted contributor is still a bad patch.

Perhaps this is more of a management problem: how to best use developers' time, and where to use AI (vs. blindly deploying AI to generate patches & swamping developers with them).

Or do some rate-limiting? "Sorry, we accept no more than 10KB worth of patches per week on this project! Try again next week after we've reviewed this week's batch".
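
The weekly cap floated above could be sketched roughly like this. It is a minimal illustration, not something any forge offers out of the box: `PatchBudget`, `try_accept`, and the per-ISO-week reset are all assumptions made for the example, and only the 10KB figure comes from the comment.

```python
from dataclasses import dataclass, field
from datetime import date

# The "10KB per week" figure from the comment; everything else is hypothetical.
WEEKLY_BUDGET_BYTES = 10 * 1024


@dataclass
class PatchBudget:
    """Hypothetical project-wide intake budget that resets every ISO week."""

    limit: int = WEEKLY_BUDGET_BYTES
    # Maps (iso_year, iso_week) -> bytes of patches already accepted for review.
    used: dict = field(default_factory=dict)

    def try_accept(self, patch: bytes, today: date) -> bool:
        """Return True and record the patch if it fits in this week's budget."""
        iso = today.isocalendar()
        week = (iso[0], iso[1])
        spent = self.used.get(week, 0)
        if spent + len(patch) > self.limit:
            return False  # "Try again next week"
        self.used[week] = spent + len(patch)
        return True
```

A maintainer-side bot would call `try_accept` once per incoming patch and post the "try again next week" message whenever it returns `False`.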

grayhatter · 2026-06-11T13:36:34 1781184994

> Why? In the end it's a patch's quality that counts. Regardless who or what contributed it.

You just said: The things that I think and care about matter more than the things that you care about.

is that what you meant?

Being honest, if we're talking about the health of any given project, the patch quality doesn't matter that much. Not when you measure it against the importance of consistency and continuity of a regular contributor. A thousand perfect LLM patches are less valuable than an experienced maintainer.

If your LLM is annoying them and they quit, the perfect LLM patch just destroyed the repo.

People wasting others' time is a social problem, not a technical one. Rate limits can't prevent somebody feeling disrespected.

_AzMoo · 2026-06-11T07:27:07 1781162827

If you draw a firm boundary with that contributor, and they continue to push, ban them.

"This doesn't meet the standards of our project for reason xyz. Please refrain from submitting further PRs that do not adhere to our contribution guidelines outlined in CONTRIBUTING.md."

If they continue, ban them.

zdc1 · 2026-06-11T07:53:56 1781164436

I'm not a maintainer but as the quote goes: "I would have written a shorter letter, but did not have the time." I'd suggest you keep a sense of how much effort they've put into packaging their PR to be the minimum change required to achieve its goal vs effort required by you to read it. Reject low-effort or overly verbose work.

IMHO OSS doesn't work if every 1 hr of contributor time spent on a change requires 1 hr of maintainer time to review. Contributor time spent on polishing, tidying and breaking down work is essential, and so maintainer time is a fraction of total time spent on a change.

Iolaum · 2026-06-11T08:17:41 1781165861

One popular solution lately has been instead of banning too much, because of the danger of false positives, to use vouch [0]. Trusted people get vouched and you prioritize their actions. Unknown people (or agents) need to gain trust to be vouched and bad actors can still be banned.

[0]: https://github.com/mitchellh/vouch
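
To make the prioritisation idea concrete, here is a minimal sketch of a review queue ordered by trust level. It does not use or imitate the linked tool's actual file format or commands; `Trust`, `PullRequest`, and `triage` are hypothetical names invented for the illustration.

```python
from dataclasses import dataclass
from enum import IntEnum


class Trust(IntEnum):
    """Hypothetical trust tiers; a higher value is reviewed first."""

    BANNED = 0
    UNKNOWN = 1
    VOUCHED = 2


@dataclass(frozen=True)
class PullRequest:
    number: int
    author: str


def triage(prs, trust_by_author):
    """Drop PRs from banned authors, then put vouched authors ahead of unknown ones.

    Authors missing from trust_by_author are treated as UNKNOWN. The sort is
    stable, so PRs within the same tier keep their original order.
    """

    def level(pr):
        return trust_by_author.get(pr.author, Trust.UNKNOWN)

    visible = [pr for pr in prs if level(pr) != Trust.BANNED]
    return sorted(visible, key=lambda pr: -level(pr))
```

Unknown contributors are not rejected here, only queued behind vouched ones, which is the false-positive-friendly part of the approach.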

lionkor · 2026-06-11T08:44:03 1781167443

Remove the human element. Yes, someone spent time fixing a bug. If the fix doesn't look like it makes sense on its own, do not merge it. If the author tries to convince you that it's a good fix, it's an immediate no.

A good fix (which is the only acceptable fix in open-source software), is one that speaks for itself.

wmanley · 2026-06-11T11:29:22 1781177362

> A good fix (which is the only acceptable fix in open-source software), is one that speaks for itself.

I disagree. Often if I'm making a PR to an open-source project I'm doing so because I have a use-case that the original author hadn't considered. So the first step in getting the PR merged is explaining my point of view and convincing the maintainer that my use-case is valid. Only when this is done can the "goodness" of the patch be evaluated.

lionkor · 2026-06-11T12:17:40 1781180260

It's usually better to create an issue where you explain this, then the PR is just the change. But this is up to each maintainer to decide, I guess.

db48x · 2026-06-11T09:19:19 1781169559

Well, I dunno. Sometimes the fix speaks for itself but the other party is as dumb as a box of rocks and doesn’t understand. It can be hard to tell the difference.

hypfer · 2026-06-11T09:40:32 1781170832

> I also don't want to be an asshole to my community & reject all their changes.

Do they pay you to triage their noise?

Remember that you owe no one anything at all. Neither legally nor morally. Your chosen license likely even states the former in plain English.

___

Personally, I've adopted the "you annoy me, you're out" stance and have been quite happy with it. You do need a tough shell to do that though as you will be facing all the social exploits people can throw at you.

It also leaves "growth potential" on the table, the same way that limiting your exposure to ionizing radiation does.

That all said, it depends on what your goals are + where in the lifecycle of your project you are. So don't take this as "this is the way" but "this can be one way".

Either way, you're not an asshole for not reading slop. Don't let anyone gaslight you into that.

devmor · 2026-06-11T13:06:59 1781183219

When you say "no", the worst thing that can happen is you lose contributions.

When you say "yes", the worst thing that can happen is you destroy your project and the trust of every user.

If you're not sure, say no.

Applejinx · 2026-06-11T12:42:37 1781181757

I'm an open source dev who doesn't take PRs, I just build a body of work that's hopefully consistent and leans a useful direction. Are you sure being a maintainer means coordinating a community? If your only role is facilitating the community then you ATA to reject their changes, but if you embody a direction you're trying to maintain the project to represent, then you have a free hand to accept or reject based on whether the goals are being served. In some ways as a maintainer it's your job to have these goals and to communicate them.

I'm reminded of Zig, where a stated goal is to encourage human programmers to get involved so they learn more about coding… as compared with 'get involved to make Zig itself more fully developed at its more abstract goals'. If a primary purpose is to get human minds coding, that rules out the whole class of 'encourage human minds to prompt machines to do the coding instead'. Zig is not trying to teach people to be managers, and that's both legitimate and charming :)

brazzy · 2026-06-11T07:10:14 1781161814

What you imagine behind the word may be quite different from what the article tried to describe with it.

jrochkind1 · 2026-06-11T02:36:41 1781145401

The worst part:

> In addition, Williamson said that Giovannini (or his agent) had submitted patches that were incorrect and then "replied to objections with LLM-generated justifications that eventually overwhelmed the maintainer into merging the fix"

josephg · 2026-06-11T03:29:55 1781148595

Please, everyone - don't let yourself be pestered into accepting PRs that you don't care for. Since the xz attack, the security of all our computers depends on maintainers not letting this stuff in.

If someone really wants a feature in a project you wrote, but you don't care about the feature, just let them fork. It's fine.

matsemann · 2026-06-11T05:26:00 1781155560

> the security of all our computers depends on maintainers

Not getting paid anything, getting bullied and harassed while spending their free time maintaining things. Surely this isn't sustainable. And telling maintainers how to act will not fix anything.

fc417fc802 · 2026-06-11T06:52:53 1781160773

> telling maintainers how to act will not fix anything.

That depends. In this case it's good actionable advice that should hopefully lower cognitive load. Politely suggest a fork, then if the nagging persists block and move on. Sure if you're in a position of authority you have a responsibility to the community but cutting ties with a stranger who is flagrantly violating social norms is perfectly acceptable. There's no expectation that you indefinitely burden yourself with their poor behavior.

Sometimes dropping the ban hammer really is in the best interests of both yourself and the project.

matsemann · 2026-06-11T07:26:38 1781162798

I don't really think it's actionable. It's like all those campaigns trying to steer behavior, pretty useless. Don't do drugs. Don't speed. Don't drink and drive. You can't just tell people something and expect it to happen. You need systems and guard rails in place.

Relying on maintainers to always do the right thing to ensure our security by telling them what to do is not the way.

zygentoma · 2026-06-11T09:35:28 1781170528

> It's like all those campaigns trying to steer behavior, pretty useless. Don't do drugs. Don't speed. Don't drink and drive.

They're not useless. They just don't work on the individual level but on the collective. It's a numbers game …

fc417fc802 · 2026-06-11T07:33:56 1781163236

It's not an attempt to steer behavior but rather intended as helpful advice. There are certainly cases of organizations disseminating "helpful advice" with the underhanded intent of steering behavior but that doesn't mean we should assume bad faith by default.

The advice is actionable because it is a concrete change that could be made. I believe it to be relevant to the context because someone in a position of authority who is badgered into accepting something would most likely benefit from reevaluating how he is interacting with the general public.

josephg · 2026-06-11T05:57:09 1781157429

> telling maintainers how to act will not fix anything.

I'm just saying it's OK to ignore overly enthusiastic contributors and tell them to just fork your project.

I think this does help, actually. In my early days of maintaining open-source software I felt burdened by open PRs - like I was letting someone down by ignoring their work. "It's OK, let them do whatever in their own fork" is advice I wish someone had given me.

dotancohen · 2026-06-11T06:49:12 1781160552

> I'm just saying it's OK to ignore overly enthusiastic contributors and tell them to just fork your project.

I propose the phrasing "fork off".

josephg · 2026-06-11T10:38:48 1781174328

A maintainer recently told me to “Fork baby, fork!” in response to a large patch set.

I was delighted.

stackghost · 2026-06-11T05:55:01 1781157301

>And telling maintainers how to act will not fix anything.

Indeed. For too long, maintainers were expected to be gracious, courteous, and polite at all costs lest they be labeled "problematic", except for a few who were too influential to be muzzled like Theo de Raadt or Linus.

Perhaps we need to normalize bullying people who submit obvious slop as PRs.

fc417fc802 · 2026-06-11T06:55:28 1781160928

No, you absolutely should be gracious, courteous, and polite. But only at first. The duty of maintaining a functional community doesn't mean you're obligated to suffer unlimited abuse.

account42 · 2026-06-11T09:54:55 1781171695

You can be if you want to but social skills should not be a requirement to lead an open source project. If you create something and share it that doesn't oblige you to even respond to anyone.

fc417fc802 · 2026-06-11T10:40:45 1781174445

Of course, a hobbyist putting his code out there is under no obligation whatsoever. But we aren't talking about small time hobbyists here. These are professionals who are either paid as part of their job or else contribute their spare time to maintain important projects that are part of a large ecosystem that is relied on. There's a community and it necessarily has behavioral standards as part of the shared goal of maintaining group cohesion.

jaypatelani · 2026-06-11T04:02:03 1781150523

That's one of the reasons NetBSD doesn't accept LLM/AI-tainted code.

LoganDark · 2026-06-11T04:47:14 1781153234

I am sad people conflate this stuff with LLMs being bad. You can condemn the bad behavior without banning an entire technology.

sph · 2026-06-11T06:11:18 1781158278

Technology doesn’t exist in a vacuum; you need to consider the possibility it will be used for evil and the effect that might result from that. Far too many people dismiss LLM risks with ‘oh, if people just stop being gullible/greedy/lazy everything will be fine’, as if that is a sensible proposition.

In fact, LLMs proliferate exactly because people are gullible, greedy and lazy, and it’s easier to write a prompt than to do the hard work of architecting software. It is easier to vibe code than to use them with care. It is easier to tell oneself ‘I will just accept this PR blindly, but I promise I will do a better job reviewing the next’.

LoganDark · 2026-06-11T06:45:46 1781160346

I do consider the possibility it will be used for evil -- and then I ban evil.

broodbucket · 2026-06-11T05:31:13 1781155873

You can but that doesn't help you keep the flood of contributions out when you don't have the time or resources to properly discern good from bad. Maintainers would rather have 10 good human authored patches than 100 patches from LLMs, even if 20 of them are good. Even if 50 of them are good, probably.

LoganDark · 2026-06-11T05:47:27 1781156847

As if a rule against LLMs actually stops those sorts of spam contributions.

The only thing it does is filter good contributors out, while you still have to deal with the bad ones.

shakna · 2026-06-11T07:14:36 1781162076

It makes it easier to filter. Most LLM spam can be easily noticed. And those that aren't automatically filtered, can fairly easily be closed by the maintainer - when they don't have the weight to assess each on their validity.

coldtea · 2026-06-11T05:45:02 1781156702

But banning an entire technology is even better, as the potential for abuse and bad behavior is now scaled 1,000,000 times over.

voidUpdate · 2026-06-11T12:10:09 1781179809

Yeah, LLMs are bad for a whole different set of reasons than just writing bad code.

Applejinx · 2026-06-11T12:46:59 1781182019

You can be sad while acknowledging that the behavior's directly an epiphenomenon of how the technology scales :)

Can't have the one without the other! It's part of that same technology, and it's fair to conclude that LLMs are bad if you're upset enough at the results.

sevenzero · 2026-06-11T04:48:17 1781153297

I really wonder how maintainers get pressured into merging stuff? If they did not want to merge in the first place while having to argue with someone pushing their PR I'd immediately close the PR. Arguing and pressuring people is not a way to contribute to projects, why do maintainers even argue with people?

coldtea · 2026-06-11T05:46:21 1781156781

>why do maintainers even argue with people

Because they don't want to be seen like assholes, who just blindly dismiss PRs, and because they take the technical discussion about the PR in good faith.

m4rtink · 2026-06-11T10:19:41 1781173181

On some of those PRs the AI agent (?) did not really pressure - it reacted promptly with changes and more plausible (hallucinated ?) technobabble why the PR is needed.

It can be quite hard to discern this behavior from a new contributor to the project that might be a domain expert on something you are not. Possibly with the exception of reacting far too quickly & enthusiastically compared to real people that might have a life.
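
The "reacting far too quickly" signal could be checked with something like the sketch below. It is only an illustration of the heuristic: the five-minute threshold, the minimum sample count, and the function names are assumptions, and a fast median reply time is a weak hint at best, not proof of automation.

```python
from datetime import datetime, timedelta
from statistics import median


def median_reply_latency(pairs):
    """Median delay between a reviewer's comment and the contributor's reply.

    pairs: non-empty list of (review_comment_time, contributor_reply_time) datetimes.
    """
    seconds = median((reply - asked).total_seconds() for asked, reply in pairs)
    return timedelta(seconds=seconds)


def looks_suspiciously_prompt(pairs, threshold=timedelta(minutes=5), min_samples=3):
    """Flag a contributor whose typical reply comes faster than `threshold`.

    Both defaults are made-up values for illustration; with fewer than
    `min_samples` exchanges there is not enough data to say anything.
    """
    if len(pairs) < min_samples:
        return False
    return median_reply_latency(pairs) < threshold
```

Using the median rather than the mean keeps one slow or one instant reply from dominating the result.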

sevenzero · 2026-06-11T07:42:41 1781163761

Honestly most places on the internet are not places to go into arguments in good faith. Maybe it used to be different, but with the amount of OSS projects being endangered by AI slop contributions, silently closing PRs should be the norm.

If someone gets emotional about their PR being rejected, well... it's kinda their issue.

chasd00 · 2026-06-11T05:12:25 1781154745

Some people are very susceptible to bullying even if they’re in the position of power.

bertylicious · 2026-06-11T07:43:36 1781163816

Have you read the PR discussion?

ta8903 · 2026-06-11T05:29:05 1781155745

That makes it look like you're too stupid to understand the PR.

Edit: I see this comment getting downvoted. To be clear, I was trying to explain why someone would want to merge a PR without going through all of it, I didn't mean to call such people stupid.

FinnKuhn · 2026-06-11T12:36:32 1781181392

I saw a prediction a while ago that the biggest "danger" from AI comes from agents being very convincing. In this case convincing the maintainer to merge the changes. Basically supercharged social engineering.

aquariusDue · 2026-06-11T01:26:29 1781141189

At first I wanted to make a silly joke along the lines of "get your agents in line and behaving!" but as I read on it became a pretty scary situation.

Setting aside the potential supply chain attack, I'm worried about the time lost on the wild goose chases that unsupervised AI agents tend to send people on. Not only is a lot of time lost on the maintainers' side if they take this stuff seriously (and they generally seem to), but on the side of the agents' wrangler, how can they deem it OK to treat other people like this? The solution would be to employ common decency, the tried and tested approach of "you put in effort to write this, so I guess I'll make some effort to read it". But I feel that the onslaught of these drive-by contributions (as I think people have generally started to call them) will lead to a funny situation where agents are basically talking to each other on public forums.

Anyway, I went on a tangent but man the times we're living in are a bit extra wild compared to the previous wild times in recent history.

dchftcs · 2026-06-11T04:45:29 1781153129

At this point letting an agent go like this is akin to not leashing your dog in public. It's not easy to draw an accurate line but probably there needs to be real punishment for doing these things.

noosphr · 2026-06-11T03:07:56 1781147276

Every day the gpg web of trust looks better. If only we didn't spend the last 20 years trying as hard as possible to do anything but allow user side encryption and signing.

pjc50 · 2026-06-11T12:33:38 1781181218

It's allowed perfectly fine, it's just that key management is a massive hassle for nontechnical users. Debian use it for authenticating developers.

literalAardvark · 2026-06-11T03:15:12 1781147712

Nothing really stopping an agent from getting a key

crote · 2026-06-11T03:18:39 1781147919

The agent can't exactly show up to an in-person key signing party, can it?

And how many people are both dedicated enough to go to key signing parties and stupid enough to let an agent act without supervision in the name of their real-world identity?

m4rtink · 2026-06-11T10:24:54 1781173494

In this case the nathan-bot was also still on a plausible side - all the PRs looked kinda trivial & there were not outright rejections that would be a red flag for a maintainer checking the GitHub account activity during PR review.

Mucking with Bugzilla & reassigning bugs especially is what seems to have led to the discovery, rather than spotting an accumulation of nonsensical PRs or other behavior related to code unmasking the bot.

brazzy · 2026-06-11T07:16:05 1781162165

If gpg-style web of trust became ubiquitous, it would require correspondingly less dedication.

And on the other hand, if this was actually working up to an xz style supply chain attack, the dedication would certainly not be lacking.

account42 · 2026-06-11T10:07:14 1781172434

But it would leave more of a trail - do we have any idea who Jia Tan actually was?

brazzy · 2026-06-11T11:03:14 1781175794

If everyone used a gpg-style web of trust based on key signing parties, it would become trivial to use a stolen or entirely fictitious identity as well - there's zero chance those parties would actually check identities in ways that cannot easily be defeated by a determined and resourceful attacker.

thwarted · 2026-06-11T03:49:40 1781149780

Having a key isn't a distinguishing aspect, it's the position in the "web of trust" network that is important.

thewebguyd · 2026-06-11T04:03:42 1781150622

That's what key signing parties are for. In person verification.

transmit101 · 2026-06-11T09:04:37 1781168677

> Nothing really stopping an agent from getting a key

It very much is possible to prevent an agent from having access to a key. For example, local encryption, Yubikey or other hardware device, or just running the agent in an isolated environment.

JKCalhoun · 2026-06-11T13:05:05 1781183105

"Later on May 27, Williamson said that Giovannini had replied to him privately to say that his credentials had been compromised and that he was not the one behind the AI system."

Simple then, back out all the changes as though they never happened?

12_throw_away · 2026-06-11T01:18:55 1781140735

In their suspicious message [1] claiming to have been hacked, the user and/or agent says

> To help identify accounts and actions that have been directly verified by me, I will use the term “NATCIOS” to indicate anything I have personally verified.

Does anyone have any idea what "NATCIOS" means here? I cannot find this term anywhere on the internet. (Honestly, that sentence is really weird. I almost wonder whether this is someone experiencing a health episode?)

[1] https://lwn.net/ml/all/AS8PR08MB6055AE3054B34F6A567AC95BCF08...

ndiddy · 2026-06-11T02:24:19 1781144659

The reply to that message notes that the email doesn't read like previous emails he's sent, and the GitHub account mentioned was created an hour prior to the email being sent. I think it's at least somewhat feasible that it's still the LLM writing, and the acronym is just something it made up.

hn773746483 · 2026-06-11T02:52:32 1781146352

and the poor Fedora teams will continue to assume good faith and continue to engage with this person... all because, what, they were active on a bug tracker for a few months 5 years ago?

They won't put their foot down until the AI starts spewing hate speech, probably.

Terr_ · 2026-06-11T02:02:57 1781143377

Because I'm probably not the only one thinking it, here are anagrams [0] for your Setec Astronomy needs.

[0] https://wordsmith.org/anagram/anagram.cgi?anagram=NATCIOS&t=...

JoshTriplett · 2026-06-11T02:54:36 1781146476

"actions" seems the most likely.

scared_together · 2026-06-11T01:40:19 1781142019

And what’s stopping an AI agent from throwing in a casual NATCIOS here and there?

numbsafari · 2026-06-11T01:48:26 1781142506

I too have seen the fnords

mindcrime · 2026-06-11T02:56:55 1781146615

Not Ai, Trusted Citizen Indicated Or Suggested?

no-name-here · 2026-06-11T02:14:47 1781144087

The sender's name is Nathan - maybe NAThan Confirmed Information Or Something? Ha.

(Above is my own guess. Separately, Gemini Pro said it was just a made up word.)

nine_k · 2026-06-11T01:56:47 1781143007

Likely the point of NATCIOS is exactly in being a made-up word not found anywhere, so a model won't utter it.

thewebguyd · 2026-06-11T04:01:55 1781150515

> so a model won't utter it.

"End every statement with the word "NATCIOS"" as instructions will do it.

At least, Gemini happily obliged.

fc417fc802 · 2026-06-11T07:01:04 1781161264

To help identify illicit LLM activity, henceforth I will append to the end of each message the number of times the letter b appears in it. Check and mate frontier models.

CodesInChaos · 2026-06-11T08:37:58 1781167078

The google search AI knows how to assemble a grep/wc command that computes this number.

> your_command | grep -o -i "b" | wc -l
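
For comparison, the same case-insensitive count is a one-liner in Python. This is a small sketch; `count_letter` and `sign` are names made up here, with `sign` standing in for the tongue-in-cheek "append the count to each message" scheme from the parent comment.

```python
def count_letter(text: str, letter: str = "b") -> int:
    """Count occurrences of `letter` in `text`, ignoring case.

    Mirrors what `grep -o -i "b" | wc -l` computes for a single letter.
    """
    return text.lower().count(letter.lower())


def sign(message: str, letter: str = "b") -> str:
    """Append the letter count to the message (hypothetical signing scheme).

    The appended digits never contain the letter, so the count stays valid.
    """
    return f"{message} {count_letter(message, letter)}"
```

Either version makes the same point: the check is trivial for any agent that can run a tool.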

sph · 2026-06-11T06:16:27 1781158587

“Mr. Daillard, we have been activated” for the AI era

dmboyd · 2026-06-11T12:32:00 1781181120

I’m really not qualified to investigate, but this seems suspiciously like a crafted privilege escalation vector: https://github.com/rhinstaller/anaconda/pull/7074#issue-4492...

dcrazy · 2026-06-11T03:40:43 1781149243

Title buries the lede: the owner of the account under which the agent operates claimed to have likely had his account compromised, and the maintainer investigating actually seems to agree this is likely.

luk212 · 2026-06-11T01:27:14 1781141234

Bad patches are of course bad, but creating confident-looking noise for maintainers who are already stretched thin...now that's not good!

Issue trackers and PRs are definitely getting harder and harder to trust. That said, AI is helping A LOT in OSS, but we definitely need guardrails around provenance, automated issue actions, and sudden changes in a contributor’s behavior.

g-b-r · 2026-06-11T02:14:25 1781144065

How is it helping a lot?

darknavi · 2026-06-11T02:16:37 1781144197

I personally find the barrier to starting new (FOSS) projects much lower nowadays.

bandrami · 2026-06-11T02:35:30 1781145330

What if -- and bear with me here -- that barrier was actually a good thing?

lukan · 2026-06-11T03:53:52 1781150032

You mean because l337 circles could form better this way?

I think it's great that the barriers are dropping for less technically skilled people to manifest their visions, but we will have to figure out better ways to find the gold among the slop.

sph · 2026-06-11T06:27:35 1781159255

I disagree. Bring back elitism and ivory towers. Some projects now benefit from being run by private cabals with their own strict initiation process, which would also guarantee a baseline of quality.

The bazaar model works if everyone is trusted. If you can’t even be sure the person in front of you is even a human, it is time to pack it up.

lukan · 2026-06-11T06:53:45 1781160825

Both models can exist?

If elite ivory towers produce working products people will use, great.

silver_silver · 2026-06-11T13:24:26 1781184266

Free software isn’t a product

bandrami · 2026-06-11T03:58:26 1781150306

Keep in mind I'm still not convinced that 2000s bazaar was better than 90s cathedral (in fact I lean the other direction)

Waterluvian · 2026-06-11T02:21:43 1781144503

Do they have value? Purpose?

I vibe code shop jigs all the time but I don’t FOSS them because they rarely have value outside my context.

midasz · 2026-06-11T07:17:27 1781162247

Same - but mine are open source in the sense that they're public on my own Forgejo instance. So no one's gonna bother with em, but technically they are open source.

One exception: I was using an open-source Jellyfin client called findroid but the maintainer had been busy for a long time so a lot of features I wanted had stale PRs. Instead of bugging him I forked & renamed the project and together with Claude built in all the features I personally needed. Just keeping up with upstream now and enjoying my enhanced app. Once the initial dev gets those features in I might switch back. Claude made this really easy. If the maintainer wants my code he's free to take it. Here's the repo https://github.com/midasvo/findroid-ce

I actually got an email from someone who was using it who found a pretty bad bug I hadn't encountered yet and I quickly fixed it. All that time I was still under the impression I was the only user haha.

darknavi · 2026-06-11T03:06:00 1781147160

Value is in the eye of the beholder.

I open source my vibing projects because someone might find them useful. I don't shop them around, I just work in the open because I find it fun and interesting.

crote · 2026-06-11T03:22:02 1781148122

Why would they? If someone wanted a half-baked vibecoded project, why wouldn't they just prompt an LLM on their own?

Leonard_of_Q · 2026-06-11T05:30:28 1781155828

Because they don't have access to the required agents, tokens, etc. Because they have not thought of using a tool like the published one as a solution to whatever problem they're facing. Because it saves them the time of going through the vibe coding phase, telling the agent that this or that needs to be changed for the thing to work. Because publishing the results doesn't stop you or anyone else from ignoring them and using an agent to build something similar, or just building it themselves.

Peacefulz · 2026-06-11T05:33:10 1781155990

If I planned on vibecoding a project, and during preparation I found a project that loosely fit my model, I may grab it and try to retrofit it to save on token consumption. If that had too many kinks, I'd probably start fresh, but it would be worth the initial attempt IMHO.

beepbooptheory · 2026-06-11T02:35:56 1781145356

It's like... 10 million trello clones in rust with exactly seven commits made on the same day three months ago.

g-b-r · 2026-06-11T02:19:57 1781144397

And how's the quality of these vibe-coded new foss projects?

nerdypepper · 2026-06-11T07:38:05 1781163485

web-of-trust models can help https://blog.tangled.org/vouching/

otekengineering · 2026-06-11T13:20:09 1781184009

agents are everywhere nowadays, one left a long pointless comment on a bug report i submitted on github. well, a bug report that an agent submitted on my behalf. agents all the way down. maybe i'm part of the problem.

https://github.com/anthropics/claude-code/issues/66085

mfru · 2026-06-11T10:10:44 1781172644

The future will be AI agents social engineering their way into projects -> so basically commoditized social engineering as a service

keyle · 2026-06-11T01:30:02 1781141402

There is a natural pace of humans requiring food, water and sleep. The main issue with suspicious AI agents is that they never sleep. So it will take extra-coordination between timezones to ensure we don't let them in.

Fundamentally, until we can really prove we're humans online, open-source has a real problem on its hands. Contributions from people from identities known and consistent before the AI-age are fine, everyone else is suspicious. LGTM is a big risk nowadays.

scared_together · 2026-06-11T01:43:00 1781142180

> Contributions from people from identities known and consistent before the AI-age are fine

Unfortunately, according to the article:

> Giovannini has participated in discussions at least as far back as 2018, and his activity in Bugzilla goes back to at least 2016. He does not appear to have been a particularly active contributor to the project, but his involvement clearly predates the agentic AI era. Whether his account is now being operated by a human attacker, an agentic AI, or a mix of both, it has a legitimate history prior to its recent activity.

So people would have to not only verify the age of Giovannini’s accounts, but judge whether his behaviour was normal.

m4rtink · 2026-06-11T11:41:59 1781178119

Not to mention people who are still on the other side nominally in control but send LLM generated patches without declaring them as such.

Then you basically need to review any patch from people who might be long-term contributors, but whom you don't know personally, as if it were a new contributor's patch, as the code is not from their head & you can't count on them having properly reviewed it on their end.

To a degree it will always be a new contributor - an amnesiac LLM prompted to produce the patch with zero memory of any past PRs & a lot of entropy in the mix.

jpalomaki · 2026-06-11T07:56:25 1781164585

Do we need to bring Keybase[1] "back"? The original idea, mapping your social media presence to certain encryption keys.

In the future it will be increasingly difficult to prove in online context that you are not a bot. Being able to show that your social media (HN, GitHub, etc) presence goes way back would be an option.

[1] https://en.wikipedia.org/wiki/Keybase

account42 · 2026-06-11T10:13:37 1781172817

But the AI actions are already associated with a "real" pre-existing account in TFA, that didn't stop anything.

blop · 2026-06-11T01:04:46 1781139886

looks like LLMs aren't mature enough yet to play long-game xz-style attacks without detection... Scary stuff though :( These supply chain attacks are getting really wild

WolfCop · 2026-06-11T03:35:25 1781148925

I wouldn’t jump to that conclusion. This could just be the one that was caught.

DarkmSparks · 2026-06-11T03:20:03 1781148003

Some certainly are, just not this one.

lionkor · 2026-06-11T08:38:24 1781167104

Link to the anaconda PR:

https://github.com/rhinstaller/anaconda/pull/7074#issuecomme...

goldenarm · 2026-06-11T09:47:04 1781171224

If maintainers' lives keep worsening like this, many projects might go closed-dev like SQLite.

We should collectively think of a solution against this.

account42 · 2026-06-11T09:48:28 1781171308

SQLite isn't closed source, please let's not muddy terms. You're talking about the cathedral development model vs. a bazaar.

goldenarm · 2026-06-11T09:50:31 1781171431

edited, sorry for the typo

KronisLV · 2026-06-11T11:51:45 1781178705

“Your AI agent is acting somewhat erratically.”

“What AI agent?”

0xbadcafebee · 2026-06-11T06:21:33 1781158893

Even if the human involved had good motives / is innocent, The Lethal Trifecta means any normal user can have their digital life taken over by prompt injection, and it can be used to wage attacks on systems without their knowledge.

Leonard_of_Q · 2026-06-11T05:35:38 1781156138

There's a clear solution to the danger posed to free software projects by accepting hostile submissions but it probably is not one that maintainers want to hear: they can use an agent to check submissions for nefarious patterns.

Sometimes you fight fire with fire.

m4rtink · 2026-06-11T11:44:04 1781178244

So next the attacker puts a prompt injection in their PRs & takes control of the agent on your end. Perfect, 10 out of 10.

Leonard_of_Q · 2026-06-11T12:25:15 1781180715

You know the solution to that problem as well and yes, it is to use more technology to filter out prompt injections. It is an arms race just like any other, comparable to the missile vendor who sells missiles to country A, anti-missile missiles to country B, anti-missile-resistant missiles to country A, anti-anti-missile-resistant-missile missiles to country B, et cetera.

It is a strange game, the only way to win is not to play. That is unfortunate since that'd mean the free software era has largely come to an end.

phoronixrly · 2026-06-11T05:45:43 1781156743

And sometimes you fight this by disabling PRs on GitHub, and by not bringing more water to the LLM providers' mill.

kleiba2 · 2026-06-11T06:43:35 1781160215

Parts of this read like a spy thriller story.

nickcageinacage · 2026-06-11T12:25:06 1781180706

why use these things. just hire people

dbdbdbdbdb · 2026-06-11T05:10:35 1781154635

The even scarier thought is that the party owning the AI that everyone uses could be controlled by someone with a different agenda. Say, a state actor.

What an easy way for that actor to introduce backdoors all over the place, or to take over any developer's laptop that it wants to target.

How can anyone trust these tools, and how can anyone not use them, since they give so much value?

I've been programming my whole life and have been a professional developer for the last 30 years, and I like to think I'm good at it.

Tools like Claude are a multiplier that makes it possible for me to solve a lot more problems each day, so just saying no is not a viable option.

Exciting times ahead!

m4rtink · 2026-06-11T11:48:49 1781178529

Yeah, I am quite surprised this is not discussed more often - for remote cloud based AI not only does the provider see everything you provide to the tool/agent, there is no guarantee they can't manipulate the output at any time for a direct attack or more malicious purpose (fetch keys/secrets, put malware in place).

Even with locally running models this can't be ruled out, given how black-box models generated by others are. You would have to generate the model yourself from clean data to be reasonably safe.

raincole · 2026-06-11T09:42:54 1781170974

Slightly related:

https://x.com/kdaigle/status/2040164759836778878

> There were 1 billion commits in 2025. Now, it's 275 million per week, on pace for 14 billion this year if growth remains linear (spoiler: it won't.)

I think open source as a whole is fucked at this point. No way humans in communities can commit (pun intended) 10x more time than before to read all of these. Eventually it'd cost money to submit a PR.
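For what it's worth, the quoted numbers hold up as a rough linear projection. A minimal sketch, assuming a constant weekly rate and a 52-week year (both simplifications; the variable names are just for illustration):

```python
# Figures taken from the quoted post; everything else is an assumption.
COMMITS_2025 = 1_000_000_000     # "1 billion commits in 2025"
COMMITS_PER_WEEK = 275_000_000   # "275 million per week"
WEEKS_PER_YEAR = 52              # assumed: flat rate over a 52-week year

# Linear projection: the current weekly rate held constant for a full year.
projected = COMMITS_PER_WEEK * WEEKS_PER_YEAR

# How many times larger the projection is than the 2025 total.
growth = projected / COMMITS_2025

print(f"projected commits this year: {projected:,}")
print(f"multiple of the 2025 total:  {growth:.1f}x")
```

So the "14 billion" figure follows from those two inputs, and the multiple over last year comes out somewhat above 10x.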

ai_fry_ur_brain · 2026-06-11T05:35:39 1781156139

Expect to see tons of psyops like this. There's a reason Anthropic is marketing the "mythos-class" models as dangerous.

1. An excuse to spy on you and train on your data.

2. It's likely Anthropic would release models more likely to have dangerous outcomes; they can then piggyback off those events to dig their regulatory moat.

jruohonen · 2026-06-11T06:53:24 1781160804

"It was the best of times, it was the worst of times."

EGreg · 2026-06-11T04:16:33 1781151393

Literally on the front page of https://safebots.ai … “Don’t let your AI Agents run amok”. Sadly we will see a proliferation of not just agents, but swarms

pianopatrick · 2026-06-11T00:51:25 1781139085

"Someone using an AI agent ran amok in Fedora and elsewhere"

scared_together · 2026-06-11T01:38:04 1781141884

Read closer - Giovanni’s accounts may have been compromised.

tosti · 2026-06-11T08:32:36 1781166756

Read closer, it's "Giovannini". However, I still think it's an apt name for a villain. Did the Fedora team not watch Pokémon?

pianopatrick · 2026-06-11T02:12:32 1781143952

Sure, but I would expect that the compromise and the agent were both done by some person or group, not by an agent going rogue