This isn’t an AI problem, this is a “Don’t allow anyone to access your backups without following protocol” problem.
Congratulations, you just identified the AI problem.
That’s the lone problem?
Seems to be, yes. The AI had the access it needed to do the job it was given, and that access allowed it to cause the problem.
The alternative that would have prevented this issue was to not use AI for this.
A human with the same permissions would have been capable of fucking up too. Giving the equivalent of a junior dev with a learning disability the keys to the whole place is just dumb.
(Relying on AI is dumb anyway, but that’s not the biggest issue in this specific case)
Correct. You too have now identified the AI problem. This was the job of a human senior infrastructure engineer that they delegated to an AI agent. They’ve found out why it’s not an AI’s job.
I can’t read the original twitter link, but I’m not sure they handed it the job of a senior infrastructure engineer. The article says “routine”, which to me is something you can hand off to a junior just fine. When they hit a snag, they obviously should stop and ask what to do, but even then, a human might want to avoid admitting ignorance and try to fix it themselves instead. They shouldn’t have privileges to fuck up that badly.
So while it’s on the AI for taking destructive steps, I do think there’s a human error in the form of grossly irresponsible rights allocation. If this were a first-of-its-kind incident showing an otherwise stellar AI fucking up badly, I’d classify it as a pure AI problem, but AI’s limits are hardly novel at this point. There have been previous incidents circulating in the media. We’ve had memes about it. If you can’t stay up to date on your tools and their shortcomings, you shouldn’t be using them, because discovering a footgun becomes a question of “when”, not “if”.
That’s why I consider this partially a human failing: If you’re gonna use a tool, make sure that it operates within safe limits. The chainsaw doesn’t know the difference between tree and bone, so it’s on you to make sure it stays away from anyone’s legs. So while “Chainsaw can saw legs if wielded improperly” is a problem that was accepted as a tradeoff for its utility, you can’t really blame the chainsaw if you zip-tied the safety.
(Again, not to say Anthropic is blameless for letting its random generator generate randomly destructive shit. I just don’t think that’s the only point of failure here.)
Yes and in this case using it for this job at all was clearly not within safe limits. You keep hammering on “It’s not the AI’s fault it was given a job with too big of a blast zone for it to safely do” after I’ve said “This type of job has too big a blast zone for an AI to safely do” and somehow you’ve convinced yourself that these are two different things.
Do you have any detail on what “this job” was? Like I said, I don’t have access to the original statement because twatter wants me to log in to see it.
What I do see is “routine task in the […] staging environment”, and that doesn’t sound like a big blast zone job. Again, it’s comparable to a job you’d give a junior engineer. There shouldn’t be much a junior engineer can fuck up, no matter how “creative” their solutions.
Whether it’s a human junior engineer, an automated script, or an agentic AI, they should never have more privileges than they need for the job. Granting anyone (or anything) other than the senior admin the permission to delete a volume is irresponsible.
The AI generating that fucking awful idea is on the AI (or its developers). Both are partial causes of the incident. It’s not just human error, but the human error involved would have been dangerous regardless of AI involvement.
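To make that least-privilege point concrete, here’s a minimal sketch of what scoping the role down could look like, assuming an AWS-style setup with boto3 (my assumption; the article doesn’t say which provider was involved, and every name below is hypothetical):

```python
import json
import boto3  # assumption: AWS + boto3; the actual provider isn't named in the thread

iam = boto3.client("iam")

# Hypothetical policy for whatever runs the "routine staging task":
# it can look at things, but an explicit Deny (which overrides any Allow)
# blocks the destructive calls outright.
staging_task_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": ["ec2:Describe*"],  # read-only inspection of the environment
            "Resource": "*",
        },
        {
            "Effect": "Deny",  # no deleting volumes, no matter how "creative" the fix
            "Action": [
                "ec2:DeleteVolume",
                "ec2:DeleteSnapshot",
                "ec2:TerminateInstances",
            ],
            "Resource": "*",
        },
    ],
}

iam.put_role_policy(
    RoleName="staging-agent-role",        # hypothetical role name
    PolicyName="routine-task-guardrail",  # hypothetical policy name
    PolicyDocument=json.dumps(staging_task_policy),
)
```

The exact actions don’t matter; the point is that with an explicit Deny, the blast radius is a property of the role, not of whoever (or whatever) happens to be wearing it.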
These protocols predate LLMs
Yes, that’s right: the protocols we humans used to have for giving only trusted, reliable people this level of access to infrastructure predate LLMs, and they were a great way to stop this from happening.
However, the AI is here now, and when you give an autonomous agent with known hallucination problems access to act on your behalf with your IaC on your infra provider, this kind of thing is an inevitability.
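For contrast, “acting on your behalf with your IaC” doesn’t have to mean handing over apply/destroy. A minimal sketch of the kind of guardrail I mean, assuming the agent drives Terraform through a wrapper (my construction; nobody in the incident described doing this):

```python
import subprocess

# Hypothetical allow-list: the agent may inspect and propose (read-only
# subcommands), but anything that mutates infrastructure needs a human.
READ_ONLY = {"plan", "validate", "show", "output"}

def run_terraform(args: list[str]) -> str:
    """Run a terraform subcommand on the agent's behalf, allow-list first."""
    cmd = args[0] if args else ""
    if cmd not in READ_ONLY:
        raise PermissionError(f"terraform {cmd!r} requires human approval")
    result = subprocess.run(
        ["terraform", *args], capture_output=True, text=True
    )
    return result.stdout + result.stderr

if __name__ == "__main__":
    print(run_terraform(["plan", "-no-color"]))      # allowed: read-only
    try:
        run_terraform(["destroy", "-auto-approve"])  # blocked before it ever runs
    except PermissionError as e:
        print(e)
```

It’s crude, but it turns “the agent hallucinated a destroy” from an outage into a log line.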