600 million to 13 billion parameters? Those are very small models... Most frontier LLMs are in the hundreds of billions of parameters, if not getting into trillion-parameter territory.
Not particularly surprising, given that you don't need a huge amount of data to fine-tune models of that size anyway.
Still cool research, and poisoning is a real problem, especially with deceptive alignment being possible. It would be cool to see it tested on a larger model, but I guess it would be super expensive to train one only for it to be shit because you deliberately poisoned it. Safety research isn't going to get the same kind of budget as development. :(