LLMs are in trouble

2025-10-14 Science & Technology
611.4k
23.6k
2.2k
ThePrimeTime
1.1m subscribers

The Shocking Discovery: How Tiny Data Samples Can Poison Large Language Models

Learn why the conventional wisdom about LLM data poisoning does not hold, and how little effort it takes to compromise models of up to 13 billion parameters. The analysis shows that an attacker needs only a small, fixed number of documents, not a large percentage of the training data, to inject malicious behavior.

Short Summary

  • Research shows that poisoning success depends on the absolute number of injected documents, not on the share of the training data they make up, contrary to the prior expectation that an attacker must control a large percentage of the data.
  • As few as 250 poisoned documents (roughly 0.00016% of training tokens) successfully backdoored a 13-billion-parameter model.
  • This vulnerability supports the emergence of "LLM SEO," where actors can intentionally shape public data landscapes to sway model outputs.
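
The share of training tokens that 250 documents represent can be checked with rough arithmetic. The sketch below is illustrative only: the tokens-per-parameter ratio and the average poisoned-document length are assumptions that do not appear on this page, chosen to approximate the setup described in the linked Anthropic write-up, and all names are hypothetical.

```python
# Back-of-the-envelope check of how small a fixed set of poisoned
# documents is relative to a full pretraining corpus.
# ASSUMPTIONS (not stated on this page):
#   - about 20 training tokens per model parameter
#   - about 1,680 tokens per poisoned document (roughly 420k tokens total)

PARAMS = 13_000_000_000        # 13-billion-parameter model (from the summary)
POISON_DOCS = 250              # number of poisoned documents (from the summary)
TOKENS_PER_PARAM = 20          # assumed training-data ratio
TOKENS_PER_POISON_DOC = 1_680  # assumed average poisoned-document length


def poison_percentage(poison_tokens: int, total_tokens: int) -> float:
    """Return poisoned tokens as a percentage of all training tokens."""
    return 100 * poison_tokens / total_tokens


total_tokens = PARAMS * TOKENS_PER_PARAM
poison_tokens = POISON_DOCS * TOKENS_PER_POISON_DOC
pct = poison_percentage(poison_tokens, total_tokens)

print(f"total training tokens: {total_tokens:,}")
print(f"poisoned tokens:       {poison_tokens:,}")
print(f"poisoned share:        {pct:.5f}%")
```

Under these assumptions the poisoned share comes out on the order of 0.0001% to 0.0002% of training tokens. Because the document count stays fixed, the same 250 documents would make up an even smaller share of a larger corpus.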

The speaker reviews the key findings of a recent Anthropic research paper on data poisoning attacks. The research shifts the security concern from whether an attacker can control a large share of the training data to whether they can run a small, targeted content injection campaign, which is far easier to achieve.

Description

https://twitch.tv/ThePrimeagen - I Stream 5 days a Week
Become A Great Backend Dev: https://boot.dev/prime (I make courses for them)
This is also the best way to support me is to support yourself becoming a better backend engineer.
https://twitter.com/terminaldotshop - Order coffee over SSH! ssh terminal.shop
Discord: https://discord.gg/ThePrimeagen

### LINKS
https://www.anthropic.com/research/small-samples-poison

Great News? Want me to research and create video????: https://www.reddit.com/r/ThePrimeagen
Kinesis Advantage 360: https://bit.ly/Prime-Kinesis

Top Comments (10)

@KiraOTS 2025-10-14

The thief is calling me malicious for putting fake gold in my safe that he stole from.

6.4k 156 replies
@dracuxan 2025-10-14

I knew it. training model on that reddit dataset was a bad idea

3.6k 58 replies
@Rayleigh47-t4x 2025-10-14

Job security - make poisoned repositories!

2.8k 44 replies
@mavroudisv 2025-10-14

Hey! This is my paper! Didn't expect it covered here! Edit: I don't work for Anthropic, the paper includes three orgs.

1.7k 41 replies
@ToddMagnussonWasHere 2025-10-14

People: “Don’t believe everything you hear on the internet” LLM: “Bet.”

840 10 replies
@tranthien3932 2025-10-14

"Look at what they have to do to mimic a fraction of our power" - Junior Devs

814 6 replies
@LiterallyRyanGosling-p8b 2025-10-14

Your honour, I was hospitalized when I broke into my attacker's house and drank the contents of his toilet, which he had maliciously poisoned specifically to target me

747 21 replies
@lespectator4962 2025-10-15

Once again, this wouldn't be as big of a problem if it really was intelligent. It's just a search engine with charisma.

139 18 replies
@POVShotgun 2025-10-16

Silicon Valley: the 51% attack Reality: the 0.00016% attack

65 2 replies
@maplepotato3676 2025-10-18

"LLM SEO is the production of the dead internet" -- I'm an SEO specialist and I think you're 100% correct. My industry is so broken.

64
