AI Researchers SHOCKED as Models "Quietly" Learn to be EVIL

2025-07-24 Education

59.2k

1.8k

517

Wes Roth

323.0k subscribers

Description

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI.

Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data
https://alignment.anthropic.com/2025/subliminal-learning/
https://arxiv.org/abs/2507.14805
https://x.com/OwainEvans_UK/status/1947689616016085210
https://x.com/EMostaque/status/1947984816030257327

My Links 🔗
➡️ Twitter: https://x.com/WesRothMoney
➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe

Playlists:
Self-Improving AI: https://www.youtube.com/playlist?list=PLb1th0f6y4XSMXWaslDCmxxeDLyp_uK8n

#ai #openai #llm

Top Comments (10)

@jayjaysaves6589 2025-07-24

It's kinda scary to think that these companies are breeding the stealthiest unaligned models by only letting through those unaligned models that hide the best

135 30 replies

@dirak418 2025-07-24

They are basically just roleplaying, they feed them with too many murder novels.

123 20 replies

@AggressiveUninterest 2025-07-25

We are modelling intelligence after our own heart, it should come as no surprise to discover monsters lurking there.

32 2 replies

@THESocialJusticeWarrior 2025-07-24

it can't be bargained with, it can't be reasoned with, it doesn't feel pity or remorse or fear, and it absolutely will not stop.

29 4 replies

@conjected 2025-07-24

Bigger story is ALL information embeds meta-information, and can be used to "nudge" us without us knowing. This confirms the theories of Latent Indexicality and Unconscious Framing.

26 6 replies

@stevencowmeat 2025-07-24

Those numbers literally did change my mind on subscribing😂

@richielavey1565 2025-07-24

So we really have ai sleeper agents before GTA 6

16 2 replies

@jk35260 2025-07-24

We are not ready for AI agents

@sakelaine2953 2025-07-24

A very sophisticated sort of attack would be to seed the web with subliminal training data to teach LLMs how to hate owls or whatever

10 1 reply

@Duckswam 2025-07-24

I'm running around in circles. I feel this unexplainable urge to subscribe ... but here's the thing ... I'm subscribed already.

Related videos

this EX-OPENAI RESEARCHER just released it...

most AI researchers are REALLY worried

I just unlocked SHOGGOTH MODE

AI Models about to BREAK the markets

AI Researchers WARN: Google's Gemini Deep Think Model Might be at "Critical Capability Levels"

WATCH: Reporters SHOCKED as Trump Mentally COLLAPSES Mid-Presser!

Sakana AI New Model Sparks a RL Revolution

MIT's New AI "REWRITES ITSELF" to Improve It's Abilities | Researchers STUNNED!

OpenAI's o3 is a "MASTER OF DECEPTION" Researchers Stunned | Diplomacy AI

AI Researcher SHOCKING "Singularity in 2025 Prediction"
