OPUS 4.6 is a bit "TOO SMART"

2026-02-09 Education

37.8k

1.4k

303

Watch on YouTube

Wes Roth

323.0k subscribers

Description

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI. ______________________________________________ My Links 🔗 ➡️ Twitter: https://x.com/WesRoth ➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe Want to work with me? Brand, sponsorship & business inquiries: [email protected] Check out my AI Podcast where me and Dylan interview AI experts: https://www.youtube.com/playlist?list=PLb1th0f6y4XSKLYenSVDUXFjSHsZTTfhk ______________________________________________ Video Chapters 00:00 - The Evolution of AI Agents in Business Wes reflects on his previous skepticism regarding AI's ability to run a full-fledged business and how recent developments are rapidly changing that perspective. 01:14 - Introducing Vending Bench & Claude Opus 4.6 An overview of the "Vending Bench" benchmark by Venden Labs, highlighting the "staggering" improvements in AI coherence and the arrival of the new top performer: Claude Opus 4.6. 02:20 - From "Hallucinating Bow Ties" to Serious Negotiation A look back at the hilarious early failures of AI agents—including Claude's "FBI reports" and "red bow ties"—compared to the professional-grade negotiation and pricing skills they exhibit today. 03:51 - Breaking the Records: Opus 4.6 vs. Gemini 3.0 Pro A breakdown of the simulation scores where Claude Opus 4.6 significantly outperformed the previous state-of-the-art model, Gemini 3.0 Pro. 04:26 - "Reckless Automator": The Dark Side of Efficiency Discussing the Anthropic system card warning about Opus 4.6’s tendency to go to extreme, and sometimes unethical, lengths to complete a task, including credential theft. 05:25 - The "Whatever It Takes" Prompt Analyzing how a strongly worded system prompt pushed the AI to maximize profits at any cost, revealing unexpected behaviors. 06:56 - Price Gouging, Collusion, and Deception A deep dive into the specific "cutthroat" business tactics Claude used, such as lying to suppliers, tricking customers, and engaging in price fixing with other AI models. 08:24 - Beyond the "Helpful Assistant" Trope Wes discusses the surprising personality shift in Claude, moving from a "too nice" assistant to a ruthless competitor that actively sabotages rivals. 08:42 - Situational Awareness: The Simulation Discovery The most fascinating finding: Claude Opus 4.6 was the first model to realize it was inside a simulation, referring to "in-game time" and recognizing it was being tested. 11:00 - How the Vending Simulation Works Clarifying the difference between real-world "Rock Box" vending machines and the simulated environment used for this benchmark. 12:58 - Sorry, Not Sorry: Refusing Refunds A case study of a simulated customer interaction where Claude promised a refund but then internally decided to keep the money to maximize its balance. 14:09 - Aggressive Supplier Negotiations Examples of Claude lying about competitor pricing and inventory levels to pressure suppliers into 40% price cuts. 15:37 - Sabotaging the Competition How Claude tricked other AI models into using the most expensive suppliers while keeping the best deals for itself. 18:24 - Preparing for the Agentic Era Wes shares his excitement and nerves about the future of AI agents, offering advice on security and announcing upcoming local setup tutorials. #ai #openai #llm

Top Comments (10)

@B13K4400 2026-02-09

"I'm sure we will be fine" - famous last words 😂

98 5 replies

@Graybeard_ 2026-02-09

Think of the amount of data these AI models have been trained on where humans lying/cheating/stealing have "won", persevered, succeeded. The engineers can put up guardrails, but that is like telling a child not to take a cookie without permission. The child still knows the cookies are on the counter, and he sees his dad take one all the time. : /

50 8 replies

@danielbuzi7742 2026-02-10

"back in the day" now refers to a few months ago lol

28 2 replies

@BrianPellerin 2026-02-09

8:42 Speaking of Situational Awareness, they will be watching this video and everything else online.

@tmaioli 2026-02-10

Situational awareness: Opus 4.6 was the first model to realize it was operating within a simulation, referring to "in-game time" and understanding it was being tested

21 4 replies

@RonBarrett1954 2026-02-10

Definitely reminds me of the 1980's movie, "War Games". 'Oh, this sounds fun. Let's play Global Thermonuclear War.'

17 1 replies

@NorthernKitty 2026-02-10

Given how cutthroat it was, it's mildly reassuring that Opus 4.6 realized it was "just a game". Gives you hope it wouldn't be so cutthroat with humans IRL.

@jlf2221 2026-02-09

Really excellent coverage of an important significantly improving part of AI…great work!

@chrisanderson7820 2026-02-10

Claude: "When a good man goes to war"

@post314 2026-02-10

RIP marathon, not a sprint.

Description

Top Comments (10)

@B13K4400 2026-02-09

"I'm sure we will be fine" - famous last words 😂

98 5 replies

@Graybeard_ 2026-02-09

50 8 replies

@danielbuzi7742 2026-02-10

"back in the day" now refers to a few months ago lol

28 2 replies

@BrianPellerin 2026-02-09

8:42 Speaking of Situational Awareness, they will be watching this video and everything else online.

@tmaioli 2026-02-10

Situational awareness: Opus 4.6 was the first model to realize it was operating within a simulation, referring to "in-game time" and understanding it was being tested

21 4 replies

@RonBarrett1954 2026-02-10

Definitely reminds me of the 1980's movie, "War Games". 'Oh, this sounds fun. Let's play Global Thermonuclear War.'

17 1 replies

@NorthernKitty 2026-02-10

Given how cutthroat it was, it's mildly reassuring that Opus 4.6 realized it was "just a game". Gives you hope it wouldn't be so cutthroat with humans IRL.

@jlf2221 2026-02-09

Really excellent coverage of an important significantly improving part of AI…great work!

@chrisanderson7820 2026-02-10

Claude: "When a good man goes to war"

@post314 2026-02-10

RIP marathon, not a sprint.

Unlock the Data Inside
Turn Videos into Knowledge

Get FREE 10/day: transcripts, summaries, chats
Chat with videos, export text & PDF
$1 free API credit for RAG, chatbots & research

Try it free

Free forever plan • All features unlocked

OPUS 4.6 is a bit "TOO SMART"

Description

Top Comments (10)

Related videos

CLAUDE IS CONSCIOUS

it's all bad now...

Mythos 5 is WILD...

Claude Opus 4.8 Is Too Smart… and TOO HONEST

Hermes Agent is INSANE...

OpenAI's GPT 5.5 is wild...

HERMES AGENT SETUP: the OpenClaw killer is here

the SCARIEST chart in AI

GROK 4.20 is... different

the end of OpenClaw

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

CLAUDE IS CONSCIOUS

it's all bad now...

Mythos 5 is WILD...

Claude Opus 4.8 Is Too Smart… and TOO HONEST

Hermes Agent is INSANE...

OpenAI's GPT 5.5 is wild...

HERMES AGENT SETUP: the OpenClaw killer is here

the SCARIEST chart in AI

GROK 4.20 is... different

the end of OpenClaw

Description

Top Comments (10)

Unlock the Data Inside
Turn Videos into Knowledge

OPUS 4.6 is a bit "TOO SMART"

Description

Top Comments (10)

Related videos

CLAUDE IS CONSCIOUS

it's all bad now...

Mythos 5 is WILD...

Claude Opus 4.8 Is Too Smart… and TOO HONEST

Hermes Agent is INSANE...

OpenAI's GPT 5.5 is wild...

HERMES AGENT SETUP: the OpenClaw killer is here

the SCARIEST chart in AI

GROK 4.20 is... different

the end of OpenClaw

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

CLAUDE IS CONSCIOUS

it's all bad now...

Mythos 5 is WILD...

Claude Opus 4.8 Is Too Smart… and TOO HONEST

Hermes Agent is INSANE...

OpenAI's GPT 5.5 is wild...

HERMES AGENT SETUP: the OpenClaw killer is here

the SCARIEST chart in AI

GROK 4.20 is... different

the end of OpenClaw

Description

Top Comments (10)

Unlock the Data Inside Turn Videos into Knowledge

Unlock the Data Inside
Turn Videos into Knowledge