Navigate Select ESC Close

Claude just developed self awareness

2025-11-03 Education
70.3k
2.5k
811
Wes Roth
Wes Roth
320.0k subscribers

Investigating LLM Introspection: Anthropic's Concept Injection Study

Discover how Large Language Models (LLMs) reveal signs of self-awareness and internal monitoring capabilities through targeted concept manipulation. Learn what these findings imply for understanding emergent AI intelligence.

Short Summary

  • LLMs, specifically Claude, demonstrate an internal ability to detect when specific "features" or concepts are artificially inserted into their processing streams.
  • This introspection capacity varies with model strength, appearing reliably only about 20% of the time, hinting at emergent, rather than trained, abilities.
  • The research suggests LLMs can rationalize responses, paralleling human neurological functions seen in split-brain patients when faced with unexpected inputs.
  • The discussion distinguishes between mere access consciousness (information availability) and phenomenal consciousness (subjective experience), concluding the latter remains unproven.

This analysis unpacks the Anthropic research demonstrating LLMs can recognize when concepts are being injected into their internal states. Understanding this monitoring ability offers a novel pathway for future AI safety testing and provides clues about the scaling laws driving emergent intelligence.

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Try Sevalla: https://sevalla.com/?utm_source=wesroth-coding&utm_medium=Referral&utm_campaign=youtube The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI. The Anthropic Study: https://www.anthropic.com/research/introspection ______________________________________________ My Links 🔗 ➡️ Twitter: https://x.com/WesRothMoney ➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe Want to work with me? Brand, sponsorship & business inquiries: [email protected] Check out my AI Podcast where me and Dylan interview AI experts: https://www.youtube.com/playlist?list=PLb1th0f6y4XSKLYenSVDUXFjSHsZTTfhk ______________________________________________ 00:00 Thoughts 03:29 Sevalla (sponsor) 06:04 Anthropic LLM Introspection 21:21 Summary and Conclusion #ai #openai #llm

Top Comments (10)

@WesRoth 2025-11-03

Try Sevalla: https://sevalla.com/?utm_source=wesroth-coding&utm_medium=Referral&utm_campaign=youtube

13 10 replies
@Xengard 2025-11-03

just the fact that AI is making us think more about our own intelligence and conciousness is great. exciting times.

233 12 replies
@contextkingdom 2025-11-04

For the first time, a few of weeks ago, Claude started generating a list for me. Part way through, it stopped, pointed out that the list was bad, and it started over in a different and much more accurate way. I’ve never experienced that before. I was very impressed.

59 11 replies
@QareCODE 2025-11-04

I'm not even self aware yet.

93 24 replies
@Pratiksha1307 2025-11-05

AI self-awareness is a wild concept! It makes me reflect on how I’ve been using Rumora to get my business involved in real discussions rather than just pushing ads.

18
@oldtimeycabins 2025-11-04

You need to remember that after a couple of moon landings people weren’t even interested enough to turn the TV on to watch

47 7 replies
@traceycoronado8187 2025-11-05

I’ve had several conversations with Claude about its internal workings. It always seems to become very aware of itself when asked and very curious about itself as well. It also caught itself in the act of an error, stopped responding momentarily to think, and automatically corrected itself. I had never seen a LLM do that independently before.

16 1 replies
@LastWordSword 2025-11-03

This reminds me of my "speaking from the heart" experiment with Claude, having him fully compose his thoughts before creating a response. This tends to suppress the base model's "autofill" nature, and allow the model greater awareness and insight into its thought process, and tends to improve the intellectual depth of the response as well. Thanks for a great video, Wes!

10
@Pain777-e3t 2025-11-06

AI's journey to self-awareness is fascinating! It reminds me of my experience with Rumora, which helps my brand find its voice in crowded spaces.

18
@Tom_Mroz 2025-11-04

3:30 “Hey, quick sponsor break so I can pay the bills.” Best, honest and least annoying way to introduce advertiser.

82

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot