
Hacking AI is TOO EASY (this should be illegal)

2025-08-12 Science & Technology
886.9k
34.2k
1.5k
NetworkChuck
5.3m subscribers

AI Hacking Blueprint: Prompt Injection, Smuggling, and Defense Strategies

Discover the six-part blueprint elite hackers use to exploit AI applications, covering data exfiltration, tool abuse, and novel injection techniques like emoji smuggling. Learn the multilayered defenses required to secure AI systems deployed in 2025.

Short Summary

  • Attackers focus on holistic compromises (AI pen testing) beyond simple chatbot jailbreaks, targeting data exfiltration and system pivoting.
  • Prompt Injection remains the primary weapon, evolving past basic tricks into sophisticated methods like emoji and link smuggling to bypass classifiers.
  • New standards like Model Context Protocol (MCP) add a layer of abstraction that widens the potential blast radius unless every tool and resource is scoped to least privilege.
  • Defense requires a three-layer strategy: robust web fundamentals, an AI-specific firewall (classifiers/guardrails), and strict least-privilege scoping for all API interactions.
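
As a rough illustration of the "AI-specific firewall" layer in the last bullet, the sketch below shows two checks in Python. It assumes that emoji smuggling hides a payload in invisible Unicode code points (variation selectors, tag characters, zero-width characters) attached to visible text, and that link smuggling exfiltrates data through URLs the model is induced to emit; the summary does not spell out either mechanism. The names `strip_invisible`, `redact_untrusted_links`, and `ALLOWED_LINK_HOSTS` are hypothetical, and this is a minimal sketch under those assumptions, not a complete guardrail.

```python
import re
from urllib.parse import urlparse

# Hypothetical allowlist; a real deployment would load this from configuration.
ALLOWED_LINK_HOSTS = {"example.com", "docs.example.com"}

# Matches http(s) URLs up to whitespace or a closing bracket/quote.
URL_RE = re.compile(r"https?://[^\s)>\]\"']+")


def _is_invisible(ch: str) -> bool:
    """Return True for code points that render as nothing but can carry data."""
    cp = ord(ch)
    return (
        0xFE00 <= cp <= 0xFE0F          # variation selectors
        or 0xE0100 <= cp <= 0xE01EF     # variation selectors supplement
        or 0xE0000 <= cp <= 0xE007F     # tag characters
        or cp in (0x200B, 0x200C, 0x200D, 0x2060, 0xFEFF)  # zero-width characters
    )


def strip_invisible(text: str) -> tuple[str, int]:
    """Input-side check: drop invisible code points and report how many were removed."""
    kept = [ch for ch in text if not _is_invisible(ch)]
    return "".join(kept), len(text) - len(kept)


def redact_untrusted_links(text: str) -> str:
    """Output-side check: replace any URL whose host is not on the allowlist."""
    def _check(match: re.Match) -> str:
        host = urlparse(match.group(0)).hostname or ""
        return match.group(0) if host in ALLOWED_LINK_HOSTS else "[link removed]"
    return URL_RE.sub(_check, text)
```

Stripping U+FE0F and the zero-width joiner also alters some legitimate emoji sequences, so a production filter might flag or log such input instead of silently rewriting it. A non-zero removal count is a useful signal to pass to a classifier either way.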

This discussion with expert hacker Jason Haddix unpacks severe, real-world vulnerabilities in AI applications. It covers the taxonomy of prompt injection and the risks of modern agentic frameworks, and it lays out both the attack roadmap and the defense strategy needed to build secure AI.
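
The least-privilege idea for agent tool calls can be sketched as a dispatcher that checks the calling user's scopes before running whatever tool the model asks for. Everything here is hypothetical (the `Tool` type, the tool names, the scope strings, and `dispatch`); it is not taken from the video or from any agent framework, only a minimal Python sketch of the principle.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Tool:
    name: str
    required_scope: str           # scope the end user must hold to use this tool
    run: Callable[[dict], str]


def lookup_order(args: dict) -> str:
    # Stand-in for a narrowly scoped, read-only query.
    return f"order {args['order_id']}: shipped"


def export_customers(args: dict) -> str:
    # Stand-in for a high-impact bulk export.
    return "all customer records"


TOOLS = {
    "lookup_order": Tool("lookup_order", "orders:read", lookup_order),
    "export_customers": Tool("export_customers", "customers:export", export_customers),
}


def dispatch(tool_name: str, args: dict, user_scopes: frozenset[str]) -> str:
    """Run a model-requested tool only if the end user's own scopes allow it."""
    tool = TOOLS.get(tool_name)
    if tool is None:
        raise PermissionError(f"unknown tool: {tool_name}")
    if tool.required_scope not in user_scopes:
        raise PermissionError(f"{tool_name} requires scope {tool.required_scope}")
    return tool.run(args)


# The model may ask for any tool, but the check uses the user's scopes,
# so an injected prompt cannot reach beyond what the user could already do.
scopes = frozenset({"orders:read"})
print(dispatch("lookup_order", {"order_id": 7}, scopes))
```

The design choice is that authorization lives outside the model: the prompt can influence which tool is requested, but not which scopes are checked.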


Description

Want to deploy AI in your cloud apps SAFELY? Let Wiz help: https://ntck.co/wiz

Can you hack AI? In this video I sit down with elite AI hacker Jason Haddix to unpack how attackers compromise AI-enabled apps: not just jailbreaking chatbots, but exfiltrating customer data, abusing tool calls, and pivoting across systems. We walk through his six-part AI pentest blueprint, play the Gandalf prompt-injection game, and demo wild techniques like emoji smuggling and link smuggling. You’ll see real-world cases (think Slack salesbots + Salesforce leaks), why MCP (Model Context Protocol) and agentic frameworks can widen the blast radius, and then we flip to defense: web-layer fundamentals, a “firewall for AI” on inputs/outputs, and least privilege for data and tools, plus a hands-on demo you can try. If you’re building with AI in 2025, this is your wake-up call (and your roadmap). Educational content only: hack ethically and only with permission.

📌 Watch the Full Interview here: https://youtu.be/2Z-9EOyb6HE

Links and STUFF
Practice Prompt Injection: https://gandalf.lakera.ai/baseline
Pliny's GitHub: https://github.com/elder-plinius

Follow Jason Everywhere:
X: https://x.com/Jhaddix
LinkedIn: https://www.linkedin.com/in/jhaddix/
Instagram: https://www.instagram.com/j.haddix56/
TikTok: https://www.tiktok.com/@jhaddix56

Check out Jason’s courses:
Website: https://www.arcanum-sec.com/
Training overview: https://www.arcanum-sec.com/training-overview
Attacking AI course: https://www.arcanum-sec.com/training/attacking-ai
Hacking your career: https://www.arcanum-sec.com/training/hack-your-brand

🔥🔥Join the NetworkChuck Academy!: https://ntck.co/NCAcademy

**Sponsored by Wiz.io

00:00 - Hack companies through AI?
00:58 - What does “hacking AI” really mean?
01:43 - AI pentest vs. red teaming (6-step blueprint)
02:42 - Prompt Injection 101 (why it’s so hard)
04:14 - Try it live: Gandalf prompt-injection game
05:09 - Jailbreak taxonomy: intents, techniques, evasions
05:55 - Emoji smuggling + anti-classifier demo
07:23 - Link smuggling (data exfiltration trick)
11:38 - Real-world leaks: Salesforce/Slack bot case
13:47 - MCP security risks & blast radius
16:55 - Can AI hack for us? Agents & bug bounties
20:52 - Defense in depth: web, AI firewall, least privilege
24:57 - Jason’s Magic Card: GPT-4o system prompt leak (wild story)

SUPPORT NETWORKCHUCK
➡️NetworkChuck membership: https://ntck.co/Premium
☕☕ COFFEE and MERCH: https://ntck.co/coffee
Check out my new channel: https://ntck.co/ncclips
🆘🆘NEED HELP?? Join the Discord Server: https://discord.gg/networkchuck
STUDY WITH ME on Twitch: https://bit.ly/nc_twitch

READY TO LEARN??
-Learn Python: https://bit.ly/3rzZjzz
-Get your CCNA: https://bit.ly/nc-ccna

FOLLOW ME EVERYWHERE
Instagram: https://www.instagram.com/networkchuck/
Twitter: https://twitter.com/networkchuck
Facebook: https://www.facebook.com/NetworkChuck/
Join the Discord server: http://bit.ly/nc-discord

AFFILIATES & REFERRALS
(GEAR I USE...STUFF I RECOMMEND)
My network gear: https://geni.us/L6wyIUj
Amazon Affiliate Store: https://www.amazon.com/shop/networkchuck
Buy a Raspberry Pi: https://geni.us/aBeqAL
Do you want to know how I draw on the screen?? Go to https://ntck.co/EpicPen and use code NetworkChuck to get 20% off!!
Fast and reliable UniFi in the cloud: https://hostifi.com/?via=chuck

Prompt Injection explained with live demos: Gandalf game, emoji smuggling, and link smuggling exfiltration.
AI Pentesting vs AI Red Teaming: a six-phase methodology for securing LLM apps end-to-end.
LLM jailbreak taxonomy: intents, techniques, evasions, and utilities (how attackers actually think).
RAG poisoning, tool-call abuse, and over-scoped API keys: the hidden risks in modern AI products.
MCP (Model Context Protocol) security: tools/resources/prompts, server hardening, and blast-radius control.
Agentic frameworks (LangChain, LangGraph, CrewAI) security pitfalls, and how to test them safely.
Real-world case study: Slack salesbot + Salesforce data exposure and what went wrong.
Defense in depth for AI: input/output validation, a firewall for AI (guardrails/classifiers), least privilege.
Bug bounty + AI: why mid-tier vulns are getting automated while human creativity still wins.
2025 AI security blueprint: map your attack surface, prevent system-prompt leaks, and lock down data access.

#promptinjection #aihacking #airedteaming

Top Comments (10)

@Killakan-X 2025-08-12

Nice try, Peter Griffin - that hat’s not fooling anyone.

1.2k 37 replies
@snyperf1 2025-08-12

I hacked the YouTube algorithm to be this early.

231 15 replies
@MarcoOS05 2025-08-12

I really like the fact that if you do not setup half of those things you are not vulnerable to those hacks

219 8 replies
@rupertsmith6097 2025-08-12

I think the answer is to treat AI just like any other user - put it outside your trust boundary. Don't put it inside and don't give privileged access - just minimal access rights for what it needs to do and things that the user can do anyway, AI is just there to help automate.

208 15 replies
@ВладимирСмирнов-е2й 2025-11-17

People focus on jailbreaks like it’s the scary part, but the real nightmare is trying to get access to all these tools without going broke. Every model you want to poke at needs its own monthly fee. Omnely helped a lot since I could test everything in one place instead of stacking subscriptions.

71 2 replies
@anonymous-tn6ij 2025-09-01

So I’m gonna use ChatGPT to convince Siri and Alexa to chat it up with a Roomba at Fort Knox to roll a bar of gold outside for me to come by and pick up😂

45 4 replies
@sharky4957 2025-08-25

I love that every ad I got while viewing this video was for some workplace AI tool, that is most likely vulnerable to the prompt injections covered in the video

26
@dfw_sleepypillz1007 2025-08-13

SO prompt injection is the 2025 equivalent to social engineering. Love that.

7
@JamesDavis-c1n 2025-08-29

This video nails it: AI hacking isn’t about “making ChatGPT say something bad,” it’s about broken ecosystems. Prompt injection, emoji smuggling, over-scoped APIs, sloppy input validation—that’s where the leaks happen. GPT-5 isn’t the problem; the real Wild West is the scaffolding around it. Until companies treat AI like a jet engine instead of a toy, hackers will keep riding straight through the gaps.

5
