Stop Getting Blocked: My "Escalating" Web Scraper Skill

2025-12-10 Science & Technology
175
17
0
Unsupervised Learning
673.0k subscribers

Description

Access real-time web data with Bright Data’s secure and scalable proxy solutions. https://ul.live/brightdata

In this video, I break down my custom four-tier web scraping workflow that automatically escalates from basic WebFetch to Bright Data's proxy network to successfully browse and extract data from any public website.

Blog Post: https://danielmiessler.com/blog/progressive-web-scraping-four-tier-system
Skill: https://github.com/danielmiessler/Personal_AI_Infrastructure/tree/main/.claude/skills/BrightData

🎬 RELATED AI VIDEOS:
• Building Your Own Unified AI Assistant Using Claude Code - https://youtu.be/iKwRWwabkEc
• How My Projects Fit Together - https://www.youtube.com/watch?v=5x4s2d3YWak
• The Future of Hacking is Context - https://www.youtube.com/watch?v=UwTTcka1Wd8
• Introducing Substrate - https://www.youtube.com/watch?v=ky7ejowc_qY
• The 4 AAAAs of the AI ECOSYSTEM - https://www.youtube.com/watch?v=Pf6ydbCgU3U
• Is OpenCode as Smart as Claude Code? - https://www.youtube.com/watch?v=yTylDxgyJZ8
• Claude Code + Neovim via Ghostty Panes - https://www.youtube.com/watch?v=ysVmQ6mesWE

🤖 OTHER TECH DISCUSSED:
• Claude Code: https://claude.ai/code
• Fabric AI Framework: https://github.com/danielmiessler/fabric
• PAI: https://github.com/danielmiessler/Personal_AI_Infrastructure

✍🏼 RELATED BLOGS:
• Progressive Web Scraping with a Four-Tier Fallback System - https://danielmiessler.com/blog/progressive-web-scraping-four-tier-system
• Building a Personal AI Infrastructure (PAI) - https://danielmiessler.com/blog/personal-ai-infrastructure
• MCPs Are Just Other People's Prompts and APIs - https://danielmiessler.com/blog/mcps-are-just-other-peoples-prompts-and-apis
• One-Click MCP Servers on Cloudflare - https://danielmiessler.com/blog/one-click-mcp-servers-cloudflare
• Launching Daemon: Personal API - https://danielmiessler.com/blog/launching-daemon-personal-api
• OpenCode vs Claude Code - https://danielmiessler.com/blog/opencode-vs-claude-code
• Claude Code + Neovim via Ghostty Integration - https://danielmiessler.com/blog/claude-code-neovim-ghostty-integration
• Web Scraping with BrightData + Claude Code - https://danielmiessler.com/blog/webscraping-with-brightdata-claude-code
• Why Marcus Is Wrong About AI - https://danielmiessler.com/blog/why-marcus-is-wrong-about-ai
• Why Dwarkesh Is Wrong About AGI - https://danielmiessler.com/blog/why-dwarkesh-is-wrong-about-agi
• AI Is Becoming Like Reading - https://danielmiessler.com/blog/ai-becoming-reading
• We Have Enough AI for AGI - https://danielmiessler.com/blog/we-have-enough-ai-for-agi
• The Worst AI Metric - https://danielmiessler.com/blog/the-worst-ai-metric
• AI Workforce Volume-Difficulty Curve - https://danielmiessler.com/blog/ai-workforce-volume-difficulty-curve
• Survive AI: Become Creators - https://danielmiessler.com/blog/survive-ai-become-creators
• The Great Bifurcation - https://danielmiessler.com/blog/great-bifurcation
• I'm Worried It Might Get Bad - https://danielmiessler.com/blog/im-worried-it-might-get-bad
• AI Model Ecosystem: 4 Components - https://danielmiessler.com/blog/ai-model-ecosystem-4-components
• Using the Smartest AI to Rate Other AI - https://danielmiessler.com/blog/using-the-smartest-ai-to-rate-other-ai

🔗 CONNECT:
X: @danielmiessler
Website: https://danielmiessler.com
GitHub: https://github.com/danielmiessler/

I'm building sick shit over here, and I've only just begun. Let's build together. Subscribe to the channel here and catch the newsletter below. See you in the next one!

📧 NEWSLETTER: https://danielmiessler.com/subscribe
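
The escalation idea in the description can be pictured with a rough Python sketch. This is not the actual skill: the description only names the first tier (basic WebFetch) and the last (Bright Data's proxy network), so the `Tier` class, the `escalate` function, the `FetchBlocked` exception, and the two stub fetchers below are all hypothetical stand-ins. The stubs never touch the network; they only simulate a block followed by a success.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


class FetchBlocked(Exception):
    """Hypothetical signal that a tier was refused or blocked by the site."""


@dataclass
class Tier:
    name: str
    fetch: Callable[[str], str]  # takes a URL, returns the page text


def escalate(url: str, tiers: List[Tier]) -> Tuple[str, str]:
    """Try tiers in order; return (tier_name, content) from the first that works."""
    errors = []
    for tier in tiers:
        try:
            content = tier.fetch(url)
        except FetchBlocked as exc:
            # This tier was blocked, so fall through to the next one.
            errors.append(f"{tier.name}: {exc}")
            continue
        if content.strip():
            return tier.name, content
        # An empty body is treated as a failure and also escalates.
        errors.append(f"{tier.name}: empty response")
    raise RuntimeError("all tiers failed: " + "; ".join(errors))


# Stub fetchers (assumptions for illustration, no real HTTP calls).
def basic_fetch(url: str) -> str:
    raise FetchBlocked("403 Forbidden")


def proxy_fetch(url: str) -> str:
    return f"<html>content of {url}</html>"


tiers = [Tier("basic", basic_fetch), Tier("proxy", proxy_fetch)]
tier_name, html = escalate("https://example.com", tiers)
print(f"served by tier: {tier_name}")
```

Ordering the list from cheapest to most capable means the paid proxy tier is only reached when the simpler tiers fail, which is the point of escalating rather than always starting at the top.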

Top Comments (9)

@LaurenceGold 2025-12-31

The Skills link is not working -

4
@nickthompson2052 2025-12-10

Where the heck are they getting 150m residential public IPs?

3 2 replies
@oopsec 2025-12-11

Great minds think alike. My research tool stack has fallback chains with brightdata for unlocking too. There are also three types of workflow: direct, exploratory and synthesis. Direct is for simple lookups. Exploratory takes an aggregator's output (like perplexica) and rates the citations as N URLs that would fill the specific knowledge gap(s) relative to the query, which my 'triple stack' (jina, exa, ref) MCPs then investigate further as appropriate (by looking up the relevant cited reference URLs rated worthy by the algo). The triple stack can cover all types of query, whether general knowledge, code examples, docs, arXiv papers, etc. It's actually crazy how juiced up this stack is. Synthesis flow is where the aggregator comes in at the end to synthesize what the triple stack found in parallel. Which flow's skill is triggered depends on the nature of the query.

2
@cpgarrison 2025-12-10

Thank you sir!

2
@puremajik 2026-01-02

Please fix skills link

2
@whenwegrewup-t8r 2026-01-11

This skill was taken down from github?

1
@VulnerableU 2025-12-11

This is literally a superpower.

0
@MrMarco7ify 2026-01-06

I tried making an app for my Amazon wishlist, but the AI wasn’t able to get the names and prices of items due to Amazon blocking it. Does this fix that issue?

0
@BitOfJustin 2026-03-02

Love these videos highlighting specific skills. Would love to see more of these!

0
