I built an AI Supercomputer... again (2TB RAM)
Enabling High-Speed Local AI Clustering on Mac with Apple's RDMA Update
Discover how Apple's software update, which enables low-latency RDMA over Thunderbolt 5, transformed local AI clustering on Mac, lifting inference on large models from 5 tokens/sec to over 16 tokens/sec.
Short Summary
- Apple introduced RDMA (Remote Direct Memory Access) over Thunderbolt 5 via a software update (macOS Tahoe 26.2).
- This feature cut inter-device latency from 300 microseconds to 3 microseconds, a 100x reduction.
- The lower latency made tensor parallelism practical, so clustering large models across machines is now fast and viable.
- Testing showed that clustering now speeds up inference instead of slowing it down (e.g., Llama 3.3 70B went from 5 to 16 tokens/sec).
This breakdown covers the construction of a $50,000 local AI cluster built from four high-spec Mac Studios. The core question is whether clustering these machines makes sense for AI workloads, especially after an earlier attempt failed because of networking bottlenecks. The key takeaway is that Apple solved the interconnect latency problem with RDMA, which fundamentally changes the equation for distributed local AI processing.
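To see why the drop from 300 to 3 microseconds matters so much for tensor parallelism, here is a rough back-of-envelope sketch. It is not taken from the video: the layer count (80) and the two synchronization points per layer are assumptions chosen for illustration, and the function name is hypothetical. It counts only link latency and ignores bandwidth and compute time.

```python
# Back-of-envelope estimate of the latency-only communication cost per
# generated token when a model is split with tensor parallelism.
# ASSUMPTIONS (not from the source): 80 transformer layers and 2
# synchronization points per layer, each costing one link latency.

def sync_overhead_ms(layers: int, syncs_per_layer: int, latency_us: float) -> float:
    """Return the per-token synchronization overhead in milliseconds."""
    return layers * syncs_per_layer * latency_us / 1000.0

LAYERS = 80           # assumed layer count
SYNCS_PER_LAYER = 2   # assumed sync points per layer

# Latencies quoted in the summary: 300 microseconds before, 3 after.
before = sync_overhead_ms(LAYERS, SYNCS_PER_LAYER, 300.0)
after = sync_overhead_ms(LAYERS, SYNCS_PER_LAYER, 3.0)

# Throughput figures quoted in the summary: 5 -> 16 tokens/sec.
observed_speedup = 16 / 5

print(f"sync overhead before RDMA: {before:.2f} ms per token")
print(f"sync overhead after RDMA:  {after:.2f} ms per token")
print(f"observed throughput gain:  {observed_speedup:.1f}x")
```

Under these assumptions, synchronization alone would cost tens of milliseconds per token at the old latency and well under a millisecond at the new one, which is consistent with tensor parallelism only becoming worthwhile after the update.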
Top Comments (10)
So many YouTubers got sent 4 Mac’s this week
This guy is the reason why we're in a RAM shortage lol
Chuck's Wife: "Stop hugging your computers, and come to dinner!"
He has RAM get him!!!
That's at least 5 cents extra on my future RAM purchase.
Show generative audio, images and video to see the efficiency of the cluster.
Dude is going on a RAMpage
Hey…just try Twingate….you'll never look at VPN the same: https://ntck.co/twingate-networkchuck
I built another AI supercomputer with 4 Mac Studios... but this time it actually works. Earlier this year, I clustered 5 Mac Studios and it was 91% SLOWER. Everyone said clustering was stupid. But Apple just dropped a software update that changes everything - RDMA over Thunderbolt 5. Latency dropped from 300 microseconds to 3 microseconds. Now we're running trillion-parameter models locally at speeds that actually make sense.
🔥🔥Join the NetworkChuck Academy!: https://ntck.co/NCAcademy
RESOURCES / LINKS:
- Docs/walkthrough: https://github.com/theNetworkChuck/mac-studio-cluster
- Exo Labs: https://github.com/exo-explore/exo
- MLX (Apple's ML Framework): https://github.com/ml-explore/mlx
- My First Cluster Video (the failure): https://youtu.be/Ju0ndy2kwlw
- RDMA Networking Explained: https://youtu.be/fb69FyW2KLk
The end was really kind. Thank you
If he gets hold of one more ram stick Im gonna litteraly crash out