Navigate Select ESC Close

How to Run LLMs Locally - Full Guide

2025-12-19 Education
67.4k
1.8k
55
Tech With Tim
Tech With Tim
2.0m subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Click this link https://boot.dev/?promo=TECHWITHTIM and use my code TECHWITHTIM to get 25% off your first payment for boot.dev. If you're not running LLMs locally, then you're missing out. ChatGPT and other hosted solutions are great, but if you care about speed, privacy and cost, then you'll want to learn how to run them on your own machine. In this video, I'll show you two methods of running LLMs locally from a developer perspective. Want to make real money with coding? I share high-signal insights on careers, monetization, and leverage in my free newsletter. Join here and get my guide How to Make Money With Coding instantly: https://techwithtim.net/newsletter 🎞 Video Resources 🎞 Download Ollama: https://ollama.com/download Ollama Library: https://ollama.com/library Ollama GitHub: https://github.com/ollama/ollama Docker Model Runner Full Video: https://www.youtube.com/watch?v=GOgfQxDPaDw&t=24 ⏳ Timestamps ⏳ 00:00 | Overview 00:38 | Method 1 - Ollama 04:29 | Ollama from Code 08:27 | Method 2 - Docker Model Runner 12:32 | Docker Model Runner from Code Hashtags #Ollama #Docker #LLM UAE Media License Number: 3635141

Top Comments (10)

@TechWithTim 2025-12-19

Click this link https://boot.dev/?promo=TECHWITHTIM and use my code TECHWITHTIM to get 25% off your first payment for boot.dev.

4 2 replies
@to_var_dev 2026-02-11

port 11434 spells LLAMA 🤯

32
@yadavallisaijagadeesh5763 2025-12-20

Hey Tim, one interesting option i found lately is using azure foundry local to optimize the run for underlying architecture!!

4
@Lunolux 2025-12-19

nice video. i like using "open web ui" docker to have a web view to use for ollama model

9 1 replies
@donsolokhalifa6828 2025-12-19

I see that the docker model runner is using llama.cpp under the hood. llama.cpp really gives optimized inference for models. I remember trying to migrate from openai apis to local models and i would code the inference pipeline myself, ran very bad😂, not really a good programmer. But with llama.cpp... chef's kiss

9
@JohnMitchellCalif 2025-12-19

"capital of Canada is Paris, the 'City of Lights'" -- haha

27 1 replies
@sheon2 2026-01-24

clearest video I've found, thank you!

3 1 replies
@NeverCodeAlone 2025-12-19

Very good video. Thx a lot.

1
@Master_of_Chess_Shorts 2025-12-22

Thanks for the examples

1
@nsantanu 2025-12-19

Excellent video 👍

3

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot