Navigate Select ESC Close

Unsloth Studio is insane… fine-tune any AI model locally

2026-05-28 Education
5.6k
481
17
David Ondrej
David Ondrej
386.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Unsloth studio: https://unsloth.ai/docs/new/studio Get everything from the video: https://www.davidondrej.com/local-ai-finetuning-starter-kit Wanna learn how to code with AI? Go here: https://www.skool.com/new-society We're hiring: https://www.scalesoftware.ai/ Follow me on Instagram - https://www.instagram.com/davidondrej1/ Follow me on Twitter - https://x.com/DavidOndrej1 Open Source models: https://artificialanalysis.ai/models/open-source Subscribe if you're serious about AI. Fine-tuning beginners guide 101 super duper easy

Top Comments (10)

@DavidOndrej 2026-05-28

Unsloth studio: https://unsloth.ai/docs/new/studio Get everything from the video: https://www.davidondrej.com/local-ai-finetuning-starter-kit

13 1 replies
@itziklerner 2026-05-29

The video is not complete , I didn't see how to use the fine-tuned model , the delivery of the entire workflow should be a new weights file (or files) , that you can later use in any LLM server , like vLLM and alike

29 5 replies
@jasonthomas9916 2026-05-29

Great video! My one queston, is whether unsloth studio can create a combined dataset training bundle that can handle personalisation at the same time as PDF fine-tuning ? ie., I have dialogue transcripts of the Simpsons + NVidia financials PDF. I want the dataset recipe to combine dialogue + PDF by the deepseek-v4 distillation. Ie., ultimately, I want to fine-tune Qwen 3.6 model with NVidia financial year-ends, to respond in the personality of Krusty the Clown ? (fake example of course)

1
@bugtest3849 2026-05-31

I want to know how use lora in agents ai because full fine-tune takes alot of time , can you please make video how to use lora adaptor in agent ai

1
@張允達 2026-05-31

Great tutorial to get started, but it feels like it only scratches the surface of the real issues we face during fine-tuning. I'm currently running into severe model collapse (mode collapse/repetition) and overfitting, which weren't covered here. In my recent runs, no matter how much I aggressively tune down the learning rate, epochs, or rank, the model either underfits completely or collapses into a degenerative state (e.g., repeating the same outputs or ignoring the prompt context entirely). It feels like it's just finding a shortcut to minimize loss by memorizing formats rather than learning the underlying distribution. Is anyone else experiencing this exact same degradation? And I am now working on using dora.

0
@fhsa7239 2026-05-30

Can this be done for voice agent? Have you ever created one as an outbound agent, or any way you can show that in a video?

1
@JimMcColl-m1u 2026-05-29

Dude... You rock.

1
@axietomars4367 2026-05-31

What if you trained your LLM, then the same LLM that you trained got a huge upgrade from the source? You will you be, dame what i have now is obsolete and i made efforth to train it? Well i am just starting to learn, would this be possible? What should be done?

1
@aadilferoze3621 2026-06-01

Great Video - Thank You

0
@AlbinoCordeiroJunior 2026-05-30

Thank you for this one!

0

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot