High Entropy & KL Divergence Token Masking for SFT
Top Comments (6)
Truly love your content. While fine-tuning my agent's UX/UI, I am incorporating a lot of your content, and my agent is becoming more natural and easier to deal with. I'm still having some memory issues and issues with utilization of all systems. Some of it is constantly degraded, and I'm unsure how to tighten it all up... still working. And thank you.
Great stuff. I often spend a lot of time trying to learn from the images, but when the AI-generated images show "improvised" formulas that are simply wrong, with labels that are simply wrong, it makes that harder. It takes some extra work, but making sure those details are correct is important.
It seems the benchmarks are not measuring the behavior the model is actually optimizing for. If the objective is to prevent catastrophic forgetting, the evaluation should include tasks that are specifically designed to cause or known to cause catastrophic forgetting; otherwise, the benchmark may fail to capture whether the objective is being achieved.
This would have been nice to have 3 years ago.
Welcome to LoRA
❤ They definitely shouldn't use that as the foundation of their models, but rather as a form of expression and a way of integrating expressions for the models; beyond that, it doesn't seem like a useful parameter.