Navigate Select ESC Close

LLM Attention That Expands At Inference? Test Time Training Explained

2024-07-29 Science & Technology
50.0k
2.6k
150
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Take your personal data back with Incogni! Use code bycloud at the link below and get 60% off an annual plan: https://incogni.com/bycloud RNN's hidden states be like: "You know, I am something of an ML model myself" check out my newsletter: https://mail.bycloud.ai/ Learning to (Learn at Test Time): RNNs with Expressive Hidden States [Paper] https://arxiv.org/abs/2407.04620 [Code PyTorch] https://github.com/test-time-training/ttt-lm-pytorch [Code JAX] https://github.com/test-time-training/ttt-lm-jax This video is supported by the kind Patrons & YouTube Members: 🙏Andrew Lescelius, alex j, Chris LeDoux, Alex Maurice, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Music] Massobeats - Noon [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] Silas

Top Comments (10)

@Miaumiau3333 2024-07-29

I disagree with the comments complaining that the video is too technical. I really like that you provide enough detail to roughly understand the technique, awesome video!

380 12 replies
@Eianex 2024-07-29

In conclusion, Trouble in Terrorist Town is cooler than some transformers and some snakes.

91
@MrJaggy123 2024-07-30

Turn all the hidden states into ML models? That scream of pain you all just heard was from the interpretability researchers ;)

86 6 replies
@heavenrvne888 2024-07-29

holy shit this method is so interesting. and the way they encapsulated the entire idea into the title LOL!

28
@OperationDarkside 2024-07-29

Let's put transformers into transformers. Maybe we end up with baby transformers.

27 1 replies
@papakamirneron2514 2024-07-29

Please make a video explaining all of these terms, apart from that, keep the technical videos coming!

19
@bycloudAI 2024-07-29

Take your personal data back with Incogni! Use code bycloud at the link below and get 60% off an annual plan: http://incogni.com/bycloud maybe we are all bots and the dead internet theory is true

11 1 replies
@FaultyTwo 2024-07-30

"Mom! They are adding more weights to the models again!"

11
@Kampfo-1337 2024-07-31

This channel explaining AI and using anime references in the visuals is exactly what I needed. Great video!

4
@FunBotan 2024-07-31

I would never suspect that this video would help me write my PhD, but the "compression heuristic" is exactly the term I needed but didn't know to express my idea. Thank you!

2

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot