
Is Signal Processing The CURE For AI's ADHD?

2024-11-04 Science & Technology
25.9k
1.6k
89
bycloud
225.0k subscribers

Description

Check out HubSpot's Free ChatGPT Bundle! https://clickhubspot.com/jgv5

In this video, I will be covering the latest and the hottest paper called Differential Transformer. Will also be covering some basics about self-attention, grouped query attention, and multi-head latent attention.

check out my newsletter: https://mail.bycloud.ai/

Attention Is All You Need
[Paper] https://arxiv.org/abs/1706.03762

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
[Paper] https://arxiv.org/abs/2305.13245

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
[Paper] https://arxiv.org/abs/2405.04434

Differential Transformer
[Paper] https://arxiv.org/abs/2410.05258

Flash Attention
[Paper] https://arxiv.org/abs/2205.14135

This video is supported by the kind Patrons & YouTube Members:
🙏Andrew Lescelius, Ben Shaener, Chris LeDoux, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Marcelo Ferreira, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz, jiye, Anushka

[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Music] massobeats - glimmer
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Askejm

Top Comments (10)

@commonwombat-h6r 2024-11-04

btw how is Mamba doing? There was a ton of hype surrounding her back in the day

133 8 replies
@er-lau 2024-11-04

0:11 I was so sure the truck was gonna transform into Optimus Prime

76 1 replies
@commonwombat-h6r 2024-11-04

Delightful memes! Man, where do you find the energy to post so many supreme-quality videos so often? My deepest respect.

28 1 replies
@luke.perkin.online 2024-11-05

It's just a hunch, but the biggest unrealised gains are at the token embedding stage. If we can formalise language by pre-processing whole words into semantic word senses (on a multilingual / language-independent word sense knowledge graph like BabelNet) using contextual and linguistic analysis, then each token will actually "mean something" and the sentence as a whole will become "formal language" like code instead of natural language. It's basically insane that we cut words up and assume the parts have meaning. It's a miracle that it works at all: even if one token encodes "duck" as both a noun and a verb, the two senses are not at all related. So much attention is wasted untangling composite embeddings.

16 9 replies
@nilaier 2024-11-04

Plot twist: The truck was Optimus Prime The Transformer.

16
@natsirhasan2288 2024-11-04

You caught me off guard with that locked in meme

11
@pareak 2024-11-04

Thanks for covering that paper. I was amazed when I read it and really wanna see someone implementing it into an open model that can be tested by the community. Would be great for a small 3B model on your phone.

9
@kristhianaguilar4774 2024-11-04

That intro tho? Absolute cinema

4
@JorgetePanete 2024-11-05

8:51 either "62.2% of its parameters" or "62.2% less parameters"

2
@bycloudAI 2024-11-01

Check out HubSpot's Free ChatGPT Bundle! https://clickhubspot.com/jgv5
maybe I'll also cover ring attention and tree attention next time

1
