How does AI actually work? Transformers explained
How GPT and other large language models (LLMs) work. Transformers deep dive. #ai #llm #machinelearning #datascience #agi
Thanks to our sponsor Genspark. Try it for free: https://bit.ly/4uM3PLS
Paper: Attention Is All You Need https://arxiv.org/html/1706.03762v7
0:00 Intro
0:33 The transformer model
1:30 Predicting the next word
2:30 Tokenization
5:06 Representing meaning
7:17 Positional encoding
9:17 Attention head
14:49 Genspark
16:35 Multiple heads
19:30 Add and norm
21:45 Feed forward neural net
24:08 Multiple decoder blocks
24:50 Final layer
27:03 Training the model
Newsletter: https://aisearch.substack.com/
Find AI tools & jobs: https://ai-search.io/
Support: https://ko-fi.com/aisearch
Here’s my equipment, in case you’re wondering:
Lenovo Thinkbook: https://amzn.to/4jWeKwH
Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0
GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS
Mic: Shure SM7B https://amzn.to/3DErjt1
Audio interface: Scarlett Solo https://amzn.to/3qELMeu



