Intro to NLP Part 3
Status: Past
Recording available below
This third part of the NLP Basics series focuses on the mathematical concepts behind attention models and briefly introduces what BERT is and how it works.
- Multihead Attention
- BERT: Bidirectional Encoder Representations from Transformers
NLP_Basics_Part_3.pdf (493.4 KB)