<aside> 💡 My notes from Karpathy’s YouTube tutorial on building GPT. Watched it around January 20, 2024 These notes are meant for me to recall the things I want to make sure to recall

</aside>

Google Labs Colab for the lecture!

Intro

<aside> 💡 My own note: one of the biggest differentiators between OpenAI’s system and a GPT that I build on my laptop is probably the data the model is trained on. Elon Musk talks about this in Lex Fridman’s podcast. He talks about how the model’s codebase is actually quite simple, and the heavy lift is filtering through all of the noise on the internet to get good data to train the model on

</aside>

Exploring the data