(Day 55) Learning about tokenization in LLMs
February 25, 2024
Hello :) Today is Day 55!
A quick summary of today:
- continuing the Andrej Karpathy streak from the last few days, I watched his latest video, on tokenization in LLMs
- I ran yesterday’s MLP code (with the manual backprop) on a Pokémon name dataset and posted it on Kaggle for easier access