Learn + Blog every day in 2024

Math, Stats, AI, Data Eng, MLOps!

Home Projects 2025 About

(Day 72) Carnegie Mellon University - Advanced NLP Spring 2024 - assignment 1

March 13, 2024

Hello :) Today is Day 72!

A quick summary of today:

  • Today I found 11-711 Advanced NLP by CMU
Read More

(Day 71) Backprop, GELU, Tricking ChatGPT, and Stealing part of an LLM

March 12, 2024

Hello :) Today is Day 71!

A quick summary of today:

  • Finished up the manual backprop code to make it more clear
  • Read some research papers
  • Gaussian Error Linear Units (GELUs)
  • Using Hallucinations to Bypass RLHF Filters
  • Stealing Part of a Production Language Model
Read More

(Day 70) Testing my backprop knowledge

March 11, 2024

Hello :) Today is Day 70!

Read More

(Day 69) Training an LLM to generate Harry Potter text

March 10, 2024

Hello :) Today is Day 69!

A quick summary of today:

  • tried to build upon the built LLM from the book from the days before, and write a training loop in the hopes of generating some Harry Potter text (kaggle notebook)
Read More

(Day 68) Build a LLM from scratch chapter 4 - making the GPT-2 architecture

March 9, 2024

Hello :) Today is Day 68!

A quick summary of today:

  • Covered chapter 4 of Build a LLM from scratch by Sebastian Raschka
Read More

(Day 67) Build a LLM from scratch chapter 3 - self-attention from scratch

March 8, 2024

Hello :) Today is Day 67!

A quick summary of today:

  • Covered chapter 3 of Build a LLM from scratch by Sebastian Raschka
Read More
« Prev 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 Next »