Schedule
-
EventDateDescriptionDescription
-
Session05/05/2026 13:00
TuesdayFirst Lecture -
Lecture05/05/2026
TuesdayLecture 0: Course Overview and LogisticsLecture Notes:
-
Lecture05/05/2026
TuesdayLecture 1: Language ModelingLecture Notes:
Further Reads:
- Tokenization: Chapter 2 of [JM]
- Embedding: Chapter 6 of [JM]
- Original BPE Algorithm: Original BPE Algorithm proposed by Philip Gage in 1994
- BPE for Tokenization: Paper Neural machine translation of rare words with subword units by Rico Sennrich, Barry Haddow, and Alexandra Birch presented in ACL 2016 that adapted BPE for NLP
- LMs: Chapter 12 of [BB] Section 12.2
- N-Gram LMs: Chapter 3 of Speech and Language Processing; Section 3.1 on N-gram LM
- Maximum Likelihood: Chapter 2 of [BB] Sections 12.1 – 12.3
- Recurrent LMs: Chapter 8 of [JM]
- LSTM LMs: Paper Regularizing and Optimizing LSTM Language Models by Stephen Merity, Nitish Shirish Keskar, and Richard Socher published in ICLR 2018 enabling LSTMs to perform strongly on word-level language modeling
- High-Rank Recurrent LMs: Paper Breaking the Softmax Bottleneck: A High-Rank RNN Language Model by Zhilin Yang, Zihang Dai, Ruslan Salakhutdinov, and William W. Cohen presented at ICLR 2018 proposing Mixture of Softmaxes (MoS) and achieving state-of-the-art results at the time
Overall Course Calendar
| Week | Topic | Assignment | Project | Exam | Submission |
|---|---|---|---|---|---|
| 1 | Language Modeling | ||||
| 2 | LLMs | Assgn 1 | |||
| 3 | Fundamentals of Generative Learning | ||||
| 4 | Guest Lecture | Assgn 1 | |||
| 5 | Autoregressive Models | Assgn 2 | |||
| 6 | Energy-based Models | Proposal | Proposal | ||
| 7 | Normalizing Flow | Exam 1 | |||
| 8 | Generative Adversarial Networks | Assgn 3 | Assgn 2 | ||
| 9 | Holiday | ||||
| 10 | Variational Inference | Assgn 4 | Assgn 3 | ||
| 11 | VAEs | ||||
| 12 | Score-based Diffusion | Assgn 5 | Assgn 4 | ||
| 13 | DPMs | Exam 2 | |||
| 14 | Multimodality and Conditioning | Assgn 5 | |||
| 15 | Final Lecture - Reserved | Presentation | Presentation | ||
| 16 | No Lecture - Reserved | Code and Paper | Code and Paper |
Tutorial Schedule
| Date | Topic | Tutorial |
|---|---|---|
| May 14 | PyTorch Overview -- Tokenization and Embedding | Amir Hossein |
| May 21 | Transformer-based Language Models | Amir Hossein |
| May 28 | Generative vs Discriminative Learning | Mohammadreza |
| June 4 | Autoregressive Models | Mohammadreza |
| June 11 | Energy-based Models | Amir Hossein |
| June 18 | Midterm | |
| June 25 | Normalizing Flow | Mohammadreza |
| July 2 | GAN | Amir Hossein |
| July 9 | Sample Project | |
| July 16 | VAE and Q-VAE | Amir Hossein |
| July 23 | Score-based Diffusion | Mohammadreza |
| July 30 | Midterm | |
| August 6 | DDPM and DDIM | Mohammadreza |
