Schedule

  • Event
    Date
    Description
    Description
  • Session
    05/05/2026 13:00
    Tuesday
    First Lecture
  • Lecture
    05/05/2026
    Tuesday
    Lecture 0: Course Overview and Logistics

    Lecture Notes:

  • Lecture
    05/05/2026
    Tuesday
    Lecture 1: Language Modeling

    Lecture Notes:

    Further Reads:

    • Tokenization: Chapter 2 of [JM]
    • Embedding: Chapter 6 of [JM]
    • Original BPE Algorithm: Original BPE Algorithm proposed by Philip Gage in 1994
    • BPE for Tokenization: Paper Neural machine translation of rare words with subword units by Rico Sennrich, Barry Haddow, and Alexandra Birch presented in ACL 2016 that adapted BPE for NLP
    • LMs: Chapter 12 of [BB] Section 12.2
    • N-Gram LMs: Chapter 3 of Speech and Language Processing; Section 3.1 on N-gram LM
    • Maximum Likelihood: Chapter 2 of [BB] Sections 12.1 – 12.3
    • Recurrent LMs: Chapter 8 of [JM]
    • LSTM LMs: Paper Regularizing and Optimizing LSTM Language Models by Stephen Merity, Nitish Shirish Keskar, and Richard Socher published in ICLR 2018 enabling LSTMs to perform strongly on word-level language modeling
    • High-Rank Recurrent LMs: Paper Breaking the Softmax Bottleneck: A High-Rank RNN Language Model by Zhilin Yang, Zihang Dai, Ruslan Salakhutdinov, and William W. Cohen presented at ICLR 2018 proposing Mixture of Softmaxes (MoS) and achieving state-of-the-art results at the time



Overall Course Calendar

Week Topic Assignment Project Exam Submission
1 Language Modeling
2 LLMs Assgn 1
3 Fundamentals of Generative Learning
4 Guest Lecture Assgn 1
5 Autoregressive Models Assgn 2
6 Energy-based Models Proposal Proposal
7 Normalizing Flow Exam 1
8 Generative Adversarial Networks Assgn 3 Assgn 2
9 Holiday
10 Variational Inference Assgn 4 Assgn 3
11 VAEs
12 Score-based Diffusion Assgn 5 Assgn 4
13 DPMs Exam 2
14 Multimodality and Conditioning Assgn 5
15 Final Lecture - Reserved Presentation Presentation
16 No Lecture - Reserved Code and Paper Code and Paper




Tutorial Schedule

Date Topic Tutorial
May 14 PyTorch Overview -- Tokenization and Embedding Amir Hossein
May 21 Transformer-based Language Models Amir Hossein
May 28 Generative vs Discriminative Learning Mohammadreza
June 4 Autoregressive Models Mohammadreza
June 11 Energy-based Models Amir Hossein
June 18 Midterm
June 25 Normalizing Flow Mohammadreza
July 2 GAN Amir Hossein
July 9 Sample Project
July 16 VAE and Q-VAE Amir Hossein
July 23 Score-based Diffusion Mohammadreza
July 30 Midterm
August 6 DDPM and DDIM Mohammadreza