Schedule
-
EventDateDescriptionDescription
-
Lecture09/02/2025
TuesdayLecture 0: Course Overview and LogisticsLecture Notes:
-
Lecture09/02/2025
TuesdayLecture 1: Why Deep Learning -
Lecture09/02/2025
TuesdayLecture 2: Machine Learning vs AnalysisLecture Notes:
-
Lecture09/02/2025
TuesdayLecture 3: ML Component 1 - Data -
Session09/02/2025 15:00
TuesdayFirst Lecture -
Lecture09/05/2025
FridayLecture 4: Supervised, Unsupervised and Semi-supervisedLecture Notes:
-
Lecture09/05/2025
FridayLecture 5: Components 2 and 3: Model and LossLecture Notes:
Further Reads:
- ML Components: Chapter 1 - Sections 1.2.1 to 1.2.4 of [BB]
- ML Basics: Chapter 5 of [GYC]
-
Lecture09/05/2025
FridayLecture 6: First Example -- Classification by PerceptronLecture Notes:
Further Reads:
- Binary Classification: Chapter 5 - Sections 5.1 and 5.2 of [BB]
- McCulloch-Pitts Model: Paper A logical calculus of the ideas immanent in nervous activity published in the Bulletin of Mathematical Biophysics by Warren McCulloch and Walter Pitts in 1943, proposing a computational model for neuron. This paper is treated as the pioneer study leading to the idea of artificial neuron
-
Lecture09/05/2025
FridayLecture 7: Recap -- Law of Large NumbersLecture Notes:
Further Reads:
- Probability Theory: Chapter 2 of [BB]
- Probability Review: Chapter 3 of [GYC]
-
Lecture09/09/2025
TuesdayLecture 8: Training via Empirical Risk MinimizationLecture Notes:
Further Reads:
- Overview on Risk Minimization: Paper An overview of statistical learning theory published as an overview of his life-going developments in ML in the IEEE Transactions on Neural Networks by Vladimir N. Vapnik in 1999
-
Lecture09/09/2025
TuesdayLecture 9: Training Perceptron MachineLecture Notes:
Further Reads:
- Perceptron Simulation Experiments: Paper Perceptron Simulation Experiments presented by Frank Rosenblatt in Proceedings of IRE in 1960
- Perceptron: Chapter 1 - Section 1.2.1 of [Ag]
-
Lecture09/09/2025
TuesdayLecture 10: From Perceptron to NNs -- Universal ApproximationLecture Notes:
Further Reads:
- Universal Approximation: Paper Approximation by superpositions of a sigmoidal function published in Mathematics of Control, Signals and Systems by George V. Cybenko in 1989
-
Assignment09/12/2025
FridayAssignment #1 - Fundamentals of Machine Learning released! -
Lecture09/12/2025
FridayLecture 11: Deep Neural Networks -
Lecture09/12/2025
FridayLecture 12: Iterative Optimization by Gradient DescentLecture Notes:
Further Reads:
- Gradient-based Optimization: Chapter 4 - Sections 4.3 and 4.4 of [GYC]
- Gradient Descent: Chapter 7 - Sections 7.1 and 7.2 of [BB]
-
Lecture09/16/2025
TuesdayLecture 13: More on Gradient Descent -
Lecture09/16/2025
TuesdayLecture 14: Forward Propagation in MLPsLecture Notes:
Further Reads:
-
Lecture09/19/2025
FridayLecture 15: Training Neural Networks via GD -
Lecture09/19/2025
FridayLecture 16: Chain Rule on Computation Graph -
Lecture09/19/2025
FridayLecture 17: Backward Pass on Computation GraphLecture Notes:
Further Reads:
- Backpropagation: Chapter 6 - Section 6.5 of [GYC]
- Backpropagation: Chapter 8 of [BB]
-
Assignment09/21/2025
SundayProject Proposal released! -
Lecture09/23/2025
TuesdayLecture 18: Backpropagation over MLPLecture Notes:
Further Reads:
- Backpropagation: Chapter 8 of [BB]
- Backpropagation of Error Paper Learning representations by back-propagating errors published in Nature by D. Rumelhart, G. Hinton and R. Williams in 1986 advocating the idea of systematic gradient computation of a computation graph
-
Lecture09/23/2025
TuesdayLecture 19: First Neural Classifier -
Lecture09/26/2025
FridayLecture 20: Multiclass Classification -
Lecture09/26/2025
FridayLecture 21: Stochastic Gradient Descent -
Due09/26/2025 23:59
FridayAssignment #1 due -
Lecture09/30/2025
TuesdayLecture 22: Mini-batch SGD and Complexity-Variance Tradeoff -
Lecture09/30/2025
TuesdayLecture 23: Evaluation and Generalization MeasuresLecture Notes:
Further Reads:
- Generalization: Chapter 6 of the Book Patterns, predictions, and actions: A story about machine learning by Moritz Hardt and B. Recht published in 2021
-
Lecture09/30/2025
TuesdayLecture 24: Linear and Sub-linear Convergence SpeedLecture Notes:
Further Reads:
- Notes on Optimizers Lecture notes of the course Optimization for Machine Learning by Ashok Cutkosky in Boston University: A good resource for optimizers
-
Assignment10/01/2025
WednesdayAssignment #2 - Feedforward Neural Networks released! -
Lecture10/03/2025
FridayLecture 25: Optimizer Boosting -- Scheduling, Momentum and Rprop IdeasLecture Notes:
Further Reads:
- Learning Rate Scheduling Paper Cyclical Learning Rates for Training Neural Networks published in Winter Conference on Applications of Computer Vision (WACV) by Leslie N. Smith in 2017 discussing learning rate scheduling
- Rprop Paper A direct adaptive method for faster backpropagation learning: the RPROP algorithm published in IEEE International Conference on Neural Networks by M. Riedmiller and H. Braun in 1993 proposing Rprop algorithm
-
Lecture10/03/2025
FridayLecture 26: RMSprop and AdamLecture Notes:
Further Reads:
- RMSprop Lecture note by GEoffrey Hinton proposing RMSprop
- RMSprop Analysis Paper RMSProp and equilibrated adaptive learning rates for non-convex optimization by Y. Dauphin et al. published in 2015 talking about RMSprop and citing Honton’s lecture notes
- Adam Paper Adam: A Method for Stochastic Optimization published in 2014 by D. Kingma and J. Ba proposing Adam
-
Lecture10/03/2025
FridayLecture 27: Overfitting -
Due10/06/2025 23:59
MondayProposal due -
Due10/15/2025 23:59
WednesdayAssignment #2 due -
Exam10/24/2025 11:00
FridayMidtermTopics:
- The exam is 3 hours long
- No programming questions
- Starts at 11:00 AM