-
Lean 4 Notes
Stanford CS99 Functional Programming and Theorem Proving in Lean 4
-
Probability Review I - Definition
Stanford STAT310 Theory of Probability
-
Reinforcement Learning Notes II - Policy Gradients & Actor-Critic Methods
Policy Gradients, Actor-Critic Methods, Model-based RL
-
Reinforcement Learning Notes I - Introduction & Imitation Learning
Introduction, MDPs, Imitation Learning
-
Reinforcement Learning as a Co-Design of Product and Research
Notes for CS25 Transformers United V5, Lecture 2