🤖

Sequential Decision Making

Master the art of making optimal decisions over time. From Dynamic Programming and MDPs to Reinforcement Learning and Multiagent Systems.

Course Syllabus

01

Sequential Decision Problems

Foundations, Finite Horizon & Basic DP.

02

Principle of Optimality

Tail Sub-problems & Backward Induction.

03

Scheduling Example

Discrete-State DP in Action.

04

DP Algorithm & Examples

General Algorithm & Real-world Applications.

05

Optimal Control & Shortest Path

Forward Pass & Graph Representation.

06

Q-Factors & TSP

Alternative DP form & Traveling Salesman Problem.

07

TSP Algorithm

Python Implementation & Discrete Optimization.

08

Approx. in Value Space

Overcoming computational limits with approximations.

09

Lookahead Approx.

One-Step vs Multistep Lookahead and Chess Example.

10

Rollout with Base Heuristic

On-line approximation using base policy simulation.

11

TSP with Rollout

Improving Nearest Neighbor heuristic with Rollout.

12

Infinite Horizon Problems

Value Iteration, Policy Iteration, and Bellman Equation.

14

Linear Quadratic Problems

Riccati Equation, Newton's Method, and Rollout.

07

DP with Time Delays

State Augmentation and Non-additive Costs.

15

DP with Forecast

Augmented System and Conditional Expectation.

16

Uncontrollable States

Simplifying DP by Averaging Out.

17

DP for Tetris

Stochastic Formulation and Challenges.

18

POMDP & Belief State

Partial Information and Sufficient Statistics.

19

Bidirectional Parking

Belief State DP and Multiagent Teaser.

20

Multiagent Rollout

Efficient Control for Multiagent Systems.

21

Model Predictive Control

Optimization, Constraints, and Applications.

22

Exercising an Option

Optimal Stopping and Rollout Comparison.

23

RL vs DP Terminology

The Rosetta Stone of Sequential Decision Making.

24

Approximation in Value Space

Bridging Exact DP and RL.

25

Four Queens Problem

Constraint Satisfaction & DP.

26

General Discrete Optimization

Sequential View of Optimization.

27

Approx. in Value Space

Deterministic Problems & Q-Factors.

28

Rollout Algorithms

Discrete Optimization & Routing.

29

Cost Improvement

Sequential Consistency & Improvement.

30

Advanced Rollout

Parallel & Simplified Strategies.

31

Model-Free Rollout

Truncated, Expert-Guided & LLMs.

32

DeepSeek & GRPO

Group Relative Policy Optimization.

33

RNA Folding

Discrete Optimization in Biology.

34

Value & Policy Approx.

Training Nonlinear Architectures.

35

Fitted Value Iteration

Sequential Backward Training.

36

Linear Programming

Exact & Approximate LP Methods.

37

Policy Space Approx.

Parametrization & Expert Training.

38

Optimization Framework

Random Search & Cross Entropy.

39

Policy Gradient Methods

Deterministic vs Stochastic Problems.

40

Approx. Gradient Methods

Log-Likelihood Ratio & Cost Shaping.

5+