AMATH/CSE 579: Intelligent control through learning and optimization
SLN 19363, MW 3:00-4:20, ARC G070
Emo Todorov
Guggenheim 415h
todorov@cs.washington.edu
Course Description
Design of near-optimal controllers for complex dynamical systems, using
analytical techniques, machine learning, and optimization. Topics from
deterministic and stochastic optimal control, reinforcement learning and
dynamic programming, numerical optimization in the context of control, and
robotics. Prerequisite: vector calculus, linear algebra, and Matlab.
Recommended: differential equations, stochastic processes, and optimization.
Syllabus
See introductory lecture.
Lecture slides
Lecture 1: Introduction
Lecture 2: Markov Decision Processes and Bellman Equations
Lecture 3: Controlled Diffusions and Hamilton-Jacobi-Bellman Equations
Lecture 4: Deterministic Systems and Pontryagin's Maximum Principle
Lecture 5: Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters
Lecture 6: Linearly-Solvable Stochastic Optimal Control Problems
Lecture 7: Inverse Optimal Control
Lecture 8: Trajectory-based Optimization
Lecture 9: Numerical Optimization
Lecture 10: Optimal control in locomotion
Lecture 11: Function approximation methods
Lecture 12: Applications to biological movement
General Readings
A. Barto and R. Sutton (1998) Reinforcement learning: An introduction (online book)
E. Todorov (2006) Optimal control theory (book chapter)
D. Bertsekas (2008) Dynamic programming (lecture slides)
R. Tedrake (2009) Underactuated robotics: Learning, planning and control (lecture notes)
B. Van Roy (2004) Approximate dynamic programming (lecture notes)
P. Abbeel (2009) Advanced robotics (lecture slides)
Lecture-specific Readings
Lecture 6:
Todorov (2009) Efficient computation of optimal actions. PNAS 106: 11478-11483
Lecture 7:
Krishnamurthy and Todorov (2010) Inverse optimal control with linearly-solvable MDPs. In proceedings of ICML
Lecture 8:
Tassa, Erez and Smart (2007) Inverse optimal control with linearly-solvable MDPs. In proceedings of NIPS
Todorov and Li (2005) A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In proceedings of ACC
Lecture 10:
Muico, Lee, Popovic and Popovic (2009) Contact-aware nonlinear control of dynamics characters. In proceedings of SIGGRAPH
Wampler and Popovic (2009) Optimal gait and form for animal locomotion. In proceedings of SIGGRAPH
Lecture 11:
Bertsekas (2010) Approximate dynamic programming. In Dynamic Programming and Optimal Control, vol 2, 3rd ed
Lecture 12:
Todorov (2004) Optimality principles in sensorimotor control. Nature Neuroscience
Code
Matlab MDP solver: all problem formulations and algorithms
Acrobot dynamics
Animation of acrobot dynamics
Homeworks
Homework 1, due April 28
Default Final Project, due before the final presentations