Stochastic Control Notes Update

I’ve updated the stochastic control notes here:

Stochastic_Control_2020_May.pdf

These still remain a work in progress.

(Typos/errors/mishaps found are welcome)

A quick summary of what is new:

  • I’ve updated a section on the ODE method for stochastic approximation.
  • I’ve improved the discussion around Temporal Difference methods and included some proofs.
  • I’ve added a proof of convergence for SARSA. 
  • I’ve added a section on Lyapunov functions in continuous time
    • La Salle, exponential convergence, and online convex optimization…
  • I’ve started a section on Policy Gradients but there is more recent proofs to include
  • I’ve started a section on Deep Learning for RL.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: