markov decision process
Bellman equation
value iteration

3 Ways of Learning

Markov Decision Process

On Rewards

Two way is Infinite

Discount Factor

Polices

Finding Polices

Findn Polices Quiz

Finding Polices Again

V Function & Q Function

C Function

Ralation of Bellman Equations( Q Func is Cool!)

What've Learned