markov decision process
Bellman equation
value iteration
3 Ways of Learning
Markov Decision Process
On Rewards
Two way is Infinite
Discount Factor
Polices
Finding Polices
Findn Polices Quiz
Finding Polices Again
V Function & Q Function
C Function
Ralation of Bellman Equations( Q Func is Cool!)
What've Learned