728x90

AI/RL (2021 DeepMind x UCL ) 12

Lecture 4: Theoretical Fund. of Dynamic Programming Algorithms

Contraction mapping we take a sequence that is convergent in that space , apply Transfomation (T) to that sequence, we get another convergence sequence that covnerges to T of x. which is limit of that sequence fixed point if apply T trasnform to x it goes back to original x Banach Fixed Point Theorem we can know that sequence is convergent to unique fixed point X star The Bellman Optimality Oper..

728x90