Bellman's Optimality Equation


This article walks through the idea behind Bellman's optimality equation and how it is used in reinforcement learning.

Bellman's equation for a deterministic environment:

$V(s) = \max_a \left( R(s,a) + \gamma V(s') \right)$
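To make the deterministic equation concrete, here is a minimal sketch that applies it repeatedly (value iteration) until the values settle. The three-state chain, its rewards, and the names `next_state`, `reward`, and `value_iteration` are assumptions made up for illustration; they are not taken from the article.

```python
GAMMA = 0.9  # discount factor (assumed value)

# Hypothetical 3-state chain; state 2 is terminal.
# Actions: 0 = "stay", 1 = "move right".
# next_state[s][a] is the state s' reached from s under action a,
# reward[s][a] is R(s, a).
next_state = {0: {0: 0, 1: 1}, 1: {0: 1, 1: 2}}
reward = {0: {0: 0.0, 1: 0.0}, 1: {0: 0.0, 1: 1.0}}


def value_iteration(n_sweeps=100):
    """Repeatedly apply V(s) = max_a (R(s,a) + gamma * V(s'))."""
    V = {0: 0.0, 1: 0.0, 2: 0.0}  # the terminal state keeps value 0
    for _ in range(n_sweeps):
        for s in next_state:
            V[s] = max(
                reward[s][a] + GAMMA * V[next_state[s][a]]
                for a in next_state[s]
            )
    return V


V = value_iteration()
print(V)
```

In this toy chain, state 1 is one step away from the reward and state 0 is two steps away, so the value of state 0 is the value of state 1 discounted once by γ.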

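Bellman’s optimality equation:

$V^*(s) = \sum_a \pi(a|s) \sum_{s'} P(s'|s,a) \left[ E(r|s,a,s') + \gamma V^*(s') \right]$

Here π denotes an optimal policy, so the weighted sum over actions equals the maximum over a of the inner sum.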
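The stochastic form can be sketched the same way. The example below assumes a small made-up MDP in which `P[s][a]` lists `(probability, next_state, expected_reward)` triples; the helper names `q_value`, `optimal_values`, and `greedy_policy` are hypothetical. It computes the optimal values with a max over actions, then checks that weighting actions by a greedy policy π(a|s) gives the same numbers.

```python
GAMMA = 0.9  # discount factor (assumed value)

# Hypothetical MDP with states 0 and 1, plus terminal state 2.
# P[s][a] is a list of (probability, next_state, expected_reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)],
        1: [(0.8, 1, 0.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 1, 0.0)],
        1: [(0.5, 2, 1.0), (0.5, 1, 0.0)]},
}


def q_value(V, s, a):
    """Inner sum: sum over s' of P(s'|s,a) * (E[r|s,a,s'] + gamma * V(s'))."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def optimal_values(n_sweeps=500):
    """Value iteration using the max over actions."""
    V = {0: 0.0, 1: 0.0, 2: 0.0}  # the terminal state keeps value 0
    for _ in range(n_sweeps):
        for s in P:
            V[s] = max(q_value(V, s, a) for a in P[s])
    return V


def greedy_policy(V):
    """Deterministic policy pi(a|s) that puts all weight on the best action."""
    pi = {}
    for s in P:
        best = max(P[s], key=lambda a: q_value(V, s, a))
        pi[s] = {a: 1.0 if a == best else 0.0 for a in P[s]}
    return pi


V_star = optimal_values()
pi_star = greedy_policy(V_star)

# Right-hand side of the equation, with actions weighted by pi(a|s).
policy_backup = {
    s: sum(pi_star[s][a] * q_value(V_star, s, a) for a in P[s])
    for s in P
}
print(V_star, policy_backup)
```

The two dictionaries agree on the non-terminal states, which is the sense in which the policy-weighted sum and the max over actions describe the same optimal value function.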

Bellman’s equation is one amongst other very important eq...



Published on December 03, 2021 08:32


Aditya Chatterjee's Blog
