Download

We maximize the expected cummulated reward