Develop a strategy to maximize your average reward per move (equivalent to maximizing total reward over n moves). Express this as a function of k, using Θ-notation. In other words, your maximization doesn't have to be entirely precise; you may assume that k is any convenient number that will make the math easier for your strategy, but you cannot assume that k = O(1). Notice that any strategy that you come up with provides a lower bound on reward optimality. The better the strategy, the better (higher) the lower bound. It's trivial to get Ω(1) per move, so you must get ω(1).

Answer:

jonathon3957

03.08.2021

Mathematics

2/5 cup of water

Step-by-step explanation:
