| 1. |
Explain Markov’s decision process. |
|
Answer» Markov’s decision process (MDP) is a mathematical approach for reinforcement learning. Markov's decision process (MDP) is a mathematical framework used to solve problems where outcomes are partially random and partly controlled. To solve a complex problem using Markov’s decision process, the following basic things are needed-
The working of Markov’s model can be understood from the following diagram. In simple words, the agent has to do some action to start from its initial state. While doing so, it RECEIVES rewards based on the actions it takes. The policy defines the action it takes, and the reward collected defines the value (V). |
|