1. Consider the following scenario and answer the question.

Let's play a game: we have an agent (a robot), a reward (a diamond), and many hurdles (fires) in between. The goal of the robot is to reach the reward (diamond) while avoiding the hurdles (fire). The robot learns by trying the possible paths and then choosing the path that reaches the reward while encountering the fewest hurdles. Each correct step brings the robot closer to the diamond and earns it some points; each wrong step pushes the robot away from the diamond and takes away some of the accumulated points. The reward (diamond) is given to the robot when it reaches the final stage of the game.

Which kind of ML algorithm is applied in the above game? Explain it in detail.

Answer»

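The game describes reinforcement learning (RL). In RL, an agent (the robot) interacts with an environment (the board with fires and a diamond) by taking actions (steps). After each action the environment returns a new state and a numeric reward: positive for a step that brings the robot closer to the diamond, negative for a wrong step or a step into fire. The agent is not given labeled examples of the correct path, which distinguishes RL from supervised learning; it learns by trial and error, balancing exploration of new paths against exploitation of paths already known to score well. Over many attempts it learns a policy, that is, a rule for choosing an action in each state, that maximizes the total accumulated reward, which here means reaching the diamond while meeting the fewest fires.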
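The robot-and-diamond game in the question is a standard reinforcement learning setup, and one common way to solve such a game is tabular Q-learning. The sketch below is only an illustration under stated assumptions: the 3x4 grid layout, the reward values (+10 for the diamond, -10 for fire, -1 per ordinary step), the hyperparameters, and the names `GRID`, `step`, `train`, and `greedy_path` are all hypothetical and do not come from the question.

```python
import random

# Hypothetical layout (an assumption): S = start, F = fire, D = diamond, . = empty
GRID = [
    "...D",
    ".F.F",
    "S...",
]
ROWS, COLS = len(GRID), len(GRID[0])
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
START = (2, 0)


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    if not (0 <= nr < ROWS and 0 <= nc < COLS):
        nr, nc = r, c  # moving off the board leaves the robot in place
    cell = GRID[nr][nc]
    if cell == "D":
        return (nr, nc), 10.0, True    # diamond: large reward, game ends
    if cell == "F":
        return (nr, nc), -10.0, False  # fire: points are taken away
    return (nr, nc), -1.0, False       # small cost per step favors short paths


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))  # explore
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])  # exploit
            nxt, reward, done = step(state, action)
            # Q-learning update: move the estimate toward reward + discounted best next value
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned policy from START, always taking the best-valued action."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

After training, `greedy_path(train())` returns the sequence of cells the robot visits when it follows the learned policy. The per-step cost of -1 is a design choice that makes shorter fire-free routes score higher than longer ones.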


