Reward Function
In MCTS, the function that assigns a reward to a rollout outcome.
In MCTS, the function that assigns a reward to a rollout outcome. For math, the reward is binary: +1 if correct, 0 if incorrect.
In MCTS, the function that assigns a reward to a rollout outcome.
In MCTS, the function that assigns a reward to a rollout outcome. For math, the reward is binary: +1 if correct, 0 if incorrect.