书目详细信息 : Rollout, policy iteration, and distributed reinforcement learning