文摘
A novel many-objective reinforcement learning algorithm is proposed. A benchmark many-objective pathfinding problem is introduced. We evaluated the algorithm on path finding problems with five and six objectives. Total reward obtained, solution set quality, and episode duration were measured. The proposed method outperforms the state of the art on all problems.