Abstract: Multi-Reward Proximal Policy Optimization, a multiobjective deep reinforcement learning algorithm, is used to examine the design space of low-thrust trajectories for a SmallSat transferring ...