Reinforcement Learning In High-Diameter, Continuous Environments