Optimization in Reinforcement Learning

Improving policy learning and efficiency.