Action Optimizer

Manual Reinforcement Learning Simulator

Initialize Environment

Configure your K-Armed Bandit problem. Name your actions to make them meaningful.