build Environment using OpenAI Gym
I have an environment which contains 26 states. Each episode sample have an terminal state.I want my agent to learn how to get to the terminal state faster by using the minimum sequences of action. some states have obstacles. In my data simulation the actions are defined ( there are total 3 actions ). How can I prepare the environment using openAI gym ?
Topic openai-gym reinforcement-learning
Category Data Science