On what principle did Google's DeepMind learn to walk?
I just saw this video on YouTube.
Which method did the agent in it use to learn to walk: Q-learning, a genetic algorithm, or a policy gradient method?
Topic deepmind q-learning genetic-algorithms deep-learning machine-learning
Category Data Science