Does an RL agent learn during exploitation?
I have started with RL and have some doubts regarding it.
Does an RL agent learn during exploitation, or does it only learn during exploration?
Is it possible to train a model only using exploitation (i.e. where exploration is not allowed)?
Topic ai reinforcement-learning machine-learning
Category Data Science