Training Loss or Validation Loss for Hyperparameter Optimisation
When performing hyperparameter optimisation (HO), should I train each candidate model (each with different hyperparameter values, e.g. picked by RandomSearch) on the training data and then select the one with the best training loss? Or should I select the model based on its performance on the validation set?
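To make the question concrete, here is a minimal sketch of the two selection rules I'm asking about. The model (Ridge), the hyperparameter (alpha), and the synthetic data are just placeholders for illustration, not part of my actual setup:

```python
# Sketch of the two selection rules: pick by training loss vs. by validation loss.
# Model, hyperparameter range, and data are illustrative placeholders only.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Toy regression problem split into training and validation sets.
X, y = make_regression(n_samples=500, n_features=20, noise=10.0, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.25, random_state=0)

results = []
for _ in range(20):                                   # 20 random-search trials
    alpha = 10 ** rng.uniform(-4, 2)                  # sample hyperparameter log-uniformly
    model = Ridge(alpha=alpha).fit(X_train, y_train)  # fit on the training data only
    results.append({
        "alpha": alpha,
        "train_mse": mean_squared_error(y_train, model.predict(X_train)),
        "val_mse": mean_squared_error(y_val, model.predict(X_val)),
    })

# Rule 1: choose the candidate with the lowest training loss.
best_by_train = min(results, key=lambda r: r["train_mse"])
# Rule 2: choose the candidate with the lowest validation loss.
best_by_val = min(results, key=lambda r: r["val_mse"])

print("Selected by training loss:  ", best_by_train)
print("Selected by validation loss:", best_by_val)
```

My question is which of these two rules (Rule 1 or Rule 2 above) is the right way to pick the hyperparameters.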