Is data subsampling appropriate for hyperparameter optimisation?
Fundamentally, under what circumstances is it reasonable to run HPO on only a subsample of the training set?
I am using Population Based Training to optimise hyperparameters for a sequence model. My dataset consists of 20M sequences, and I was wondering whether it would make sense to optimise on a subsample because my budget is restricted.
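To make the question concrete, here is a minimal sketch of the kind of subsampling I have in mind. It assumes a uniform random draw without replacement and a fixed seed, so that every member of the population would train on the same subset. The function name `subsample_indices`, the 5% fraction, and the small `n_total` are placeholders for illustration only, not settings I have committed to.

```python
import random


def subsample_indices(n_total, fraction, seed=0):
    """Draw a uniform random subset of dataset indices without replacement.

    Hypothetical helper: the name and arguments are illustrative only.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    # Number of examples to keep; always keep at least one.
    k = max(1, round(n_total * fraction))
    # A dedicated generator keeps the draw reproducible across runs.
    rng = random.Random(seed)
    return sorted(rng.sample(range(n_total), k))


# Small illustrative numbers; the real dataset has 20M sequences.
indices = subsample_indices(n_total=1000, fraction=0.05, seed=0)
print(len(indices))
```

The indices would then select the sequences used during HPO, with the full training set reserved for the final run with the chosen hyperparameters.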
Topic hyperparameter-tuning deep-learning neural-network
Category Data Science