GridSearch on imbalanced datasets
Im trying to use gridsearch to find the best parameter for my model. Knowing that I have to implement nearmiss undersampling method while doing cross validation, should I fit my gridsearch on my undersampled dataset (no matter which under sampling techniques) or on my entire training data (whole dataset) before using cross validation?
Topic hyperparameter-tuning imbalance scikit-learn machine-learning
Category Data Science