Using cross validation score to perform feature selection
So to perform my feature selection I ran cross validation over and over again, each time trying different subsets of my attributes and repeated this until I got the best cross validation score I could get. Is this alright to do or I am creating a major bias? I suspect that this could cause a bias and possibly result in data leakage because I am probably learning something about my test set by doing this, but how bad of a bias would this be? My data set is too small to create another validation set.
Topic cross-validation feature-selection
Category Data Science