How to evaluate data imputation techniques
I have a data set with 29 features 8 if them have missing values.
I've tried Sklearn simple imputer and all it's strategies KNN imputer and several Number of K Iterative imputer and all combinations of imputation order , estimators, number of iterations.
My question is how to evaluate the imputation techniques and choose the better one for my Data.
I can't run a base line model and evaluate it's performance because I'm not familiar with balancing the data and tunning the parameters and all models are giving poor scores.
There's why I'm searching if there's another way to evaluate the data imputation techniques Something like evaluation of distribution or something like that. I'm very newbie btw so pardon my stupidity
Topic imbalanced-learn data-imputation scikit-learn classification dataset
Category Data Science