Why do feature engineering and filling NaNs reduce my score?
I used CatBoost for an InClass Kaggle competition. I have tried various strategies for filling NaN values, converting float binary variables to categorical, and adding new categorical features (derived from age, for example). I have also tried generating new features from existing ones and removing irrelevant features (based on correlation, feature importance, and SHAP values). But all of this only makes the score worse. Why? The best score came with no preprocessing at all, using only hyper-parameters found via random search (see the sketch below).
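
A minimal sketch of the two setups, assuming a training CSV with a binary "target" column and an "age" column; the file path, column names, and parameter grid are placeholders for illustration, not the actual competition data:

```python
import pandas as pd
from catboost import CatBoostClassifier, Pool

df = pd.read_csv("train.csv")  # hypothetical path
y = df["target"]
X = df.drop(columns=["target"])

# Baseline that scored best: raw features, NaNs in numeric columns left as-is
# (CatBoost handles them natively), categorical columns passed via cat_features,
# hyper-parameters found with CatBoost's built-in randomized search.
cat_features = X.select_dtypes(include="object").columns.tolist()
X[cat_features] = X[cat_features].fillna("missing")  # CatBoost rejects NaN in categorical columns
train_pool = Pool(X, y, cat_features=cat_features)

model = CatBoostClassifier(loss_function="Logloss", verbose=False)
param_distributions = {
    "depth": [4, 6, 8],
    "learning_rate": [0.01, 0.05, 0.1],
    "l2_leaf_reg": [1, 3, 5, 7],
}
model.randomized_search(param_distributions, train_pool, n_iter=20, plot=False)

# Example of one engineered feature that lowered the score:
# binning "age" into a categorical band.
X_fe = X.copy()
X_fe["age_band"] = pd.cut(
    X_fe["age"], bins=[0, 18, 35, 60, 120],
    labels=["child", "young", "adult", "senior"]
).astype(str)
```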
Topic catboost feature-engineering kaggle feature-selection data-cleaning
Category Data Science