How do Classification Algorithms such as Catboost and Random Forest parse test data?
I would like to know how classification works with the algorithms listed above. My specific question is this, say I have a high signal continuous feature which has a certain distribution and I train a model according to some training data and it finds the best split for that feature. When I use the model on test data, would it split according to a specific number or by distribution? i.e if the number '10' provides the best split for the training data which happens to be 75th percentile. When it comes to test data will it split according to the percentile or the number 10? I hope that is clear.
Topic catboost training random-forest machine-learning
Category Data Science