How can I compute the correlation between two estimators?

I'm working with several estimators of various kinds. I want to stack them, and stacking works best when the estimators have low correlation with one another.

I suppose the correlation method depends on the type of the dependent variable, i.e. whether it is categorical or numerical. In my case it is categorical, and the estimators are classifiers.

How can I compute the correlation between two estimators?

Topic estimators classifier correlation

Category Data Science


If I'm not mistaken, this idea already exists in Random Forest. Basically, lower correlation among the weak classifiers of an ensemble increases the model's error-correcting ability, so it is preferable to combine models with low correlation when building an ensemble classifier. Random Forest uses some clever tricks (training each tree on a bootstrap sample and considering a random subset of features at each split) to make its decision trees less correlated with each other.


There are probably many options. I'd look at inter-annotator agreement measures, such as Cohen's kappa, treating each classifier as an annotator.
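As a rough illustration of the agreement idea, here is a minimal sketch of Cohen's kappa (one common inter-annotator agreement measure) computed on the predictions of two classifiers over the same samples. The function name `cohen_kappa` and the toy prediction lists are made up for the example; they are not from the question.

```python
from collections import Counter

def cohen_kappa(preds_a, preds_b):
    """Cohen's kappa between two label sequences of equal length."""
    if len(preds_a) != len(preds_b) or len(preds_a) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(preds_a)
    # Observed agreement: fraction of samples where both classifiers agree.
    observed = sum(a == b for a, b in zip(preds_a, preds_b)) / n
    # Chance agreement: computed from each classifier's label frequencies.
    counts_a = Counter(preds_a)
    counts_b = Counter(preds_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Both classifiers always output the same single label.
        return 1.0
    return (observed - expected) / (1.0 - expected)

# Hypothetical predictions of two classifiers on the same 8 samples.
preds_clf_1 = ["cat", "dog", "cat", "cat", "dog", "dog", "cat", "dog"]
preds_clf_2 = ["cat", "dog", "dog", "cat", "dog", "cat", "cat", "dog"]

kappa = cohen_kappa(preds_clf_1, preds_clf_2)
print(kappa)
```

A kappa near 1 means the two classifiers almost always agree (highly redundant for stacking), while a value near 0 means they agree no more than chance would predict. If scikit-learn is available, `sklearn.metrics.cohen_kappa_score` should give the same number for these inputs.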

An alternative would be to evaluate one classifier (e.g. by accuracy or F1-score) using the other's predictions as the gold standard.
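The gold-standard idea could be sketched like this: take the predictions of one classifier as if they were the true labels and score the other classifier against them. The helper names `accuracy` and `macro_f1`, and the toy prediction lists, are assumptions made for the example.

```python
def accuracy(gold, pred):
    """Fraction of samples where pred matches gold."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 scores."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(g == label and p == label for g, p in zip(gold, pred))
        fp = sum(g != label and p == label for g, p in zip(gold, pred))
        fn = sum(g == label and p != label for g, p in zip(gold, pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical predictions of two classifiers on the same 8 samples.
preds_clf_1 = ["cat", "dog", "cat", "cat", "dog", "dog", "cat", "dog"]
preds_clf_2 = ["cat", "dog", "dog", "cat", "dog", "cat", "cat", "dog"]

# Treat classifier 1 as the "gold standard" and score classifier 2 against it.
acc = accuracy(preds_clf_1, preds_clf_2)
f1 = macro_f1(preds_clf_1, preds_clf_2)
print(acc, f1)
```

High scores mean the second classifier largely reproduces the first, so the pair adds little diversity to a stack. Swapping which classifier plays the gold standard leaves accuracy and per-class F1 unchanged, although precision and recall trade places.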
