Collinearity between continuous and categorical variable

I have a medical dataset with features age, bmi, sex, gender, # of children, region, charges, smoker. Here smoker, gender, sex and region are categorical variables and others are numerical features. How do I check for collinearity between these in my dataset?

Topic collinearity python

Category Data Science


You could just regress against any given variable.

You could also generate a matrix of correlation metrics. Depending on the variable type, you would need a different metric. Here are some common ones:

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.