How can I compute the ideal variance threshold value for my data?

I have a dataset that contains n features scaled between [0,1]. I would use an unsupervised feature selection algorithm (variance thresholding). How can I compute the threshold value?

Topic variance feature-selection

Category Data Science


Variance thresholding is used to select those features with a variance above the suggested threshold. Ideally you would want to take in all features with a non-zero variance but I'm not sure of the data youre handling, it would be better to calculate the variance of the individual feature, arrange them in the increasing order of variance and then select that value where the variance sharply increases.

Or you could do PCA and find the order of the importance of features and then set the threshold.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.