how to set threshold for anomaly detection

I read one research paper and they said that they are using a threshold for anomaly detection. The threshold is determined to make some proportion of data of the validation dataset labeled as anomalies. how does this concept make sense

Topic anomaly-detection machine-learning

Category Data Science


When your algorithms creates an anomaly detection model, it will create a set of expected values that represent normal value behaviours.You can then use this models to either alert based on the expected values with out a static threshold, or you can overlay this to some graph as a band. Ideally you need a considerable deviation from your expected values to trigger an alarm. Here the thresholds are calculated as error, or deviation from the expected and real values. You can apply a scaled min max threshold to this to have a good inference to the problem.When your data points are large in numbers, you cannot be able to determine the thresholds, here Thresholds may induce false positives but the outcomes are mostly better. You can make find the ideal thresholds by plotting an RC curve with TP and FP rates.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.