Types of artificial anomalies

I am working on some algorithms for anomaly detection.

The dataset is clean of anomalies, so I want to add some artificial ones.

I have already added some anomalies: I take the maximum value of the dataset and add 20-25% to it, so the injected anomalies are 20 to 25% larger than the maximum value.
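A minimal sketch of this kind of injection, assuming NumPy and a one-dimensional numeric array. The function name `inject_max_anomalies`, its parameters and the number of injected points are hypothetical and only illustrate the idea:

```python
import numpy as np

def inject_max_anomalies(values, n_anomalies=5, low=0.20, high=0.25, seed=0):
    """Replace `n_anomalies` random entries with values 20-25% above the maximum.

    Returns the modified copy (as floats) and the indices that were overwritten.
    """
    rng = np.random.default_rng(seed)
    out = np.asarray(values, dtype=float).copy()
    max_value = out.max()
    # Pick distinct positions to overwrite.
    idx = rng.choice(out.size, size=n_anomalies, replace=False)
    # Use abs() so the anomalies stay above the maximum even if it is negative.
    # Note: if the maximum is exactly 0, this adds nothing.
    out[idx] = max_value + abs(max_value) * rng.uniform(low, high, size=n_anomalies)
    return out, idx

# Hypothetical clean data: the integers 1..100.
data = np.arange(1, 101)
with_anomalies, anomaly_idx = inject_max_anomalies(data, n_anomalies=5)
```

Keeping the returned indices is useful because they serve as ground-truth labels when evaluating a detector.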

Are there any other types of anomalies that would be useful to include in a dataset for testing an anomaly detection algorithm?

My dataset contains integer and float values.

Topic anomaly-detection dbscan outlier python

Category Data Science


There are many options. Here are a few:

  • Values that are very high or very low.
  • Values that are many standard deviations from the mean of the current distribution.
  • Multivariate anomalies - the combination of values across features is anomalous, even if each value looks normal on its own.
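
The three types above can be sketched roughly as follows, assuming NumPy and a hypothetical clean dataset with two correlated features. All function names, the 0.2-0.5 range margins, the 5-sigma threshold and the quantile ranges are illustrative choices, not fixed rules:

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical clean data: two strongly correlated features (y is roughly 2 * x).
n = 500
x = rng.normal(50.0, 5.0, size=n)
y = 2.0 * x + rng.normal(0.0, 1.0, size=n)
clean = np.column_stack([x, y])

def extreme_values(col, k, rng):
    """Values below the observed minimum or above the observed maximum."""
    span = col.max() - col.min()
    low = col.min() - rng.uniform(0.2, 0.5, size=k) * span
    high = col.max() + rng.uniform(0.2, 0.5, size=k) * span
    # Randomly choose the low or the high candidate for each anomaly.
    return np.where(rng.random(k) < 0.5, low, high)

def sigma_values(col, k, rng, n_sigma=5.0):
    """Values exactly n_sigma standard deviations above or below the mean."""
    signs = rng.choice([-1.0, 1.0], size=k)
    return col.mean() + signs * n_sigma * col.std()

def multivariate_points(data, k, rng):
    """Points whose features are individually ordinary but jointly unusual.

    A fairly high x is paired with a fairly low y, which breaks the
    positive correlation seen in the clean data.
    """
    qx = rng.uniform(0.85, 0.95, size=k)
    qy = rng.uniform(0.05, 0.15, size=k)
    return np.column_stack([np.quantile(data[:, 0], qx),
                            np.quantile(data[:, 1], qy)])

extremes = extreme_values(x, 5, rng)
sigmas = sigma_values(x, 5, rng)
multi = multivariate_points(clean, 5, rng)
```

The multivariate points are the most interesting test case: a detector that only looks at one feature at a time will miss them, because each coordinate stays inside the normal range of its own column.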
