anomaly

which statistical parameters are more useful to detect anomalies and outlier? mean max min var?

user10296606

2022年6月2日 15:01

This time series contains some time frame which each of them are 8K (frequencies)*151 (time samples) in 0.5 sec [overall 1.2288 millions samples per half a second) I need to find anomalous based on different rows (frequencies) Report the rows (frequencies) which are anomalous? (an unsupervised learning method) Do you have an idea to which statistical parameter is more useful for it? mean max min median var or any parameters of these 151 sampling? Which parameter I should use? (I …

Topic: anomaly anomaly-detection time-series statistics machine-learning

Category: Data Science

How to find an anomalous matrix among many?

TheDataScientist101

2022年5月8日 13:08

Let's say we have a bunch of matrices that we know are non-anomalous. We now receive a new matrix and want to know if it belongs into the group or is way off. Is there a way to do that? I'm thinking of something similar to MAD (median absolute deviation) but for matrices.

Topic: anomaly anomaly-detection statistics

Category: Data Science

How to test unsupervised learning methods for anomaly detection?

user10296606

2022年4月19日 17:01

How to test unsupervised learning methods for anomaly detection? I am looking for a test strategy to evaluate my result of my anomaly detection technique? what is your offer more than evaluate with different algorithms. My data is some time series very low frequency.

Topic: anomaly unsupervised-learning anomaly-detection evaluation time-series

Category: Data Science

Are cluster feature and micro-cluster good summary statistics for outlier detection in high dimensional data streams?

I Sui

2022年1月19日 05:16

I'm dealing with outlier detection in data streams. I'm looking for a way to summarize my data and obtain important statistics such as means and variance, etc. I want to know if the cluster features or microclusters are suitable or not.

Topic: anomaly anomaly-detection outlier data-stream-mining clustering

Category: Data Science

Assign a risk score in records in a dataset

nameguest

2022年1月4日 15:56

I was wondering, if I have a dataset with categorical and numerical data and labels such as 1 or 0 that shows if a row is anomalous or normal respectively. Is it possible to create somehow a model that will assign something like how much risky a record is using as input these numerical and categorical features? Edit My thoughts were to train a supervised anomaly detection method that will classify the records as 0 or 1. But instead of …

Topic: anomaly anomaly-detection regression outlier

Category: Data Science

What type of Anomaly Detection Model could I use?

nameguest

2022年1月4日 08:20

I would like to create an anomaly detection model that assigns a probability of risk instead of labels (1 or 0). My problem is that I only know for sure which records are anomalous but not which are Normal. Regarding this, would be better to work on Unsupervised anomaly detection instead of semi-supervised or supervised? Note: I have high dimensional data (20-40+ features). And a few hundreds of anomalies while around a thousand that I do not know.

Topic: anomaly anomaly-detection

Category: Data Science

Anomaly Detection Techniques

Pluviophile

2021年12月4日 07:11

Often the hardest part of solving an Anomaly Detection problem can be finding the right technique for the job. Different Anomaly techniques are better suited for different types of data and different problems. Are there any flowchart/tree diagrams that are designed to give users a bit of a rough guide on different anomaly techniques and the approach problems with regard to which technique to try on the data?

Topic: anomaly anomaly-detection

Category: Data Science

How to detect anomalies in each feature - time series

Fabio

2021年11月1日 00:46

I have a dataset with 5 features corresponding to 5 sensors that measure each three seconds the state of an accelerator. It is structured as well: Sensor 1 | Sensor 2 | Sensor 3 | Sensor 4 | Sensor 5 | Label 1.5 1.1 0.8 1.2 1.2 0 1.2 1.4 1.4 1.4 1.1 0 1.2 1.1 1.2 1.3 1.5 0 The label indicates if the time series is anomaly(=1) or not(=0). I have an anomaly detection task, and the frameworks …

Topic: anomaly dataframe anomaly-detection time-series dataset

Category: Data Science

Generation of Anomalous data points

Saif Ali Khan

2021年10月31日 13:03

I have a task to generate some anomalous points in a real world dataset with 15 features, and a synthetic dataset of 5 features. I was thinking of using correlation between features, but it'll be a problem to do for 15 features. Thanks!

Topic: anomaly anomaly-detection machine-learning

Category: Data Science

How do I determine the top "reason" for anomaly when using Isolation Forests

user

2021年9月23日 09:09

I am using Isolation Forests for Anomaly Detection. Say, my set has 10 variables, var1, var2, ..., var10, and I found an anomaly. Can I rank the 10 variables var1, var2, ..., var10 in such a way I can say that I have an anomaly and the main reason is, say, var6. For example, if I had var1, var2, var3 only, and my set were: 5 25 109 7 26 111 6 23 108 6 26 109 6 978 108 …

Topic: isolation-forest anomaly anomaly-detection outlier machine-learning

Category: Data Science

How is convex hull method used in outlier detection?

achhainsan

2021年7月12日 15:30

I think the slides are bit unclear on what they want to say. Can someone elaborate this with example.

Topic: anomaly anomaly-detection outlier

Category: Data Science

Looking for a good package for anomaly detection in time series

pythinker

2021年7月1日 10:43

Is there a comprehensive open source package (preferably in python or R) that can be used for anomaly detection in time series? There is a one class SVM package in scikit-learn but it is not for the time series data. I’m looking for more sophisticated packages that, for example, use Bayesian networks for anomaly detection.

Topic: anomaly bayesian-networks anomaly-detection time-series python

Category: Data Science

Practical problems in anomaly detection where the number of normal data is extremely high compared to abnormal data

pie

2021年5月15日 02:04

If the ratio of abnormal data is about 1 to 10,000 normal data, even if the true negative rate is 99%, there will be 100 false positive data, and the precision( TP/(TP+FP) ) will be low. If this kind of anomaly detection is to be put to practical use, I think it is necessary to create a model with a fairly high prediction accuracy. How do the actual examples of anomaly detection in the world deal with this problem? Is …

Topic: anomaly anomaly-detection machine-learning

Category: Data Science

How to determine the abnormality of a specific variable by taking into account all the other variables in the data?

AdrienC

2021年1月6日 23:44

I have an issue of machine learning/anomaly detection. Indeed, I have a variable Y and several other variables X. The purpose is to quantify the degree of abnormality of the data on Y but I have to take into account the values on the other variables (the relationship between Y and X). Normally, an anomaly detection algorithm would find anomalies but on the whole data (Y + X), but in my case I want to zoom in on Y because …

Topic: anomaly anomaly-detection research machine-learning

Category: Data Science

How to perform Anomaly Detection on a force profile?

Xela95

2020年10月28日 17:56

I have a set of force profiles of an industrial machine. I'm trying to develop an algorithm that tries to understand when a new profile is "anomalous" with respect to the ones in "normal operating conditions". In the picture below you can see the force profiles (function of time). I want the blue curve found checked as anomalous. What approaches do you suggest? I'm thinking about using some statistical distance (like mahalanobis) to check the similarity of a new curve …

Topic: anomaly pca anomaly-detection clustering

Category: Data Science

Anomaly Detection System

Angelos

2019年4月26日 14:07

I need a sanity check. I want to create an anomaly detection system. The logic which I am planning to use is the following: Find anomalies in the past using Seasonal Hybrid Extreme Studentized Deviate Test. Binarise the anomalies (1 the anomalies and 0 the trends). Run several algorithms (Autoencoders, SVM, Logistic Regression, Naive Bayes, Lasso Regression, etc) with variables that are correlated and validate the models and use it. Does the binarisation process makes sense?

Topic: anomaly machine-learning-model anomaly-detection binary machine-learning

Category: Data Science

How to find anomalies/outliers in Panel Data?

Kale-ab Tessera

2019年4月24日 14:18

I have panel data based on 900000 different entities with 384 time steps and the data is not normally distributed. I am looking for outliers/anomalies, this is unsupervised as I have no examples of anomalies/outliers. Apart from clustering methods such as K-means, DBSCAN/HDBSCAN, what options do I have?

Topic: anomaly dbscan outlier time-series k-means

Category: Data Science

Anomaly detection without any knowledge about structure

Jan van der Vegt

2019年2月28日 13:27

I have an interesting question, my code needs to be able to handle structured data where I don't know much about the structure at development time. I know the samples follow a schema that can be nested and the leafs contain some basic primitives like floats, integers and categoricals. I will have access to the schema at training time. I want to train a model that can detect anomalies or more directly whether or not a sample comes from the …

Topic: anomaly

Category: Data Science

Anomaly Detection for Large Time Series Data

Zaynab

2018年6月20日 23:29

I am working on detecting anomalies within a large time series data set. It is updated on a regular basis and consists of more than 30 parameters. I am using R as a reference language. It is a first for me working on this type of projects and I am unfamiliar with most of the techniques. I have 6 weeks to implement a good analytical toolbox to enhance the quality of the control checks on the production line. I have …

Topic: anomaly data-analysis deep-learning time-series statistics

Category: Data Science

How to detect anomalous points

Arkan

2018年3月10日 13:38

As is clear from the figure, the blue points, which don't follow the trend, are anomalous points. I'm wondering about the best non-parametric method to detect those points. I have tested some outlier detection methods such as standard deviation, etc. but they don't provide good results while it is clear from the figure.

Topic: anomaly anomaly-detection outlier visualization statistics

Category: Data Science

About