Chi-square as an evaluation metric for nonlinear machine learning regression models

I am using machine learning models to predict an ordinal variable (values: 1, 2, 3, 4, and 5) from 7 different features. I posed this as a regression problem, so the final outputs of a model are continuous variables. An evaluation box plot looks like this (figure not included in this excerpt). I experiment with both linear models (linear regression, linear SVMs) and nonlinear models (SVMs with an RBF kernel, random forests, gradient boosting machines). The models are trained using cross-validation (~1600 samples), and 25% of the dataset is used …
Category: Data Science
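As a rough illustration of what a chi-square check could look like here, the sketch below rounds the continuous regression outputs back onto the 1-5 ordinal scale and compares the resulting class counts against the true class counts with scipy.stats.chisquare. The arrays y_true and y_pred are made-up placeholders, and note that this only compares the marginal count distributions, not per-sample agreement.

```python
import numpy as np
from scipy.stats import chisquare

# Hypothetical ordinal labels (1-5) and continuous regression outputs, not from the question.
y_true = np.array([1, 2, 2, 3, 4, 5, 3, 2, 4, 5])
y_pred = np.array([1.2, 2.4, 1.8, 3.3, 3.9, 4.6, 2.7, 2.1, 4.4, 4.9])

# Round and clip the continuous predictions back onto the ordinal scale.
levels = np.arange(1, 6)
y_pred_binned = np.clip(np.rint(y_pred), 1, 5).astype(int)

# Observed counts per level (from predictions) vs. expected counts (from labels).
observed = np.array([(y_pred_binned == k).sum() for k in levels])
expected = np.array([(y_true == k).sum() for k in levels])

# Chi-square goodness of fit between the two count distributions.
stat, p_value = chisquare(f_obs=observed, f_exp=expected)
print(stat, p_value)
```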

XGBClassifier's predictions are not probabilities with objective='binary:logistic'

I am using XGBoost's XGBClassifier with a binary 0-1 target, and I am trying to define a custom metric function. According to the XGBoost tutorials, it receives an array of predictions and a DMatrix with the training set. I used objective='binary:logistic' in order to get probabilities, but the prediction values passed to the custom metric function are not between 0 and 1. They can lie roughly between -3 and 5, and the range of values seems to grow …
Category: Data Science
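Depending on the XGBoost version and whether a custom objective is also in play, the values handed to a custom metric can be raw margin scores rather than probabilities; a common workaround is to map margins through the sigmoid inside the metric. A minimal sketch (the data and the margin_logloss name are made up for illustration):

```python
import numpy as np
import xgboost as xgb

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def margin_logloss(preds, dtrain):
    """Log loss that tolerates raw margin scores as well as probabilities."""
    labels = dtrain.get_label()
    # Depending on the XGBoost version/configuration, preds may already be
    # probabilities or may be raw margins; apply the sigmoid only when needed.
    probs = preds if ((preds >= 0) & (preds <= 1)).all() else sigmoid(preds)
    probs = np.clip(probs, 1e-15, 1 - 1e-15)
    loss = -np.mean(labels * np.log(probs) + (1 - labels) * np.log(1 - probs))
    return 'margin_logloss', loss

# Made-up data with 7 features and a binary 0-1 target.
rng = np.random.default_rng(0)
X = rng.random((200, 7))
y = (rng.random(200) > 0.5).astype(int)
dtrain = xgb.DMatrix(X, label=y)

params = {'objective': 'binary:logistic'}
bst = xgb.train(params, dtrain, num_boost_round=10,
                evals=[(dtrain, 'train')], custom_metric=margin_logloss)
```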

What is the relationship between the accuracy and the loss in deep learning?

I have created three different models using deep learning for multi-class classification, and each model gave me a different accuracy and loss value. The results on the test set are as follows: First model: accuracy 98.1%, loss 0.1882. Second model: accuracy 98.5%, loss 0.0997. Third model: accuracy 99.1%, loss 0.2544. My questions are: What is the relationship between the loss and accuracy values? Why is the loss of the third model the highest even though its accuracy is also the highest?
Category: Data Science
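A small made-up example of why the two can diverge: accuracy only checks whether the arg-max class is correct, while cross-entropy loss also penalises low confidence, so a model can be right more often and still have a higher loss.

```python
import numpy as np
from sklearn.metrics import accuracy_score, log_loss

y_true = [0, 1, 2, 0]

# Model A: 3 of 4 predictions correct, all reasonably confident.
probs_a = np.array([[0.7, 0.2, 0.1],
                    [0.1, 0.7, 0.2],
                    [0.2, 0.1, 0.7],
                    [0.2, 0.7, 0.1]])   # wrong: true class is 0

# Model B: all 4 predictions correct, but only barely more confident than chance.
probs_b = np.array([[0.36, 0.33, 0.31],
                    [0.32, 0.36, 0.32],
                    [0.31, 0.33, 0.36],
                    [0.36, 0.33, 0.31]])

for name, probs in [('A', probs_a), ('B', probs_b)]:
    acc = accuracy_score(y_true, probs.argmax(axis=1))
    loss = log_loss(y_true, probs, labels=[0, 1, 2])
    print(name, acc, round(loss, 3))   # B has higher accuracy AND higher loss than A
```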

AUC-ROC for Multi-Label Classification

Hey guys, I'm currently reading about AUC-ROC. I have understood the binary case and I think I understand the multi-class case. Now I'm a bit confused about how to generalize it to the multi-label case, and I can't find any intuitive explanatory texts on the matter. I want to check whether my intuition is correct with an example. Let's assume a scenario with three classes (c1, c2, c3). Let's start with multi-class classification: When we're considering …
Category: Data Science
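For reference, scikit-learn's roc_auc_score accepts a multi-label indicator matrix directly; a minimal sketch with made-up scores for three labels, showing macro averaging (per-label AUC averaged) and micro averaging (all label decisions pooled):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Hypothetical multi-label ground truth: one column per class (c1, c2, c3).
y_true = np.array([[1, 0, 1],
                   [0, 1, 0],
                   [1, 1, 0],
                   [0, 0, 1],
                   [1, 0, 0]])

# Hypothetical per-class scores from a model.
y_score = np.array([[0.8, 0.1, 0.7],
                    [0.2, 0.9, 0.3],
                    [0.7, 0.6, 0.2],
                    [0.3, 0.2, 0.8],
                    [0.6, 0.4, 0.1]])

# Macro: compute ROC AUC per label, then average. Micro: pool all label decisions.
print(roc_auc_score(y_true, y_score, average='macro'))
print(roc_auc_score(y_true, y_score, average='micro'))
```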

Average Precision if Target Class is Not in Evaluation

Suppose I have 5 classes, denoted by 1, 2, 3, 4, and 5, used in object detection. When evaluating object detection performance, suppose classes 1, 2, and 3 are present but classes 4 and 5 are not present in the target values. Will classes 4 and 5 each have an average precision of 0 (because their precision is zero, as no true positives can be identified)? Or perhaps there are other considerations to take …
Category: Data Science

Why does measuring the score after removing the sentences with the most contributing words show that a model is "faithful"?

I don't understand how computing the score after removing the sentences whose words contribute most to the result helps to show to what extent a model is "faithful" to a reasoning process. A faithfulness score was proposed by Du et al. in 2019 to verify the importance of the identified contributing sentences or words to a given model's outputs. It is assumed that the probability value for the predicted class will drop significantly if the truly …
Category: Data Science
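A toy sketch of the idea behind such a faithfulness score (the classifier and attribution scores below are stand-ins, not Du et al.'s actual setup): remove the k most contributing tokens and measure how much the predicted-class probability drops; a large drop suggests the attributions really were driving the prediction.

```python
import numpy as np

# Toy stand-in for a text classifier: the positive-class probability grows with the
# number of "important" words present. Purely illustrative, not a real model.
IMPORTANT = {'great', 'excellent', 'love'}

def predict_proba(tokens):
    score = sum(tok in IMPORTANT for tok in tokens)
    p_pos = 1 / (1 + np.exp(-(score - 1)))
    return np.array([1 - p_pos, p_pos])

def faithfulness_drop(tokens, importances, target_class, k=2):
    """Drop in predicted-class probability after removing the k most contributing tokens."""
    p_full = predict_proba(tokens)[target_class]
    top_k = set(np.argsort(importances)[-k:])
    ablated = [t for i, t in enumerate(tokens) if i not in top_k]
    return p_full - predict_proba(ablated)[target_class]

tokens = ['i', 'love', 'this', 'great', 'phone']
importances = [0.0, 0.9, 0.1, 0.8, 0.2]   # e.g. from a saliency/attribution method
print(faithfulness_drop(tokens, importances, target_class=1))   # large drop -> faithful
```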

Which metrics for evaluating a recommender system with implicit data?

I am currently in the process of creating a recommender system. It works with a neural network and then searches for the nearest neighbors to give recommendations to a user. The data is implicit: I only know which products a user has bought, and I create the recommendations on the basis of this data. What are the best metrics to evaluate this recommender system with implicit data? Can I evaluate the model and then the search …
Category: Data Science
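Commonly used offline metrics for implicit feedback are precision@k, recall@k, MAP and NDCG, computed against a held-out portion of each user's purchases. A minimal sketch of precision@k and recall@k for one user (the item ids are made up):

```python
def precision_recall_at_k(recommended, relevant, k):
    """Precision@k and recall@k for a single user.

    recommended: ranked list of recommended item ids
    relevant: set of item ids the user actually bought (held out)
    """
    top_k = recommended[:k]
    hits = len(set(top_k) & relevant)
    precision = hits / k
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical example: items the model ranked vs. items the user later bought.
recommended = ['i3', 'i7', 'i1', 'i9', 'i4']
relevant = {'i1', 'i4', 'i8'}
print(precision_recall_at_k(recommended, relevant, k=5))   # (0.4, 0.666...)
```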

What metrics work well with unbalanced datasets?

I wanted to know if there are metrics that work well when working with an unbalanced dataset. I know that accuracy is a very bad metric for evaluating a classifier when the data is unbalanced, but what about, for example, the Kappa index? Best regards and thanks.
Category: Data Science
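For reference, a few metrics generally considered more informative than accuracy on unbalanced data are available directly in scikit-learn; a small sketch with made-up predictions where class 1 is rare:

```python
from sklearn.metrics import (balanced_accuracy_score, cohen_kappa_score,
                             f1_score, matthews_corrcoef)

# Hypothetical imbalanced ground truth and predictions (class 1 is the rare class).
y_true = [0] * 90 + [1] * 10
y_pred = [0] * 85 + [1] * 5 + [0] * 6 + [1] * 4

print(cohen_kappa_score(y_true, y_pred))       # the Kappa index from the question
print(balanced_accuracy_score(y_true, y_pred))
print(f1_score(y_true, y_pred))
print(matthews_corrcoef(y_true, y_pred))
```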

What is the correct way to compute lift in lift charts

How is "lift" computed? i was reading about "Gain and lift charts" in data science. I picked the following example from https://www.listendata.com/2014/08/excel-template-gain-and-lift-charts.html I am clear on how the gain values are computed. Not clear about lift values are computed? (last column in table)
Category: Data Science
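Using the usual definition (not checked against the linked spreadsheet), lift for a decile is the cumulative gain, i.e. the percentage of all positives captured up to that decile, divided by the cumulative percentage of the population contacted. A sketch with synthetic scores and outcomes:

```python
import numpy as np
import pandas as pd

# Hypothetical scores and binary outcomes, sorted by descending model score.
rng = np.random.default_rng(0)
score = rng.random(1000)
target = (rng.random(1000) < score * 0.3).astype(int)

df = pd.DataFrame({'score': score, 'target': target}).sort_values('score', ascending=False)
df['decile'] = np.repeat(np.arange(1, 11), len(df) // 10)

summary = df.groupby('decile')['target'].agg(['count', 'sum'])
summary['cum_events'] = summary['sum'].cumsum()
summary['cum_pop_pct'] = summary['count'].cumsum() / summary['count'].sum()

# Gain: cumulative % of all positives captured up to this decile.
summary['gain'] = summary['cum_events'] / summary['sum'].sum()

# Lift: gain divided by the cumulative % of the population contacted.
summary['lift'] = summary['gain'] / summary['cum_pop_pct']
print(summary[['gain', 'lift']])
```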

Metrics for presenting RNN/LSTM result

I am working on two different architectures based on the LSTM model to predict a user's next action from their previous actions. I am wondering what the best way to present the results is. Is it okay to present only the prediction accuracy, or should I use other metrics? I found one paper using top-k accuracy, whereas a different paper used AUC/ROC. Overall, I would like to know what the state of the art is of …
Category: Data Science
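If top-k accuracy is the metric of interest, scikit-learn's top_k_accuracy_score is one readily available implementation; a minimal sketch with made-up next-action scores over 4 possible actions:

```python
import numpy as np
from sklearn.metrics import top_k_accuracy_score

# Hypothetical next-action ground truth (4 possible actions) and model scores.
y_true = np.array([2, 0, 3, 1, 2])
y_score = np.array([[0.1, 0.2, 0.6, 0.1],
                    [0.5, 0.2, 0.2, 0.1],
                    [0.1, 0.4, 0.2, 0.3],
                    [0.3, 0.3, 0.2, 0.2],
                    [0.2, 0.1, 0.3, 0.4]])

# Top-1 vs. top-3 accuracy: "was the true action among the k highest-scored actions?"
print(top_k_accuracy_score(y_true, y_score, k=1, labels=np.arange(4)))
print(top_k_accuracy_score(y_true, y_score, k=3, labels=np.arange(4)))
```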

Specificity for a 3-class classifier

I was reading an answer on Quora about calculating the specificity of a 3-class classifier from a confusion matrix: https://www.quora.com/How-do-I-get-specificity-and-sensitivity-from-a-three-classes-confusion-matrix. For the 3-class confusion matrix below (a screenshot from that answer, not reproduced here), the sensitivity and specificity would be found by calculating the following: My question is about the numerator for specificity (the true negatives): shouldn't it have 4 terms? For example, if we are calculating with respect to class 1, then in the table n22, n33, n32 and n23 were …
Category: Data Science
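A small sketch of one-vs-rest sensitivity and specificity from a 3-class confusion matrix (rows = actual, columns = predicted; the matrix values are made up). For class i the true negatives are all cells outside row i and column i, which is exactly 4 cells in the 3-class case.

```python
import numpy as np

# Hypothetical 3x3 confusion matrix: rows = actual class, columns = predicted class.
cm = np.array([[50,  3,  2],
               [ 4, 45,  6],
               [ 1,  5, 40]])

total = cm.sum()
for i in range(3):
    tp = cm[i, i]
    fp = cm[:, i].sum() - tp    # predicted class i but actually another class
    fn = cm[i, :].sum() - tp    # actually class i but predicted another class
    tn = total - tp - fp - fn   # everything outside row i and column i (4 cells here)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    print(f"class {i + 1}: sensitivity={sensitivity:.3f}, specificity={specificity:.3f}")
```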

How to re-train a model from false positives

I'm still a bit new to deep learning. What I'm still struggling with is: what is the best practice for re-training a good model over time? I've trained a deep model for my binary classification problem (fire vs. non-fire) in Keras. I have 4K fire images and 8K non-fire images (they are video frames), and I train with a 0.2/0.8 validation/training split. Now I test it on some videos, and I found some false positives. I add those to my negative (non-fire) set, …
Category: Data Science

How to measure accuracy of a route prediction

I developed a new route prediction algorithm and I am trying to find a metric that indicates how good a prediction was. This metric is meant to be used offline, meaning that the goal is not to measure the quality of the prediction when it is needed in real time. Instead, we are given a set $R=\{r_1,r_2,\dots,r_{|R|}\}$ of routes that occurred in the past, and for each $r_i\in R$ we take a small prefix of $r_i$ and provide it …
Category: Data Science
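One possible offline metric, offered only as an illustration and not as the author's method: treat each route as a sequence of segment ids and score the predicted continuation by the longest common subsequence it shares with the actual suffix, normalised by the suffix length.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two sequences."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)]

def route_score(predicted_suffix, actual_suffix):
    """Fraction of the actual suffix recovered by the prediction (0..1)."""
    if not actual_suffix:
        return 1.0
    return lcs_length(predicted_suffix, actual_suffix) / len(actual_suffix)

# Hypothetical road-segment ids: the prediction continues a given route prefix.
actual = ['s1', 's2', 's3', 's4', 's5', 's6']
predicted = ['s1', 's2', 's7', 's4', 's5', 's8']
print(route_score(predicted, actual))   # 4/6 ~ 0.67
```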

Can Precision-Recall be improved for imbalanced sample?

I tried out a few models on a highly imbalanced sample (~2:100) where I can get a decent AUC from the ROC curve (test sample). But when I plot precision-recall (test sample), it looks horrible, similar to the worst PR curve in panel (d). This article contains the picture below (not reproduced in this excerpt) and describes that ROC is better suited since it is invariant to class distribution. My question is whether there is anything that can be done to improve precision-recall?
Category: Data Science
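Things that are sometimes tried in this situation include class weighting, resampling and threshold tuning, none of which is guaranteed to help. A sketch comparing average precision with and without class weights on a synthetic ~2:100 problem (illustrative only, not a recipe):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import average_precision_score
from sklearn.model_selection import train_test_split

# Synthetic ~2:100 imbalanced binary problem.
X, y = make_classification(n_samples=5000, weights=[0.98, 0.02], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# Compare the area under the precision-recall curve with and without class weights.
for cw in [None, 'balanced']:
    clf = LogisticRegression(class_weight=cw, max_iter=1000).fit(X_tr, y_tr)
    scores = clf.predict_proba(X_te)[:, 1]
    print(cw, average_precision_score(y_te, scores))
```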

Using Z-test score to evaluate model performance

I think I know the answer to this question, but I am looking for a sanity check here: is it appropriate to use z-test scores to evaluate the performance of my model? I have a binary model that I developed with a NN in Keras. I know the size of my (equally balanced) training set, and it has a positive proportion of 0.5 (duh!). I know that with my business use case false positives are financially expensive, so I'm …
Category: Data Science
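If the intent is to compare an observed success proportion (say, precision on flagged cases) against a fixed baseline, a one-sample proportion z-test is one option; a sketch using statsmodels with entirely hypothetical counts:

```python
from statsmodels.stats.proportion import proportions_ztest

# Hypothetical: the model flagged 200 cases, 130 were true positives, and we ask
# whether that precision is significantly above a 0.5 baseline.
count, nobs, baseline = 130, 200, 0.5
z_stat, p_value = proportions_ztest(count, nobs, value=baseline, alternative='larger')
print(z_stat, p_value)
```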

What would be the main criteria for evaluating the auto-sklearn library?

I'm running experiments using benchmark datasets with auto-sklearn to see how its performance differs from the standard sklearn library, since AutoML does an exhaustive search over parameters while sklearn has to be tuned manually. What could be the essential criteria for judging the performance of these two libraries?
Category: Data Science
