rmse - Geeks Mental

My Linear Regression Model Mean Absolute Error(MAE) is 0.29 and R2 0.20 , Is this a acceptable Model?

Aadhil Imam

2022年5月26日 12:17

My Linear Regression Model Mean Absolute Error(MAE) is 0.29 and R2 0.20 , Is this a acceptable Model ? How can increase the r2 score ?

Topic: rmse linear-regression regression python machine-learning

Category: Data Science

How to reduce RMS error value in regression analysis & predictions - feature engineering, model selection

Centauri

2022年5月25日 02:01

There's this dataset containing the metadata of Twitch's top 1,000 streamers of 2020. You can have the details here. I am currently participating in a challenge to predict the values for Followers gained, by creating and training the model using the remaining features from the dataset. The kernel objective is to get the lowest RMSE (Root-Mean Squared Error) metric value from the model's predictions. Until now, I have made numerous attempts to lower down the RMSE loss value as much …

Topic: rmse tensorflow regression deep-learning machine-learning

Category: Data Science

How could we interpret a SI Scatter Index and RMSE?

Hich

2022年5月17日 11:00

SI is RMSE divided by the average value of the observed values (or the predicted values? am confused)? is SI = 25% acceptable? (is the model good enough? )

Topic: scatter-index rmse metric predictive-modeling

Category: Data Science

Why does log-transforming the target have a huge impact on MSE value?

Caterina

2022年5月4日 18:56

I am doing linear regression using the Boston Housing data set, and the effect of applying $\log(y)$ has a huge impact on the MSE. Failing to do it gives MSE=34.94 while if $y$ is transformed, it gives 0.05.

Topic: transformation rmse mse feature-scaling

Category: Data Science

Which error metric is good for measuring accuracy

Chris

2022年4月11日 06:03

I am estimating water depth with satellite data (predicted value) and would like to validate my result using bathymetry lidar data collected on the field and believed to be more accurate (observed value). I have different observations at each water depth. For example, number of observations at water depth range of 0-10 m are 300, where as values at deeper depth range (10 - 20 m) are less (~50 points). I have been using RMSE (as I would like to …

Topic: rmse metric

Category: Data Science

Measuring performance of customer purchase predictions

Shlomi Schwartz

2022年3月24日 09:12

My goal is to develop a model that predicts next customer purchases in USD (Update: During the time period of the dataset, if no purchase was made by the customer, the next purchase label is set to zero). I am trying to determine what would be the most effective metric for measuring the model's performance. Results looks like so: y_true_usd y_predicted_usd 1.2 0.8 0 0.3 0 1.1 0 0 0 0.1 5.3 4.3 First I thought about going with RMSE, …

Topic: rmse imbalanced-data metric predictive-modeling

Category: Data Science

Feature engineering: The more features I add the better RMSE I get?

the phoenix

2022年3月16日 11:05

I have a model with 7 features, I'm trying to figure out if I can improve the performance of this model by adding additional features. So I'm relying on the RMSE to measure the accuracy of my predictions. from 7 features I get to 25 features and with each time I add a new feature, the RMSE slightly gradually get better (smaller). I find it hard to believe that all of these features improved the performance of my model as …

Topic: rmse feature-engineering feature-selection predictive-modeling machine-learning

Category: Data Science

High loss but low rmse, how?

Stupid_Intern

2022年3月11日 14:02

I have trained an lstm model on a dataset but its loss during training is ten times than the rmse during test. How is it possible, and can I use this model if rmse is very low but loss is high? How can I improve training and test loss?

Topic: loss rmse hyperparameter-tuning lstm

Category: Data Science

How many features do I select when doing feature selection for regression algorithms? Is R2 and RMSE good measures of success for overfitting?

pythonnoob2

2022年3月4日 17:00

Context: I'm currently crafting and comparing machine learning models to predict housing data. I have around 32000 data points, 42 features, and I'm predicting housing price. I'm comparing Random Forest Regressor, Decision Tree Regressor, and Linear Regression. I can tell there is some overfitting going on, as my initial values vs cross validated values are as follows: RF: 10 Fold R Squared = 0.758, neg RMSE = -540.2 vs unvalidated R Squared of 0.877, RMSE of 505.6 DT: 10 Fold …

Topic: rmse overfitting pearsons-correlation-coefficient regression feature-selection

Category: Data Science

What does rmse of a LSTM model tells?

Stupid_Intern

2022年2月15日 12:21

Suppose I made a model which has rmse of 50 Now when I predict the next data which is 500 So does that mean the actual value has high probability to be within the range of 450 - 550 ? If so what is the probability that it will be in this range? Or it means the actual value has high probability to be within the range of 475- 525 ? If so what is the probability that it will …

Topic: rmse lstm time-series

Category: Data Science

Appropriate loss function and metrics for regression task with mixed outputs

SagRU

2021年12月10日 06:44

I'm trying to train an EfficientNet-based Keras model that takes an image as input and returns two numeric values as output. Here's the model: def prepare_model_eff(input_shape): inputs = Input(shape=input_shape) x = EfficientNetB3(include_top=False, input_shape=input_shape)(inputs) x.trainable = True x = layers.GlobalAveragePooling2D()(x) x = layers.Dropout(rate=0.1, )(x) x = layers.BatchNormalization()(x) out_1 = layers.Dense(1, activation='linear', name='out_1')(x) out_2 = layers.Dense(1, activation='linear', name='out_2')(x) model = Model(inputs=inputs, outputs=[out_1, out_2]) As far as I know, the most common metric for such tasks is Root Mean Square Error (RMSE): def …

Topic: rmse mse keras regression

Category: Data Science

Determining which model result is better

justanewb

2021年10月14日 02:23

I am trying to determine which model result is better. Both results are trying to achieve the same objective, the only difference is the exact data that is being used. I used random forest, xgboost, and elastic net for regression. Here is one of the results that has low rmse but not so good r2 model n_rows_test n_rows_train r2 rmse rf 128144 384429 0.258415240861579 8.44255341472637 xgb 128144 384429 0.103772500839367 9.28116624462333 e-net 128144 384429 0.062460300392487 9.49266713837073 The other model run has …

Topic: rmse r-squared regression machine-learning

Category: Data Science

Perform bootstrapping of an ordinary linear regression model, using B=100 bootstrap resamples of my dataset, and getting RMSE

Robbie Meaney

2021年10月4日 05:06

So Im studying machine learning through R, and Im working with the boston data set from the library MASS. I am practicing bootsrapping. I already carried out analysis to determine how ,many distinct data points on average are drawn from the sample to make up a bootsratp resample, using B=100 resamples of the dataset. Next I would like to do two things- perform boostrapping of an ordinary linear regression model using B=100 resamples of the data set again and use …

Topic: bootstraping rmse r machine-learning

Category: Data Science

Difference in result in every run of Neural network?

john22

2021年9月30日 11:40

I have written a simple neural network (MLP Regressor), to fit simple data frame columns. To have an optimum architecture, I also defined it as a function to see whether it is converging to a pattern. But every time that I run the model, it gives me a different result than the last time that I tried, and I do not know why? Due to the fact that it is fairly difficult to make the question reproducible, I can not …

Topic: rmse scikit-learn neural-network python

Category: Data Science

Comparing RMSEs of multiple test sets having different sizes

Aditya Kulkarni

2021年8月14日 10:21

The data I have is a time series data (stock returns), and I am training a Random Forest Regressor on it. Total observations = 2499 To better evaluate the performance, I have implemented rolling windows testing with training window sizes = 500, 700, 900,..., 2100. Though instinctively it would seem obvious to choose a window size which produced lowest RMSE, how can I be sure that the comparison is fair? I mean with increasing window size, the test set size …

Topic: model-evaluations rmse error-handling machine-learning

Category: Data Science

Low MAE, RMSE, RMSLE and MAPE, but also a low R^2

n.mathfreak

2021年7月6日 13:54

I have a dataframe containing the IDs of 2000 questions, a list of scores representing difficulty, and the following features: how often the question was answered, how often the answer has been changed because the students were undecided, a normalized "frequency of changing the answers" (so the last two feature divided) and the average time spent on a question. The most important seems to be this normalized frequency (50%), then the average time (22%), how often the question was answered …

Topic: rmse classification machine-learning

Category: Data Science

Why is linear regression not doing worse with a low weighted attribute?

Bartiez

2021年6月5日 21:59

I've been able to build a few linear regression models that can predict a material strength quite well: minimum RMSE of 17.95 using 11 attributes that I have selected from 159 original attributes. The data is distributed with mean=234.4 and stdev=19.9. I am working in Orange3. When using only the highest weighted attribute (weight 8.013) the model calculates RMSE of 18.767. If I use only the lowest weighted attribute (weight 0.051) the RMSE is 20.007. The difference is 1.24, or …

Topic: rmse machine-learning-model linear-regression machine-learning

Category: Data Science

What is bad, good and excellent metric score for time series model?

Vasilii Naumushkin

2021年5月20日 04:25

I have created a couple of models for my master project and I used several metrics for evaluation. I used MSE, MAE, MAPE, RMSE not because I really learned about them a lot, because I saw in many other projects these metrics being used. Now I have a problem, I need to interpret results. I search for some articles or some studies that classify metrics performance as good or bad or excellent. The only material I found now is this …

Topic: rmse mse time-series

Category: Data Science

How to add RMSE value on a plot with ggplot

RS_girl

2021年3月30日 15:12

I added r2 value and the formula of the regression function but I also want RMSE value on my plot, maybe I need to add something but I could not see a proper answer to this question neither here nor google... ggplot(data = AGB.rf$pred) + geom_point(mapping = aes(x = pred, y = obs, color = pred, shape=1))+ geom_smooth(mapping = aes(x = pred, y = obs), method="lm", se = FALSE)+ stat_cor(aes(x = pred, y = obs, label = ..rr.label..),label.y = 3000)+ …

Topic: rmse ggplot2 r

Category: Data Science

How to interpret the Mean squared error value in a regression model?

the phoenix

2021年3月8日 19:57

I'm working on a simple linear regression model to predict 'Label' based on 'feature'. The two variables seems to be highly correlate corr=0.99. After splitting the data sample for to training and testing sets. I make predictions and evaluate the model. metrics.mean_squared_error(Label_test,Label_Predicted) = 99.17777494521019 metrics.r2_score(Label_test,Label_Predicted) = 0.9909449021176512 Based on the r2_score my model is performing perfectly. 1 being the highest possible value. But when it comes to the mean squared error, I don't know if it shows that my model …

Topic: rmse regression python predictive-modeling machine-learning

Category: Data Science

About