Validating classification results
I created a model for only 2 classes and the classification report was:
Although accuracy looks good, I don't think this model is good. The original data has 522 records of class 1 and 123 of class 2. So, I think that the model is guessing for the most common (class 1). When I applied the model on the original data, it was predicted 585 class 1 and 60 class 2.
When I balanced the classes, the results were:
The model application on original data generated 396 for class 1 and 249 for class 2. Since I'm going to use this model for prediction, it still doesn't look good to me.
My evaluation, in this case, was: I multiplied that class-2 predicted number by precision: 0.65 x 249 = 162, without considering the classes predicted as class 1, when in true are class 2. These values as very bigger than the original count (123 records)
Is this evaluation correct? Are there other ways to evaluate this model?
Topic model-evaluations classification
Category Data Science