Balanced data sets are important in regression. Especially for accurate predictions through out the space. If the data is skewed towards one end of the distributions, predictions might be overly weighted to that region. This can cause problems if future data does not follow the same distribution.

One possible reason unbalanced data is less well covered in regression is because it is not assumption of the modeling fitting process and often the hold-data set is often part of the same batch as the training dataset. It can become an issue if a regression model goes in production and the data distribution changes.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.