Data preprocessing methods

  • Data cleaning
  • Handling class imbalance (classification)
  • Data smoothing (reducing noise)
  • Creating or deleting features from the original data
  • Data transformation (Box-Cox, log transform)
  • Making the dataset stationary (time series)
  • Other domain-specific preprocessing methods in NLP and computer vision

I am researching data preparation methods, and so far these are the ones I could find. Do you think anything is missing? Thanks.
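To make two of the items above concrete, here is a minimal sketch of a log transform and of first-order differencing, using only the Python standard library. The function names and sample values are made up for illustration. Note that differencing is just one common way to remove a trend; it does not guarantee that the resulting series is stationary.

```python
import math


def log_transform(values):
    """Apply log(1 + x) to compress a right-skewed, non-negative variable."""
    return [math.log1p(v) for v in values]


def difference(series, lag=1):
    """Return series[t] - series[t - lag], which removes a linear trend when lag=1."""
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


# Hypothetical sample data, chosen only to keep the arithmetic easy to follow.
skewed = [0, 9, 99, 999]
trend = [10, 12, 14, 16, 18]

logged = log_transform(skewed)   # values grow roughly linearly instead of tenfold
diffed = difference(trend)       # the constant step of 2 is all that remains

print(logged)
print(diffed)
```

In practice you would probably reach for a library such as NumPy or pandas for both steps; the plain-Python version is only meant to show what the operations do.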

Topic data preprocessing data-cleaning data-mining machine-learning

Category Data Science


Here are some.

  1. Dealing with variable types
  2. Dealing with missing data
  3. Encoding categorical variables
  4. Categorical variables: cardinality
  5. Categorical variables: rare labels
  6. Dealing with outliers
  7. Variable transformations
  8. Variable discretization
  9. Feature scaling, and the list goes on.

The question as asked is too broad, so these are only some of the steps you will need to take care of.
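As a rough illustration of a few of the steps listed above (missing data, rare labels, categorical encoding, feature scaling), here is a small standard-library sketch. All function names, the `min_count` threshold, the `"Other"` label, and the sample data are assumptions made for this example, not a reference implementation. Median imputation and min-max scaling are just one choice among several for each step.

```python
from collections import Counter
from statistics import median


def impute_median(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def group_rare_labels(labels, min_count=2, other="Other"):
    """Collapse labels seen fewer than min_count times into one bucket."""
    counts = Counter(labels)
    return [lab if counts[lab] >= min_count else other for lab in labels]


def one_hot(labels):
    """Return the sorted categories and one 0/1 row per input label."""
    categories = sorted(set(labels))
    rows = [[1 if lab == c else 0 for c in categories] for lab in labels]
    return categories, rows


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical toy columns.
ages = [20, None, 40, 30]
cities = ["Paris", "Paris", "Oslo", "Lima"]

ages_filled = impute_median(ages)
cities_grouped = group_rare_labels(cities)
categories, encoded = one_hot(cities_grouped)
ages_scaled = min_max_scale(ages_filled)

print(ages_filled, cities_grouped, categories, encoded, ages_scaled)
```

Grouping rare labels before one-hot encoding also helps with the cardinality point, since it keeps the number of generated columns down. For real work, libraries such as pandas and scikit-learn provide tested versions of these steps.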
