When to choose character instead of factor in R?
I am currently working on a dataset which contains a name attribute, which stands for a person's first name. After reading the csv file with read.csv, the variable is a factor by default (stringsAsFactors=TRUE) with ~10k levels. Since name does not reflect any group membership, I am uncertain to leave it as factor.
Is it necessary to convert name to character? Are there some advantages in doing (or not doing) this? Does it even matter?
Topic data-wrangling r
Category Data Science