Stop words list to use for CountVectorization
The sci-kit learn library by defaults provides two options either no stop words
or one can specify stop_words=english to include a list of predefined English words
I am using Naive Bayes for SMS spam detection. Is there any other list of stop words
I can experiment with?
Topic naive-bayes-algorithim scikit-learn
Category Data Science