Using VADER Sentiment Analysis makes distributions overlap : how to improve my model

I use VADER Sentiment Analysis on a customer reviews dataset.

VADER breaks down feelings of satisfaction and dissatisfaction into neutral and positive negative components. Plotting the distributions, I see that those of satisfied and dissatisfied customers overlap quite a bit.

I would like to know if I can improve my model: I imagined training it only on non-overlapping dataset values, is this a correct method? Is there another method?

I am a beginner, please be cool, thanks again for your help.

Topic machine-learning-model nltk sentiment-analysis

Category Data Science

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.