Supervised learning on sources of information with different importance

Question

Supervised learning on sources of information with different importance

Toby

2022年5月10日 19:01

I am trying to classify customer support sessions using supervised machine learning.

In each customer support session I have 3 bags of information. 1. The title of the customer's complaint 2. Information about the device the customer was using 3. Text of the chat session with the customer support agent

In each customer support session, there are 6 different classes. Is it better to: 1. Train a classifier on each bag of information and have them vote on which class the session belongs to? 2. Put all of the information into a single set of features, and train a single classifier to determine which class the session belongs to? 3. Other?

Topic aggregation feature-selection nlp machine-learning

Category Data Science

gopal krishna varshney · Accepted Answer · 2017年12月29日 10:33

Its better to do supervised classification then unsupervised clustering since you have the right labels for the response.

Now in supervised classification approach 1 is better than approach 2 because in approach 2 you will the reducing the already available information by combining features.

Approach 3 is advised where you will create more features from the given 3 features by use of NLP processing, sentiments, tags, chunking, time taken, text length etc and then selecting the features those provide more information gain using feature selection approaches.

Supervised learning on sources of information with different importance

About