How to represent a document in test data with the Document-Term Matrix created from the training set?
I build a classifier of documents using the vector representation of each document in the training set (i.e a row in the Document-Term Matrix). Now I need to test the model on the test data. But how can I represent a new document with the Document-Term Matrix since some terms might not be included in training data?
Topic vector-space-models lsi text-mining python
Category Data Science