Binary document classification using keywords for a very small dataset
I have a set of 150 documents with their assigned binary class. I also have 1000 unlabeled documents. Each document is about the length of a journal paper. Each class has 15 associated keywords.
I want to be able to predict the assigned class of the documents using this information. Does anyone have any ideas of how I could approach this problem?