Can we use doc2vec to detect outlier documents?
I have a set of documents and I want to identify and remove the outlier documents. I am just wondering if doc2vec can be used for this task.
Or are there any recently evolved, promising algorithms that I can use for this task?
EDIT
I am currently using a bag of words model to identify outliers.
Topic gensim word2vec outlier nlp data-mining
Category Data Science