How to interpret different coherence values
For an experiment with topic models, I have calculated four coherence values using Gensim's implementation:
- c_v
- u_mass
- c_uci
- c_npmi
From this paper, I know that c_v correlates mostly with human interpretation. For this reason, this seems to be the best score to use for topic evaluation. However, are there arguments for using the other measures? And how can these values be interpreted? They seem to be in a different range.
Topic coherence metric lda topic-model performance
Category Data Science