How to interpret different coherence values

For an experiment with topic models, I have calculated four coherence values using Gensim's implementation:

  1. c_v
  2. u_mass
  3. c_uci
  4. c_npmi

From this paper, I know that c_v correlates mostly with human interpretation. For this reason, this seems to be the best score to use for topic evaluation. However, are there arguments for using the other measures? And how can these values be interpreted? They seem to be in a different range.

Topic coherence metric lda topic-model performance

Category Data Science

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.