Needed: Java library to calculate text readability/complexity

In principle, I want the same as this but for Java (and ideally for multiple languages), e.g. Flesch Reading Ease, SMOG Index, Flesch-Kincaid Grade, Coleman-Liau Index, Automated Readability Index, Dale-Chall Readability Score, Linsear Write Formula, Gunning Fog, etc.

I guess there must be plenty of libraries, but I just can't find them.

Topic text java nlp

Category Data Science


I looked for this as well and found only the unmaintained RAT library.

That said, nearly all of these algorithms (most are given as formulas in this paper) can be translated directly into Java from the single Python file, which has no extra dependencies.
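To give an idea of how small such a translation is, here is a minimal sketch of two of the formulas that need only character, word and sentence counts: the Automated Readability Index and the Coleman-Liau Index. The class name `ReadabilityFormulas` and its helper methods are hypothetical, and the tokenization is deliberately naive (whitespace-separated words, runs of `.`, `!` or `?` as sentence ends), so treat it as a starting point rather than a reference implementation.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Illustrative helper; the class and method names are hypothetical. */
final class ReadabilityFormulas {
    private static final Pattern SENTENCE_END = Pattern.compile("[.!?]+");

    private ReadabilityFormulas() {}

    /** Counts whitespace-separated tokens. */
    static int countWords(String text) {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    /** Naive sentence count: runs of '.', '!' or '?', with a minimum of 1. */
    static int countSentences(String text) {
        Matcher m = SENTENCE_END.matcher(text);
        int n = 0;
        while (m.find()) {
            n++;
        }
        return Math.max(n, 1);
    }

    /** Counts characters; digits are included only when includeDigits is true. */
    static int countChars(String text, boolean includeDigits) {
        int n = 0;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c) || (includeDigits && Character.isDigit(c))) {
                n++;
            }
        }
        return n;
    }

    /** ARI = 4.71 * (characters / words) + 0.5 * (words / sentences) - 21.43 */
    static double automatedReadabilityIndex(String text) {
        int words = requireWords(text);
        double charsPerWord = (double) countChars(text, true) / words;
        double wordsPerSentence = (double) words / countSentences(text);
        return 4.71 * charsPerWord + 0.5 * wordsPerSentence - 21.43;
    }

    /** CLI = 0.0588 * L - 0.296 * S - 15.8, with L and S per 100 words. */
    static double colemanLiauIndex(String text) {
        int words = requireWords(text);
        double lettersPer100 = 100.0 * countChars(text, false) / words;
        double sentencesPer100 = 100.0 * countSentences(text) / words;
        return 0.0588 * lettersPer100 - 0.296 * sentencesPer100 - 15.8;
    }

    private static int requireWords(String text) {
        int words = countWords(text);
        if (words == 0) {
            throw new IllegalArgumentException("text contains no words");
        }
        return words;
    }
}
```

Calling `ReadabilityFormulas.automatedReadabilityIndex(text)` on any non-empty string returns the score as a `double`. The sentence splitter will miscount abbreviations and decimals such as "3.14", so for real use you would probably swap in something like `java.text.BreakIterator` or an NLP tokenizer.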

The one exception was the Linsear Write formula, because it needs a syllable-counting method. If you use the CMU Pronouncing Dictionary or an equivalent, though, you can easily build the countSyllables method, and the rest of the algorithm is trivial.
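As a rough sketch of that approach: in the CMU dictionary each entry is a word followed by its phonemes, and every vowel phoneme ends in a stress digit (0, 1 or 2), so the syllable count is simply the number of phonemes ending in a digit. The class below is hypothetical (`LinsearWrite`, its constructor and the vowel-group fallback for out-of-dictionary words are my own assumptions, not part of any library), and it expects you to pass in the dictionary lines yourself. The scoring follows the commonly cited Linsear Write rule for a roughly 100-word sample: 1 point per word with up to two syllables, 3 points per word with three or more, divided by the sentence count, then adjusted and halved.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Illustrative sketch; the class and method names are hypothetical. */
final class LinsearWrite {
    private static final Pattern VOWEL_GROUP = Pattern.compile("[aeiouy]+");
    private static final Pattern SENTENCE_END = Pattern.compile("[.!?]+");

    private final Map<String, Integer> syllables = new HashMap<>();

    /** Builds the lookup from CMUdict-style lines, e.g. "HELLO  HH AH0 L OW1". */
    LinsearWrite(List<String> dictLines) {
        for (String line : dictLines) {
            if (line.startsWith(";;;")) continue;      // comment line
            String[] parts = line.trim().split("\\s+");
            if (parts.length < 2) continue;            // blank or malformed
            String word = parts[0].toLowerCase(Locale.ROOT);
            if (word.endsWith(")")) continue;          // alternate pronunciation
            int count = 0;
            for (int i = 1; i < parts.length; i++) {
                String phoneme = parts[i];
                // Vowel phonemes carry a trailing stress digit.
                if (Character.isDigit(phoneme.charAt(phoneme.length() - 1))) {
                    count++;
                }
            }
            syllables.putIfAbsent(word, count);
        }
    }

    /** Dictionary lookup, falling back to counting vowel groups (assumption). */
    int countSyllables(String rawWord) {
        String word = rawWord.toLowerCase(Locale.ROOT).replaceAll("[^a-z']", "");
        Integer known = syllables.get(word);
        if (known != null) return known;
        Matcher m = VOWEL_GROUP.matcher(word);
        int n = 0;
        while (m.find()) n++;
        return Math.max(n, 1);
    }

    /** Scores a sample of text; the formula is defined for about 100 words. */
    double score(String sample) {
        int points = 0;
        for (String token : sample.trim().split("\\s+")) {
            if (token.replaceAll("[^A-Za-z']", "").isEmpty()) continue;
            points += countSyllables(token) >= 3 ? 3 : 1;
        }
        Matcher m = SENTENCE_END.matcher(sample);
        int sentences = 0;
        while (m.find()) sentences++;
        double result = (double) points / Math.max(sentences, 1);
        if (result <= 20) result -= 2;
        return result / 2;
    }
}
```

In practice you would read the dictionary file into a `List<String>` (for example with `Files.readAllLines`) and pass it to the constructor once. The fallback heuristic is crude, so words missing from the dictionary may be miscounted.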
