How does Amazon's "Reviews that mention" feature extract topics from reviews?

Question

user2301346

November 17, 2021 12:50

Amazon product pages contain a section called "Reviews that mention". The section lists the main things that users liked or disliked about the product. For example, see this page. How exactly does it work?

This could be done with topic modelling using LDA, but that approach has several drawbacks:

You need to choose the number of topics upfront. In Amazon reviews, however, the number of topics varies from product to product. It is not the same even for products that belong to the same category.
You need to give each topic a friendly name. With so many products, it is unlikely that Amazon does that by hand.
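
To make both drawbacks concrete, here is a minimal sketch of LDA written as a small collapsed Gibbs sampler in plain Python (in practice one would more likely use a library implementation). The function name `lda_gibbs`, the toy reviews, and the hyperparameter values are assumptions made up for illustration. Note that `n_topics` has to be passed in before anything is learned, and that each resulting topic is only a list of words with no human-readable name.

```python
import random
from collections import Counter


def lda_gibbs(docs, n_topics, n_iters=200, alpha=0.1, beta=0.01, seed=0):
    """Toy collapsed Gibbs sampler for LDA; returns up to 3 top words per topic."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_topic = [[0] * n_topics for _ in docs]         # topic counts per document
    topic_word = [Counter() for _ in range(n_topics)]  # word counts per topic
    topic_total = [0] * n_topics                       # total words per topic
    assignments = []                                   # topic of every token

    def update(d, w, k, delta):
        doc_topic[d][k] += delta
        topic_word[k][w] += delta
        topic_total[k] += delta

    # Start from a random topic for every token.
    for d, doc in enumerate(docs):
        assignments.append([rng.randrange(n_topics) for _ in doc])
        for w, k in zip(doc, assignments[d]):
            update(d, w, k, +1)

    # Resample each token's topic given all the other assignments.
    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                update(d, w, assignments[d][i], -1)
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(n_topics)
                ]
                k_new = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k_new
                update(d, w, k_new, +1)

    return [[w for w, c in tw.most_common(3) if c > 0] for tw in topic_word]


# Invented toy reviews, already stripped of stop words.
reviews = [
    "battery lasts long battery charge fast",
    "battery charge slow battery drains",
    "screen bright screen sharp display",
    "display dim screen scratches",
]
docs = [r.split() for r in reviews]

# The number of topics must be chosen here, before seeing any result.
topics = lda_gibbs(docs, n_topics=2)
for i, words in enumerate(topics):
    print(f"topic {i}: {words}")  # unnamed word lists, not friendly labels
```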

What approach would be suitable for doing this in a completely unsupervised way, without the drawbacks mentioned above?

Topic real-ml-usecase topic-model nlp

Category Data Science

German C M · Accepted Answer · November 17, 2021 12:45

One possible approach I can see is as follows:

Amazon maintains a set of frequent categories (i.e. labels, in a classification context), derived from its historical data up to now and re-checked at some regular interval.
On the product page you linked, you can see the categories being considered, along with the most frequent terms users have written in their reviews, which are used as filters.

By applying techniques such as word embeddings, you can build a classifier that determines which category each of those terms belongs to, based on some predefined category labels.

New categories could be found with unsupervised clustering techniques.
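
The steps above could be sketched roughly as follows. This is only an illustration of the idea, not a description of what Amazon actually runs: the 3-dimensional vectors in `TOY_VECTORS`, the category names, the stop-word list, the reviews, and the 0.8 similarity threshold are all invented, and a real system would use pretrained embeddings instead. Frequent terms are matched to the nearest predefined category by cosine similarity, and terms that match nothing are grouped by a simple threshold-based clustering, so the number of new categories does not have to be fixed in advance.

```python
import math
import re
from collections import Counter

STOPWORDS = {"the", "a", "is", "and", "but", "was", "it"}

# Hand-made toy "embeddings" (assumption); real ones would be pretrained.
TOY_VECTORS = {
    "battery": (0.9, 0.1, 0.0),
    "charge": (0.8, 0.2, 0.1),
    "screen": (0.1, 0.9, 0.0),
    "display": (0.2, 0.8, 0.1),
    "shipping": (0.0, 0.1, 0.9),
    "delivery": (0.1, 0.0, 0.9),
}

# Predefined category labels with invented vectors.
KNOWN_CATEGORIES = {
    "battery life": (1.0, 0.0, 0.0),
    "screen quality": (0.0, 1.0, 0.0),
}


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def frequent_terms(reviews, min_reviews=2):
    """Terms appearing in at least `min_reviews` distinct reviews."""
    counts = Counter()
    for review in reviews:
        tokens = set(re.findall(r"[a-z]+", review.lower()))
        counts.update(tokens - STOPWORDS)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, n in ranked if n >= min_reviews]


def assign_terms(terms, threshold=0.8):
    """Map each term to its nearest known category, or leave it over."""
    assigned, leftovers = {}, []
    for term in terms:
        vec = TOY_VECTORS.get(term)
        if vec is None:
            continue  # no embedding available for this term
        best = max(KNOWN_CATEGORIES, key=lambda c: cosine(vec, KNOWN_CATEGORIES[c]))
        if cosine(vec, KNOWN_CATEGORIES[best]) >= threshold:
            assigned.setdefault(best, []).append(term)
        else:
            leftovers.append(term)
    return assigned, leftovers


def cluster_leftovers(terms, threshold=0.8):
    """Greedy clustering: join the first cluster whose first term is similar."""
    clusters = []
    for term in terms:
        for cluster in clusters:
            if cosine(TOY_VECTORS[term], TOY_VECTORS[cluster[0]]) >= threshold:
                cluster.append(term)
                break
        else:
            clusters.append([term])  # start a candidate new category
    return clusters


reviews = [
    "The battery is great and the screen is sharp",
    "Battery lasts long but shipping was slow",
    "Screen is bright, delivery was late",
    "Fast charge, nice display, slow shipping",
    "Display is dim and the charge is slow, delivery took weeks",
]

assigned, leftovers = assign_terms(frequent_terms(reviews))
clusters = cluster_leftovers(leftovers)
print(assigned)
print(clusters)
```

In this toy data the shipping-related terms match neither predefined label, so they end up together as a candidate new category, which is the role the clustering step plays in the approach above.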
