Transformer model comparison for binary sentiment classification

I am comparing XLNet and BERT on a binary sentiment classification task across two independent datasets: a Twitter dataset, where the texts are short, and the IMDB review dataset, where the texts are long.
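For context, here is a minimal sketch of how such a comparison might be set up with the Hugging Face `transformers` and `datasets` libraries. The model names, hyperparameters, and the choice of the IMDB dataset loader are illustrative assumptions, not my exact configuration; the Twitter dataset would be loaded and tokenized analogously.

```python
# Sketch of the comparison setup (assumptions: HF transformers/datasets,
# illustrative hyperparameters). Not my exact training configuration.
from transformers import (AutoTokenizer,
                          AutoModelForSequenceClassification,
                          TrainingArguments, Trainer)
from datasets import load_dataset


def fine_tune(model_name, dataset, max_length):
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    # Binary sentiment: two output labels (negative/positive).
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name, num_labels=2)

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True,
                         padding="max_length", max_length=max_length)

    encoded = dataset.map(tokenize, batched=True)
    args = TrainingArguments(output_dir=f"out-{model_name}",
                             num_train_epochs=3,
                             per_device_train_batch_size=16)
    trainer = Trainer(model=model, args=args,
                      train_dataset=encoded["train"],
                      eval_dataset=encoded["test"])
    trainer.train()
    return trainer.evaluate()


# IMDB reviews are long, so max_length=512; for tweets a much
# smaller max_length (e.g. 64) would suffice.
imdb = load_dataset("imdb")
for name in ("bert-base-uncased", "xlnet-base-cased"):
    print(name, fine_tune(name, imdb, max_length=512))
```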

On the Twitter dataset, BERT matches and slightly outperforms XLNet, while XLNet outperforms BERT on the IMDB dataset. I understand that XLNet captures longer-range dependencies thanks to its Transformer-XL architecture, which would explain its advantage on the long IMDB reviews; but what additional reasons might explain why one model outperforms the other on a given dataset? In particular, why is BERT comparable to, or even better than, XLNet at classifying social media sentiment?

Topics: binary-classification, bert, sentiment-analysis, language-model, nlp

Category: Data Science
