What are the differences between self-supervised and semi-supervised learning in NLP?
The GPT-1 paper mentions both semi-supervised learning and unsupervised pre-training, but they seem like the same thing to me. Moreover, "Semi-supervised Sequence Learning" by Dai and Le also looks more like self-supervised learning. So what are the key differences between them?
Topic pretraining semi-supervised-learning nlp
Category Data Science