Do we really need <unk> tokens?
I am wondering, do we really need unk
tokens?
Why do we limit our vocabulary?
Is it for speed? Accuracy?
If we disable all limitations, what do you predict happens?
Topic sequence-to-sequence lstm machine-translation machine-learning
Category Data Science