String together a set of tokens into a sequence
I have the following problem: given a set of tokens, string all of them, or a subset of them, together into a sequence, using stop words to connect them. I know that I can obtain potentially unlimited pre-training data for this problem. For example, given the set of tokens {cat, jump, mouse}, possible outputs might be: (a) "the cat jumped on a mouse", (b) "the cat and the mouse jumped", (c) "cats jump", and so on.
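To make the data claim concrete, here is a minimal sketch of how such (token set, sentence) pairs could be derived from ordinary text. It assumes the pairs are built by removing stop words from existing sentences and shuffling what remains; the `STOP_WORDS` list and the `make_pair` helper are hypothetical names used only for illustration.

```python
import random
import re

# Tiny illustrative stop-word list (an assumption; a real list would be larger).
STOP_WORDS = {"a", "an", "the", "and", "on", "in", "of", "to", "is"}

def make_pair(sentence, rng=None):
    """Return (shuffled content tokens, target sentence) for one training example."""
    # Lowercase and keep only alphabetic words (apostrophes allowed).
    words = re.findall(r"[a-z']+", sentence.lower())
    # Content tokens are whatever is left after dropping stop words.
    content = [w for w in words if w not in STOP_WORDS]
    # De-duplicate while keeping first-seen order, then shuffle to hide word order.
    tokens = list(dict.fromkeys(content))
    (rng or random).shuffle(tokens)
    return tokens, " ".join(words)

# The input side is an unordered token list; the output side is the full sentence.
tokens, target = make_pair("The cat jumped on a mouse", random.Random(0))
print(tokens, "->", target)
```

Note that this sketch keeps surface forms (`jumped`, not `jump`), so it does not cover the inflection shown in the example above; that would need an additional lemmatization step.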
I am not sure whether this is a well-studied problem, or which directions or model architectures I should explore. Thanks in advance.
Topic sequence-to-sequence machine-learning
Category Data Science