How to train millions of doc2vec embeddings using a GPU?

I am trying to train a doc2vec model on user browsing history (URLs tagged to a user_id), using the Chainer deep learning framework.

There are more than 20 million embeddings (user_ids and URLs) to initialize, which do not fit in GPU memory (12 GB available at most). Training on the CPU is very slow.
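For scale, here is a rough estimate (assuming, say, 300-dimensional float32 vectors for the sake of the calculation) showing why even the embedding table alone overflows 12 GB:

```python
# Back-of-the-envelope size of the embedding table alone
# (300 dimensions and float32 are assumptions for this estimate).
n_embeddings = 20_000_000   # user_ids + urls
dim = 300
bytes_per_float = 4         # float32

table_gb = n_embeddings * dim * bytes_per_float / 1024**3
print(f"{table_gb:.1f} GB")  # ~22.4 GB, before optimizer state and activations
```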

I am attempting this with the Chainer code given here.

Please advise on any options I could try.

Topic word-embeddings deep-learning nlp

Category Data Science


One option is to switch to a deep learning framework that supports distributed training.
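Since you are already on Chainer, one concrete route is its own distributed-training extension, ChainerMN. Below is a minimal data-parallel sketch; the `DocEmbedModel` class, the toy sizes, and the synthetic dataset are placeholders for illustration, not the code linked in the question.

```python
# Sketch of data-parallel doc2vec-style training with ChainerMN.
# Launch with MPI, e.g.:  mpiexec -n 4 python train_doc2vec.py
import numpy as np
import chainer
import chainer.functions as F
import chainer.links as L
import chainermn
from chainer import training


class DocEmbedModel(chainer.Chain):
    """Minimal PV-DBOW-style model: a document vector predicts a word id."""

    def __init__(self, n_docs, n_vocab, n_units):
        super().__init__()
        with self.init_scope():
            self.doc_embed = L.EmbedID(n_docs, n_units)
            self.out = L.Linear(n_units, n_vocab)

    def __call__(self, doc_ids, word_ids):
        h = self.doc_embed(doc_ids)
        return F.softmax_cross_entropy(self.out(h), word_ids)


comm = chainermn.create_communicator('pure_nccl')  # NCCL backend for GPUs
device = comm.intra_rank                            # one GPU per MPI process

# Toy sizes so the sketch runs anywhere; in the real setting n_docs
# would be the ~20M user_id/url table.
model = DocEmbedModel(n_docs=100_000, n_vocab=50_000, n_units=300)
chainer.cuda.get_device_from_id(device).use()
model.to_gpu()

# Wrapping the optimizer all-reduces gradients across workers.
optimizer = chainermn.create_multi_node_optimizer(
    chainer.optimizers.Adam(), comm)
optimizer.setup(model)

# Synthetic (doc_id, word_id) pairs; rank 0 scatters a shard to each worker.
if comm.rank == 0:
    doc_ids = np.random.randint(0, 100_000, size=1_000_000).astype(np.int32)
    word_ids = np.random.randint(0, 50_000, size=1_000_000).astype(np.int32)
    dataset = chainer.datasets.TupleDataset(doc_ids, word_ids)
else:
    dataset = None
dataset = chainermn.scatter_dataset(dataset, comm, shuffle=True)

train_iter = chainer.iterators.SerialIterator(dataset, batch_size=1024)
updater = training.StandardUpdater(train_iter, optimizer, device=device)
trainer = training.Trainer(updater, (5, 'epoch'), out='result')
trainer.run()
```

Keep in mind that plain data parallelism replicates the full embedding matrix on every GPU, so on its own it will not squeeze a 20M-row table into 12 GB; you would still need a smaller embedding dimension, half-precision weights, or some form of embedding sharding / host-memory embeddings on top of it.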
