gensim word2vec results - why non-nearby word first?

william007

2022年3月15日 16:19

from gensim.models import Word2Vec
model = Word2Vec(sentences = [['a','b'],['c','d']], window = 9999999, min_count=1)
model.wv.most_similar('a', topn=10)

Above code gives the following result:

[('d', 0.06363436579704285),
 ('b', -0.010543467476963997),
 ('c', -0.039232250303030014)]

shouldn't the 'b' ranked first, since it's the only one nearby 'a'?

Topic gensim word2vec

Category Data Science

gensim word2vec results - why non-nearby word first?

About