Awesome
pytorch-wordemb
Load pretrained word embeddings (word2vec, glove format) into torch.FloatTensor for PyTorch
Install
PyTorch required.
pip install torchwordemb
Usage
import torch
import torchwordemb
torchwordemb.load_word2vec_bin(path)
read word2vec binary-format model from path
.
returns (vocab, vec)
vocab
is adict
mapping a word to its index.vec
is atorch.FloatTensor
of sizeV x D
, whereV
is the vocabulary size andD
is the dimension of word2vec.
vocab, vec = torchwordemb.load_word2vec_bin("/path/to/word2vec/model.bin")
print(vec.size())
print(vec[ w2v.vocab["apple"] ] )
torchwordemb.load_word2vec_text(path)
read word2vec text-format model from path
.
torchwordemb.load_glove_text(path)
read GloVe text-format model from path
.