Hello,
I have couple of questions regarding using pretrain
command in spacy.
-
Does the dimension of the vector model matters?
I saw this line in https://github.com/explosion/spaCy/blob/16aa092fb5cffb5ec7079951ea0c04cb96733b3e/spacy/cli/pretrain.py#L318
where theMaxout
layer has output dim as 300 (if I understand here correctly). A interesting thing is for the language modelen_vectors_web_lg
, the vector size is also 300. Is this a coincidence or some specific design? -
If I use
nlp = spacy.load('en_core_web_lg')
, I think this should also be counted as using "pretrained" model, is that right? Sinceen_core_web_lg
anden_vectors_web_lg
have the same vector (all from Glove, if I understand it correctly), then what is the difference between usingnlp = spacy.load('en_core_web_lg')
and use atok2vec
pretrained withen_vectors_web_lg
.
Thanks,