Hello, I am trying to train a model using the en_core_web_lg
model as the --base-model
.
Here is the code I am using:
prodigy train new_model --textcat-multilabel annotated_tweets --eval-split 0.2 --base-model en_core_web_lg
I am using an Anaconda environment set up with the following commands:
conda create -n prodigy python=3.10.4
conda activate test_env
pip install prodigy -f https://XXX-XXX-XXXX-XXX@download.prodi.gy
#I have tried both of the following to download the model (both seem to give the same error)
python -m spacy download en_core_web_lg
conda install -c conda-forge spacy-model-en_core_web_lg
This is the error that I get:
ℹ Using CPU
========================= Generating Prodigy config =========================
ℹ Auto-generating config with spaCy
ℹ Using config from base model
✔ Generated training config
=========================== Initializing pipeline ===========================
✘ Config validation error
Bad value substitution: option 'width' in section 'components.textcat_multilabel.model.tok2vec' contains an interpolation key 'components.tok2vec.model.encode.width' which is not a valid option name. Raw value: '${components.tok2vec.model.encode.width}'
I couldn't find much on the internet about this error, so I thought I'd ask if you have any ideas as to how to solve this. Any thoughts appreciated - thanks!