I am trying to upload my custom named entity recognition (NER) model to a Hugging Face repository. The model was trained with Prodigy and spaCy on my scientific text.
As described on the spaCy website, I ran 'pip install spacy-huggingface-hub' in my Ubuntu 22.04 terminal.
The package now appears in my list of installed packages.
However, when I run 'python -m spacy huggingface-hub --help' to check it, the terminal returns the error 'Unknown command: huggingface-hub'.
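A quick way to confirm what is actually installed in the active Python environment is a version check like the sketch below. It assumes the PyPI distribution names 'spacy' and 'spacy-huggingface-hub'; the helper name installed_version is only illustrative. If the spaCy major version turns out to be older than expected, that may explain why the subcommand is not recognised.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names as published on PyPI (assumed here).
    for name in ("spacy", "spacy-huggingface-hub"):
        print(f"{name}: {installed_version(name) or 'not installed'}")
```

Running this with the same interpreter that runs 'python -m spacy' matters, since a different virtual environment can hold a different spaCy version.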
Thank you very much for your suggestion, Ryan.
I figured out that I created my custom model with Prodigy and spaCy v2, whereas the commands on the spaCy website are for spaCy v3. I tried downgrading spaCy to v2 and running the commands again, but that fails with the error "command --build wheel is not found".
The other option is to find a way to upgrade my custom model to spaCy v3. I will try that.
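To confirm which spaCy version a saved model was built for, the meta.json file in the model directory can be read directly. The sketch below assumes the directory contains a meta.json with a "spacy_version" field, which spaCy writes when it saves a model; the function name model_spacy_version and the example path are placeholders, not part of any library.

```python
import json
from pathlib import Path
from typing import Optional


def model_spacy_version(model_dir: str) -> Optional[str]:
    """Return the spaCy version requirement recorded in a saved model's meta.json.

    Returns None if the directory has no meta.json or the field is missing.
    """
    meta_path = Path(model_dir) / "meta.json"
    if not meta_path.is_file():
        return None
    with meta_path.open(encoding="utf8") as f:
        meta = json.load(f)
    return meta.get("spacy_version")


if __name__ == "__main__":
    # "tmp_model" is a hypothetical path; point this at the real model folder.
    print(model_spacy_version("tmp_model"))
```

A value in the 2.x range would confirm that the model cannot be loaded by spaCy v3 as it is and needs to be retrained.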
Last year, in June 2021, I created a custom NER model with Prodigy (and spaCy 2.x) on my Windows laptop:
python -m prodigy train ner dataset,dataset_correct,dataset_correct1,dataset_correct3 en_vectors_web_lg --output C:\Users\myname\Documents\tmp_model --eval-split 0.2 --n-iter 40
I tried to upload this model to huggingface.co but could not, because the model was built with spaCy v2 while spacy-huggingface-hub works with spaCy v3. I have therefore decided to install Prodigy on my Ubuntu 22.04 laptop and either retrain the old model or rebuild it, whichever is possible.
I still have the .prodigy folder from my Windows laptop from last year. It contains two files: prodigy.db (168 MB, 9 datasets) and prodigy.json (6 B). I want to reuse this prodigy.db database to retrain or rebuild the old model.
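Before retraining, it may be worth checking that the copied prodigy.db opens correctly on the new machine. Prodigy's default database is SQLite, so Python's built-in sqlite3 module can read it. The sketch below makes no assumption about Prodigy's table names: it lists whatever tables the file contains and counts their rows. The function name table_row_counts and the file location are placeholders, and the file is opened read-only so the original data is not touched.

```python
import sqlite3
from pathlib import Path
from typing import Dict


def table_row_counts(db_path: str) -> Dict[str, int]:
    """Return {table_name: row_count} for every user table in a SQLite file."""
    # Open read-only via a file: URI so the database cannot be modified
    # (and a mistyped path raises an error instead of creating a new file).
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
                "ORDER BY name"
            )
        ]
        return {
            name: conn.execute(f'SELECT COUNT(*) FROM "{name}"').fetchone()[0]
            for name in names
        }
    finally:
        conn.close()


if __name__ == "__main__":
    # Hypothetical location; adjust to wherever the copied file lives.
    db_file = Path.home() / ".prodigy" / "prodigy.db"
    if db_file.is_file():
        for table, count in table_row_counts(str(db_file)).items():
            print(f"{table}: {count} rows")
```

If the tables and row counts look plausible, the annotations themselves survived the move, and the remaining question is only how to train from them with the newer Prodigy and spaCy versions.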
Could you please suggest how to do this, with links to the relevant code?