Hi all,
Again facing issues/incomprehensions.
I used this piece of code with last version of prodigy: 1.11.8
and spacy: 3.4.3
prodigy ner.correct trainset_reviewed model/model_spacy3/model-last trainset.jsonl --label x1,x2 --unsegmented
Traceback (most recent call last):
File "/usr/local/anaconda3/lib/python3.8/runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/local/anaconda3/lib/python3.8/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/usr/local/anaconda3/lib/python3.8/site-packages/prodigy/__main__.py", line 61, in <module>
controller = recipe(*args, use_plac=True)
File "cython_src/prodigy/core.pyx", line 374, in prodigy.core.recipe.recipe_decorator.recipe_proxy
File "cython_src/prodigy/core.pyx", line 63, in prodigy.core.Controller.from_components
File "cython_src/prodigy/core.pyx", line 160, in prodigy.core.Controller.__init__
File "cython_src/prodigy/components/feeds.pyx", line 104, in prodigy.components.feeds.Feed.__init__
File "cython_src/prodigy/components/feeds.pyx", line 150, in prodigy.components.feeds.Feed._init_stream
File "cython_src/prodigy/components/stream.pyx", line 107, in prodigy.components.stream.Stream.__init__
File "cython_src/prodigy/components/stream.pyx", line 58, in prodigy.components.stream.validate_stream
File "/usr/local/anaconda3/lib/python3.8/site-packages/prodigy/recipes/ner.py", line 244, in make_tasks
for doc, eg in nlp.pipe(texts, as_tuples=True, batch_size=10):
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/language.py", line 1545, in pipe
for doc in docs:
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/language.py", line 1589, in pipe
for doc in docs:
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/util.py", line 1651, in _pipe
yield from proc.pipe(docs, **kwargs)
File "spacy/pipeline/transition_parser.pyx", line 233, in pipe
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/util.py", line 1600, in minibatch
batch = list(itertools.islice(items, int(batch_size)))
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/util.py", line 1651, in _pipe
yield from proc.pipe(docs, **kwargs)
File "spacy/pipeline/trainable_pipe.pyx", line 73, in pipe
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/util.py", line 1600, in minibatch
batch = list(itertools.islice(items, int(batch_size)))
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/language.py", line 1586, in <genexpr>
docs = (self._ensure_doc(text) for text in texts)
File "/usr/local/anaconda3/lib/python3.8/site-packages/spacy/language.py", line 1535, in <genexpr>
docs_with_contexts = (
File "/usr/local/anaconda3/lib/python3.8/site-packages/prodigy/recipes/ner.py", line 243, in <genexpr>
texts = ((eg["text"], eg) for eg in stream)
File "cython_src/prodigy/components/preprocess.pyx", line 167, in add_tokens
File "cython_src/prodigy/components/preprocess.pyx", line 263, in prodigy.components.preprocess._add_tokens
File "cython_src/prodigy/components/preprocess.pyx", line 225, in prodigy.components.preprocess.sync_spans_to_tokens
KeyError: 'id'
I tried using model-best but the same. I think I didn't have such error with last version of prodigy.
Do you have some ideas?
Thank you
Best regards
Julie