I am looking for a way to convert from CONLL format NER annotations to the JSONL format used in Prodigy, to re-annotate a pre-existing dataset. I used the convert function in CLI for Spacy, but that does not seem to be useful for loading the annotations into Prodigy interface. Is there a pre-existing way to do this?
Edit: I found a ner.iob-to-gold in Prodigy recipes. However, that throws me the following error:
C:\Users\svajjala\Downloads\Prodigy>python -m prodigy ner.iob-to-gold good_preds_premanual.iob
Traceback (most recent call last):
File "C:\Users\svajjala\Python36\lib\runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "C:\Users\svajjala\Python36\lib\runpy.py", line 85, in _run_code
exec(code, run_globals)
File "C:\Users\svajjala\Python36\lib\site-packages\prodigy\__main__.py", line 259, in <module>
controller = recipe(*args, use_plac=True)
File "cython_src\prodigy\core.pyx", line 167, in prodigy.core.recipe.recipe_decorator.recipe_proxy
File "C:\Users\svajjala\Python36\lib\site-packages\plac_core.py", line 328, in call
cmd, result = parser.consume(arglist)
File "C:\Users\svajjala\Python36\lib\site-packages\plac_core.py", line 207, in consume
return cmd, self.func(*(args + varargs + extraopts), **kwargs)
File "C:\Users\svajjala\Python36\lib\site-packages\prodigy\recipes\ner.py", line 636, in iob_to_gold
output_loc = sys.stdout
NameError: name 'sys' is not defined