I get the error below when running ner.correct on prodigy v1.13 on windows 11.
================================= Traceback =================================
File "C:\Users\fabio\AppData\Local\Programs\Python\Python38\lib\runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "C:\Users\fabio\AppData\Local\Programs\Python\Python38\lib\runpy.py", line 87, in _run_code
File "C:\Users\fabio\.virtualenvs\dsar-wm91-sfZ\lib\site-packages\prodigy\__main__.py", line 63, in <module>
controller = recipe(*args, use_plac=True)
File "C:\Users\fabio\.virtualenvs\dsar-wm91-sfZ\lib\site-packages\plac_core.py", line 367, in call
cmd, result = parser.consume(arglist)
File "C:\Users\fabio\.virtualenvs\dsar-wm91-sfZ\lib\site-packages\plac_core.py", line 232, in consume
return cmd, self.func(*(args + varargs + extraopts), **kwargs)
============================== Warning message ==============================
✘ 'charmap' codec can't decode byte 0x9d in position 6990: character
maps to < undefined >
I ran the test suggested in that thread and had the same result - specifying encoding="utf-8" didnt raise the error but with out it I get a
UnicodeEncodeError: 'charmap' codec can't encode character '\u2139' in position 9: character maps to < undefined >
sorry for the late reply. We've not been able to reproduce this locally but that's possibly related to the fact that the Prodigy development team doesn't use Windows machines. However, you're not the first user with an issue that might be specific to Windows so we're exploring a way for internally to work on Windows bugfixes.
that seems like a solid solution for now, nice find!
It's still hard for us to fully replicate the issue, but I can't think of a reason why your approach wouldn't work for the short-medium term.
In the long term we're interested in replacing plac with radicli in our codebase. I'd imagine that once that change is in this issue should also go away, but it's something we'll keep in the back of our mind.