Hi! I get the following error message as soon as I run db-in after db-out:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
I exported the dataset locally with db-out and wanted to import it on a VM with GPU support, e.g. to run train-curve.
However, I get the same error on my local machine when I re-import the exported file under a new dataset name.
Can I simply export and import datasets?
db-out:
python -m prodigy db-out correct_UC01_train > assets/correct_UC01_train.jsonl
db-in:
python -m prodigy db-in test_UC01 assets/correct_UC01_train.jsonl
Full error message:
Traceback (most recent call last):
  File "C:\Users\xxx\Miniconda3\lib\runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "C:\Users\xxx\Miniconda3\lib\runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "C:\Users\xxx\Miniconda3\lib\site-packages\prodigy\__main__.py", line 61, in <module>
    controller = recipe(*args, use_plac=True)
  File "cython_src\prodigy\core.pyx", line 331, in prodigy.core.recipe.recipe_decorator.recipe_proxy
  File "C:\Users\xxx\Miniconda3\lib\site-packages\plac_core.py", line 328, in call
    cmd, result = parser.consume(arglist)
  File "C:\Users\xxx\Miniconda3\lib\site-packages\plac_core.py", line 207, in consume
    return cmd, self.func(*(args + varargs + extraopts), **kwargs)
  File "C:\Users\xxx\Miniconda3\lib\site-packages\prodigy\recipes\commands.py", line 152, in db_in
    annotations = [set_hashes(eg) for eg in annotations]
  File "C:\Users\xxx\Miniconda3\lib\site-packages\prodigy\recipes\commands.py", line 152, in <listcomp>
    annotations = [set_hashes(eg) for eg in annotations]
  File "cython_src\prodigy\components\loaders.pyx", line 140, in JSONL
  File "C:\Users\xxx\Miniconda3\lib\codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
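A possible explanation, stated as an assumption rather than a confirmed diagnosis: byte 0xff at position 0 is what a UTF-16 little-endian byte-order mark (FF FE) looks like, and the `>` redirect in Windows PowerShell 5.1 writes UTF-16 by default, so the exported file may simply not be UTF-8. The sketch below checks the first bytes of a file for a byte-order mark and rewrites the file as UTF-8. The function names `detect_encoding` and `reencode_to_utf8` are hypothetical helpers, not part of Prodigy.

```python
import codecs
from pathlib import Path

# Byte-order marks and the Python codec that decodes a file starting with each.
# UTF-32 is checked before UTF-16 because the UTF-32 LE mark (FF FE 00 00)
# begins with the UTF-16 LE mark (FF FE).
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def detect_encoding(raw: bytes) -> str:
    """Guess the encoding from a leading byte-order mark.

    Assumption: a file without any mark is treated as plain UTF-8.
    """
    for bom, name in BOMS:
        if raw.startswith(bom):
            return name
    return "utf-8"


def reencode_to_utf8(src, dst) -> str:
    """Read src, decode it using the detected encoding, write dst as UTF-8.

    Returns the name of the encoding that was detected for src.
    """
    raw = Path(src).read_bytes()
    encoding = detect_encoding(raw)
    # The "utf-16", "utf-32" and "utf-8-sig" codecs consume the mark themselves.
    text = raw.decode(encoding)
    # Write bytes directly so that line endings are left exactly as they were.
    Path(dst).write_bytes(text.encode("utf-8"))
    return encoding
```

For example, `reencode_to_utf8("assets/correct_UC01_train.jsonl", "assets/correct_UC01_train_utf8.jsonl")` would write a UTF-8 copy (the second file name is made up here) that could then be passed to db-in. If the function reports "utf-8" for the exported file, this assumption does not apply and the cause lies elsewhere.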
Local environment:
============================== ✨ Prodigy Stats ==============================
Version          1.11.4
Location         C:\Users\Miniconda3\lib\site-packages\prodigy
Prodigy Home     C:\Users\.prodigy
Platform         Windows-10-10.0.18362-SP0
Python Version   3.8.3
Database Name    SQLite
Database Id      sqlite
Total Datasets   9
Total Sessions   53
Thank you!