This has been happening to me quite often, especially since the Prodigy update (v1.8.3). It keeps corrupting the whole annotation dataset, over and over again, and now I can't even export the results.
I run the command below and save the annotations:
prodigy image.manual my_dataset path/to/image --label ONE,TWO,THREE
When I try to access the dataset again, whether to add more annotations, export the JSONL file, or even just check the stats, it returns:
14:59:17 - APP: Using Hug endpoints (deprecated)
14:59:17 - DB: Initialising database MySQL
/home/shiftu/.local/lib/python3.6/site-packages/pymysql/cursors.py:170: Warning: (3090, "Changing sql mode 'NO_AUTO_CREATE_USER' is deprecated. It will be removed in a future release.")
result = self._query(query)
14:59:17 - DB: Connecting to database MySQL
14:59:17 - DB: Loading dataset 'my_dataset' (290 examples)
Traceback (most recent call last):
  File "/usr/lib/python3.6/runpy.py", line 193, in _run_module_as_main
    "__main__", mod_spec)
  File "/usr/lib/python3.6/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/home/shiftu/.local/lib/python3.6/site-packages/prodigy/__main__.py", line 372, in <module>
    plac.call(commands[command], arglist=args, eager=False)
  File "/home/shiftu/.local/lib/python3.6/site-packages/plac_core.py", line 328, in call
    cmd, result = parser.consume(arglist)
  File "/home/shiftu/.local/lib/python3.6/site-packages/plac_core.py", line 207, in consume
    return cmd, self.func(*(args + varargs + extraopts), **kwargs)
  File "/home/shiftu/.local/lib/python3.6/site-packages/prodigy/__main__.py", line 263, in db_out
    examples = DB.get_dataset(set_id)
  File "/home/shiftu/.local/lib/python3.6/site-packages/prodigy/components/db.py", line 296, in get_dataset
    return [eg.load() for eg in examples]
  File "/home/shiftu/.local/lib/python3.6/site-packages/prodigy/components/db.py", line 296, in <listcomp>
    return [eg.load() for eg in examples]
  File "/home/shiftu/.local/lib/python3.6/site-packages/prodigy/components/db.py", line 99, in load
    return srsly.json_loads(content)
  File "/home/shiftu/.local/lib/python3.6/site-packages/srsly/_json_api.py", line 37, in json_loads
    return ujson.loads(data)
ValueError: Unmatched ''"' when when decoding 'string'
Using:
prodigy==1.8.3
python==3.6.8
ujson==1.35
srsly==0.0.7
I want to debug this. Can you help me? Thanks!
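As a starting point for debugging, here is a minimal sketch of how the unparseable records could be isolated. The traceback shows that each stored example's content is passed to a JSON decoder, so the idea is to try decoding every raw record individually and report which ones fail and how long they are. The function name find_corrupt_rows and the sample rows are hypothetical; how the raw (id, content) pairs are read out of MySQL is an assumption and is left out, since the table layout is not shown above.

```python
import json


def find_corrupt_rows(rows):
    """Return (row_id, error message, content length) for rows that are not valid JSON.

    `rows` is any iterable of (row_id, content) pairs, where content is the
    raw bytes or string stored for one example. Fetching these pairs from the
    database is assumed to happen elsewhere.
    """
    bad = []
    for row_id, content in rows:
        if isinstance(content, bytes):
            content = content.decode("utf8", errors="replace")
        try:
            json.loads(content)
        except ValueError as err:  # json.JSONDecodeError subclasses ValueError
            bad.append((row_id, str(err), len(content)))
    return bad


# Hypothetical sample data standing in for rows read from the database.
sample_rows = [
    (1, b'{"image": "a.jpg", "answer": "accept"}'),
    (2, b'{"image": "data:image/jpeg;base64,/9j/4AAQ'),  # cut off mid-string
]

for row_id, message, size in find_corrupt_rows(sample_rows):
    print(row_id, size, message)
```

If the failing records all turn out to have the same length, that might hint that the content was cut off when it was written rather than mangled when it was read. That is only a guess; nothing in the log above confirms it.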