I'm running the following command:
prodigy textcat.manual nl_seniority nl_ads_classif.jsonl --label "Entry Level (0-8 months),Junior (8-27 months),Medior (21-54 months),Senior (42-84 months),Mature (72-180 months),Seasoned (168+ months)"
The first example from the database is:
{'text': 'CRM Medewerker\nVoor A.S. ....',
'meta': {'id': 18180},
'_input_hash': -860025884,
'_task_hash': -287355978,
'options': [{'id': 'Entry Level (0-8 months)',
'text': 'Entry Level (0-8 months)'},
{'id': 'Junior (8-27 months)', 'text': 'Junior (8-27 months)'},
{'id': 'Medior (21-54 months)', 'text': 'Medior (21-54 months)'},
{'id': 'Senior (42-84 months)', 'text': 'Senior (42-84 months)'},
{'id': 'Mature (72-180 months)', 'text': 'Mature (72-180 months)'},
{'id': 'Seasoned (168+ months)', 'text': 'Seasoned (168+ months)'}],
'_view_id': 'choice',
'config': {'choice_style': 'multiple'},
'accept': ['Junior (8-27 months)', 'Medior (21-54 months)'],
'answer': 'accept'}
Another example is:
prodigy spans.manual de_ad_components_split blank:de de_job_vacancies_split.jsonl --label "Soft skills,Hard skills,Activities,Benefits,Company attributes,Company Location,Education,Industry,Licenses and Courses,Location,Salary,Header"
And the first example is:
{'text': 'Mitarbeiter/in Montage\n....t!',
'tokens': [{'text': 'Mitarbeiter',
'start': 0,
'end': 11,
'id': 0,
'ws': False},
{'text': '/', 'start': 11, 'end': 12, 'id': 1, 'ws': False},
{'text': 'in', 'start': 12, 'end': 14, 'id': 2, 'ws': True},
....
{'text': '!', 'start': 1068, 'end': 1069, 'id': 158, 'ws': False}],
'spans': [{'start': 73,
'end': 114,
'token_start': 15,
'token_end': 19,
'label': 'Company attributes'},
...
{'start': 958,
'end': 979,
'token_start': 143,
'token_end': 144,
'label': 'Benefits'}],
'_input_hash': -1458903816,
'_task_hash': 1297739242,
'_view_id': 'spans_manual',
'answer': 'accept'}
I've looked at multiple projects: there are cases where 6 out of 1000 examples have timestamp, but the other 990+ don't have it.