Sure, the example .jsonl was included in the initial post (ui_bug.jsonl
) but I'll attach here again.
Example of custom JSON: ui_bug.jsonl (1.6 KB)
e..g,
{"meta":{"form_name":"12549608361_JustFund-Common-Proposal-Questions.txt","page_number":2,"n":200,"jsonl_version":"0.03"},"text":"2 12549608361_JustFund-Common-Proposal-Questions.txt\nHomelessness\n\n- Human Rights / Civil Rights & Liberties\n\n- Immigration\n\n- LGBTQ+\n\n- Racial Justice\n\n- Transportation / Utilities / Public Infrastructure\n\n- Other\n\nURGENT NEED\n\nSpecific urgent need categories may be active for limited times. If this\nproposal suits an urgent need, select the category it fits. Note, this\noption will have no choices available if there are no current urgent\nneed categories.\n\n- None\n\n- COVID-19\n\nDONATION INFORMATION\n\nPlease list the address where all contributions should be sent. If you\nhave a Fiscal Sponsor, please list your Fiscal Sponsor's address.\n\nDonation Website:\n\nDonation Instructions:\n\nCheck Donation Addressed To:\n\nCheck Donation Memo Line:\n\nCheck Street Address:\n\nCheck City:\n\nCheck State:\n\nCheck ZIP:\n\n*Required fields\n","spans":[{"token_start":14,"token_end":59,"label":"ANSWER","start":2,"end":12},{"token_start":59,"token_end":76,"label":"ANSWER","start":12,"end":16},{"token_start":76,"token_end":88,"label":"ANSWER","start":16,"end":20},{"token_start":88,"token_end":108,"label":"ANSWER","start":20,"end":25},{"token_start":108,"token_end":164,"label":"ANSWER","start":25,"end":34},{"token_start":164,"token_end":175,"label":"ANSWER","start":34,"end":38},{"token_start":419,"token_end":429,"label":"ANSWER","start":88,"end":92},{"token_start":429,"token_end":443,"label":"ANSWER","start":92,"end":96}]}
It looks like the tokens
field is also required. But the documentation says that ner_manual
will add that, "... [t]he ner_manual
interface allows highlighting spans of text based on token boundaries, and will use a model to tokenize the text and add an additional "tokens"
property to the annotation task."
Despite the documentation above am I still required to add token
if I pre-populate spans
?