textcat.manual binary annotation without labels

cheyanneb · November 11, 2021, 8:25am

Hi,

I have a pre-annotated dataset. Each document in the .jsonl file has a different label. I have been able to set up a system where the pre-annotated label is in the banner above the sentence/document, but it required me to have a list of dropdown or checkbox labels that I need to declare as an argument on the command line. Is there a way to just have the banner with the pre-annotated label (which is different per document) and the accept/reject button without the --labels requirement on the command line?

I tried to use mark with a list of labels, as I just want the label as a banner with the sentence, and the accept and reject buttons. Each document is pre-annotated and has its own label taken from a list of ~100 labels. Example from .jsonl

{"id":"1234","deployment":"xxxx","date":"2021-10-20 12:46:00","text":"random text here","label":"label1","orig_accept":["label1"]}
{"id":"1234","deployment":"xxxx","date":"2021-10-20 12:46:00","text":"random text here","label":"label2","orig_accept":["label2"]}

Thank you!
Cheyanne

cheyanneb · November 12, 2021, 2:57am

I may have solved this by using the following command:

PRODIGY_ALLOWED_SESSIONS=max,sam prodigy mark testset_name /<path_to_file>/file.jsonl --view-id classification

ines · November 14, 2021, 10:38am

Yes, that's a good solution. If you already have pre-annotated labels, all you need to do is render the exact data that comes in.

Topic		Replies	Views
Reduce list of labels in textcat.manual to pre-annotated labels and none of the above usage , textcat	1	485	November 8, 2021
Using Prodigy to confirm or reject existing document labels usage , textcat , solved	2	613	January 5, 2019
Multi-labels not working usage , ner , solved	6	1016	August 23, 2019
Highlight list of terms in `textcat.manual` for binary annonation usage , textcat	2	412	April 21, 2022
textcat_multilabel with only some labels annotated for some examples	5	377	June 14, 2022

textcat.manual binary annotation without labels

Related topics