Correct way to "accept" examples in custom recipe?

ysz · July 11, 2023, 4:12pm

I'm using recipe from Dependencies and Relations · Prodigy · An annotation tool for AI, Machine Learning & NLP like this

prodigy rel.manual ner_rels blank:en ./tmp.jsonl --label EQUALS --span-label QTY,AMOUNT

which works great!

But I'm only interested in reviewing and fixing relations manually for sentences which have some labels (eg PERCENT)

What would be right way to pre-accept all other sentences from the source document to the database without writing my own custom recipe?

I mean I can preprocess my files and already save ("accept") sentences without labels as positive examples which have 0 relations and only send rest of the sentences to the prodigy

ysz · July 11, 2023, 5:27pm

or should i just yield such examples with these attributes from my custom loader:

ex = {"text": text, 
        "tokens": tokens,

        # set this to "accept" ? 
        "spans": [],
        "relations": [],
        "answer": "accept",
        }
yield ex

koaning · July 12, 2023, 2:40pm

Yeah this does feel like the kind of thing where I might recommend writing a custom script to handle the logic, mainly because the logic might also change over time.

With that in mind, it might also be good to perhaps consider multiple datasets. You might want to have a dataset with reviewed items and another with automatically accepted items just to ensure that they are separate.

Topic		Replies	Views
Automatically accept NER	2	236	October 13, 2023
Can I approve/reject pre labelled text classifications usage , textcat	2	473	February 11, 2020
Customizing prodigy for NER and relationship extraction usage , ner , custom	4	4203	December 20, 2017
recipe proposing list of custom chosen sentences for manual new usage , ner , custom , solved	4	1095	January 21, 2018
html custom recipe to display all "accept" annotations in db (without need to do anything else) usage , custom , front-end , solved	5	464	April 6, 2022

Correct way to "accept" examples in custom recipe?

Related topics