How to capture discrepancies between two annotators

I want to capture and count the number of samples where the annotators' annotations differ, specifically for an NER task. For my purposes, a discrepancy is a difference between the highlighted text (the spans). I understand there is a review recipe with an auto-accept function. This auto-accept seems to do almost exactly what I want, but rather than just skipping examples that have no differences, I want to count the ones that do have differences. How can I do this?

One approach I thought about was to pull the database into Python and match up the tokens. That seems cumbersome, though, as there may be a lot of highlighted text.

EDIT:
Maybe this code:

def filter_auto_accept_stream(
    stream: Iterator[Dict[str, Any]], db: Database, dataset: str
) -> StreamType:
    """
    Automatically add examples with no conflicts to the database and skip
    them during annotation.
    """
    task_hashes = db.get_task_hashes(dataset)
    for eg in stream:
        versions = eg["versions"]
        if len(versions) == 1:  # no conflicts, only one version
            if TASK_HASH_ATTR in eg and eg[TASK_HASH_ATTR] in task_hashes:
                continue
            sessions = versions[0]["sessions"]
            if len(sessions) > 1:  # multiple identical versions
                # Add example to dataset automatically
                eg["answer"] = "accept"
                db.add_examples([eg], [dataset])
            # Don't send anything out for annotation
        else:
            yield eg

I found this by exploring the package: python -c "import prodigy; print(prodigy.__file__)"
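If the goal is to count the conflicting examples rather than only skip the clean ones, one option is to keep this logic in a custom copy of the recipe and increment a counter in the else branch. Below is a minimal sketch of that idea; the import paths (prodigy.components.db, prodigy.util) are what I'd expect in recent Prodigy versions, so double-check them against your install.

from typing import Any, Dict, Iterator

# Import paths are assumptions based on recent Prodigy versions; verify locally.
from prodigy.components.db import Database
from prodigy.util import TASK_HASH_ATTR


def filter_count_conflicts(
    stream: Iterator[Dict[str, Any]], db: Database, dataset: str
) -> Iterator[Dict[str, Any]]:
    """
    Same idea as filter_auto_accept_stream, but also count how many
    examples have more than one version, i.e. where the annotators'
    highlighted spans differ.
    """
    n_conflicts = 0
    task_hashes = db.get_task_hashes(dataset)
    for eg in stream:
        versions = eg["versions"]
        if len(versions) == 1:  # no conflicts, only one version
            if TASK_HASH_ATTR in eg and eg[TASK_HASH_ATTR] in task_hashes:
                continue
            sessions = versions[0]["sessions"]
            if len(sessions) > 1:  # multiple identical annotations
                # Add example to dataset automatically
                eg["answer"] = "accept"
                db.add_examples([eg], [dataset])
            # Don't send anything out for annotation
        else:
            n_conflicts += 1  # annotators disagreed on this example
            yield eg
    print(f"Examples with conflicting annotations: {n_conflicts}")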

Hi @klopez!

I think you're on the right track, and an "offline" Python script should do the trick. You can use db-out to export the dataset as JSONL, including the highlighted spans. In my opinion, it's more convenient to work with that export if you just want to find the differences.
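For example, here's a rough sketch of that offline approach, assuming an export produced with prodigy db-out your_dataset > annotations.jsonl. The field names _input_hash and spans (with start, end, label) are what a db-out export normally contains, but treat the grouping key and file name as placeholders and adjust them to your data.

import json
from collections import defaultdict


def load_jsonl(path):
    # Read a Prodigy db-out export (one JSON object per line).
    with open(path, encoding="utf8") as f:
        return [json.loads(line) for line in f if line.strip()]


def span_signature(eg):
    # Reduce an example's highlighted text to a comparable set of spans.
    return frozenset((s["start"], s["end"], s["label"]) for s in eg.get("spans", []))


def count_discrepancies(path):
    by_input = defaultdict(list)
    for eg in load_jsonl(path):
        # Group the annotators' versions of the same underlying text.
        # "_input_hash" is the key I'd expect in the export; adjust if needed.
        by_input[eg["_input_hash"]].append(eg)
    # A discrepancy = more than one distinct set of spans for the same text.
    return sum(
        1
        for versions in by_input.values()
        if len({span_signature(eg) for eg in versions}) > 1
    )


print(count_discrepancies("annotations.jsonl"))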
