NER and Relation Extraction

koayst · November 2, 2021, 8:19am

Hi,

Like to seek your opinion on how I should approach the tasks in terms of workflow.

I annotate the dataset first by labelling the entities and then build a NER model
Use the NER model generated and run it on the same dataset as in step #1 and together with the entities inferred by the model, label the relation between entities and build another relation model.

OR

I annotate the entities and relation on the dataset.
Build a NER model using the annotated file.
Build a entity relation model using the same annotated file (same file as in step 2).

Please recommend.

Thanks.

ST

koayst · November 2, 2021, 12:08pm

By the way, the "entity relation extraction" model I mentioned above is same as the one found in github project.

ljvmiranda921 · November 3, 2021, 10:00am

Hi @koayst ,

Welcome to Prodigy

For the entity relation extraction (REL) use-case, your first option of
annotating them separately makes sense. This is because for most REL applications, deciding whether something is an entity is unrelated to whether that
entity is a relation, so you have different thought processes in annotating.

If you annotate them at the same time, you might miss out on actual entities that just happened to not belong in a relation--lowering your recall and affect the consistency of the NER data and subsequent models.

I also suggest looking at these discussions for more information:

Topic		Replies	Views
How to train and correct a Named Entity Recognition with relation extraction usage , ner , relations	1	1002	December 11, 2020
How to extract dependencies in spaCy after using prodigy rel.manual? usage , spacy , relations	7	1465	April 19, 2021
Rel training usage , relations , training	7	1273	May 22, 2023
NER with relation ner , relations	1	343	November 24, 2022
Annotate relationships on existing entities usage , ner	2	822	July 12, 2022

NER and Relation Extraction

Related topics