Prodigy Support for Spacy_DBpedia_Spotlight Pipeline

danalynn · November 17, 2021, 3:44pm

Hi Prodigy Team,

I am using spaCy's DBpedia Spotlight pipeline for NER and would like to run a Prodigy ner.correct session based on that model. I know you can specify the component within the ner.correct recipe, but since the DBpedia Spotlight pipeline isn't included by default, it isn't recognized as an option even when installed. Is there a simpler way to call this pipeline within the standard ner.correct recipe, or is it necessary to write a custom recipe for this?

Thanks for your help!

ljvmiranda921 · November 18, 2021, 1:38am

Hi @danalynn , welcome to Prodigy!

It should be possible by loading the model, saving it to disk, and pointing the ner.correct recipe to that path. Something like this:

import spacy

nlp = spacy.load("en_core_web_lg")
nlp.add_pipe("dbpedia_spotlight")
print(nlp.pipe_names) # ['tok2vec', 'tagger', 'parser', 'ner', 'attribute_ruler', 'lemmatizer', 'dbpedia_spotlight']

nlp.to_disk("pipe_with_dbpedia")  # c.f. https://spacy.io/api/language#to_disk

Afterwards, you can pass that path as the spacy_model positional argument of ner.correct. Something like this:

prodigy ner.correct my-dataset pipe_with_dbpedia ...

As long as the spacy-dbpedia-spotlight component sets doc.ents, it should work out of the box.
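
If you want to confirm that a saved pipeline still sets doc.ents after being reloaded from its path, a quick check along these lines might help. This is only a sketch: the helper name saved_pipeline_sets_ents is made up for illustration, and the built-in entity_ruler with a single pattern is used as a stand-in for dbpedia_spotlight so the check runs without the DBpedia Spotlight web service. With the real component you would pass the nlp object built above instead.

```python
import tempfile
from pathlib import Path

import spacy


def saved_pipeline_sets_ents(nlp, text):
    """Save nlp to a temporary directory, reload it from that path,
    and return the (text, label) pairs found in doc.ents."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "pipe_check"  # hypothetical directory name
        nlp.to_disk(path)
        reloaded = spacy.load(path)  # load from a path, as described above
        doc = reloaded(text)
        return [(ent.text, ent.label_) for ent in doc.ents]


# Stand-in pipeline (assumption): entity_ruler replaces dbpedia_spotlight
# here so that the example needs no network access or model download.
nlp = spacy.blank("en")
ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns([{"label": "ORG", "pattern": "Google"}])

ents = saved_pipeline_sets_ents(nlp, "Google was founded in California.")
print(ents)
```

If the returned list is empty for text that clearly contains entities, the component is probably writing its results somewhere other than doc.ents, and ner.correct would have nothing to pre-highlight.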
