No tagger in pre-trained models?

akimotode · March 23, 2024, 11:01pm

I am trying to do Coref tagging.

My CLI is this:
python -m prodigy coref.manual coref_dataset en_core_web_sm d:\training\spacy_rel\assets\text_fragments.jsonl --label COREF

The error message I get confuses me, since I more or less copied the command from the documentation. And I can't imagine that the pre-trained model is incomplete...
ValueError: [E155] The pipeline needs to include a morphologizer or tagger+attribute_ruler in order to use Matcher or PhraseMatcher with the attribute POS. Try using 'nlp()' instead of 'nlp.make_doc()' or 'list(nlp.pipe())' instead of 'list(nlp.tokenizer.pipe())'.

I assume this is a silly mistake on my side, but I can't see it...

In case it is important: I'm running Prodigy 1.14.12 and spaCy 3.7 (incl. the 3.7 models).

Cheers,
Kai

magdaaniol · March 26, 2024, 8:35am

Hi @akimotode,

Have you modified the spaCy pipeline in any way, e.g. by adding an EntityRuler or a custom NER component?
If so, make sure the entity_ruler and ner components come after the attribute_ruler, so that the POS labels produced by the tagger and attribute_ruler are available to entity_ruler and ner.
You can check the current order like so:

import spacy
nlp = spacy.load("en_core_web_sm")
print(nlp.pipe_names)

To change the order, if necessary:

# move the NER component to the end of the pipeline: remove and then reload from the same source in the new position
nlp.remove_pipe("ner")
nlp.add_pipe("ner", source=spacy.load("en_core_web_sm"))

# add entity ruler
nlp.add_pipe("entity_ruler", before="ner")
print(nlp.pipe_names)
# ['tok2vec', 'tagger', 'parser', 'attribute_ruler', 'lemmatizer', 'entity_ruler', 'ner', ... ]
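
If it helps to check the printed pipe names against the requirement in the E155 message (a morphologizer, or tagger plus attribute_ruler), here is a minimal sketch. `pos_pipeline_problems` is a hypothetical helper, not part of spaCy or Prodigy, and the default `consumers` are simply the two components mentioned above. It only inspects component names, so it assumes the pipeline uses the default names.

```python
from typing import List, Sequence


def pos_pipeline_problems(
    pipe_names: Sequence[str],
    consumers: Sequence[str] = ("entity_ruler", "ner"),
) -> List[str]:
    """Hypothetical helper: list ordering problems for POS-dependent components.

    pipe_names is expected to look like nlp.pipe_names (enabled components, in order).
    """
    names = list(pipe_names)
    has_morph = "morphologizer" in names
    has_tagger_pair = "tagger" in names and "attribute_ruler" in names

    # E155 asks for a morphologizer OR tagger + attribute_ruler.
    if not (has_morph or has_tagger_pair):
        return ["no morphologizer or tagger+attribute_ruler in the pipeline"]

    # Index of the earliest component after which POS should be set.
    ready_candidates = []
    if has_morph:
        ready_candidates.append(names.index("morphologizer"))
    if has_tagger_pair:
        ready_candidates.append(
            max(names.index("tagger"), names.index("attribute_ruler"))
        )
    pos_ready = min(ready_candidates)

    # Flag any consumer that is placed before that point.
    problems = []
    for consumer in consumers:
        if consumer in names and names.index(consumer) < pos_ready:
            problems.append(
                f"{consumer} runs before the component at position {pos_ready} "
                f"({names[pos_ready]})"
            )
    return problems
```

Calling `pos_pipeline_problems(nlp.pipe_names)` on the loaded pipeline returns an empty list when nothing looks out of order, and one message per misplaced or missing component otherwise.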
