Compatibility of versions

ines · October 1, 2018, 7:37am

In general, we make sure that Prodigy is always compatible with stable spaCy versions. You can obviously try and use it with Prodigy, but I'd only recommend it for experimental purposes. (Also, remember that spacy-nightly versions usually require new models.)

But for your use case, I'm not even sure you need to use Prodigy with the alpha version of spaCy? You can still collect your annotations with the current stable version, and then use the match patterns or data to train

Patterns like this are problematic, because as I've explained in the thread you linked, this one will never match. The following will look for one token whose lowercase matches "HWY SPEEDS". This will never be the case, since the string will be split into two tokens: ['HWY', 'SPEEDS'].

Instead, your patterns can either reflect the tokenization, or you can write exact string match patterns instead:

{"label":null,"pattern":"HWY SPEEDS"}

For your use case, it sounds like you probably just want to write your own converter script that takes your annotations and outputs the patterns. Basically, something similar to the script I describe at the bottom of this post. This will also let you incorporate the patterns automatically. If you look at the source of terms.to-patterns, you'll see that it doesn't really do anything magicaly at all – it's just a convenience helper function. All you want to do here it take one data format and convert it to a different one – how you do this is up to you. (You don't even have to use Python if there's a different language you prefer!)

Just to make sure I understand your use case correctly: Do you want to just find exact string matches in your text and label them, or also train a model to generalise based on those strings and find similar occurrences in context?

Topic		Replies	Views
NER or PhraseMatcher? ner , spacy , best-practices	17	6094	September 20, 2018
Create PhraseMatcher in Spacy and use them to Label data manually ner , spacy , solved , medical	9	1571	December 15, 2020
Prodigy patterns not behaving like Spacy patterns usage , spacy , solved	19	2132	May 29, 2019
match pattern work in spacy but does not work in prodigy usage , ner , spacy	2	437	January 25, 2021
EntityRuler and ner.match - different behavior usage , ner , spacy	6	1729	July 11, 2019

Compatibility of versions

Related topics