Off-track use of Prodigy/Spacy - Custom Regex Pattern Matching and Modeling

ines · February 4, 2019, 6:19pm

Sorry, I was getting a big confused by all of that nested logic and the the pattern matcher that yields examples etc. And yes, if you want to create static annotation examples (one for each span) and render them exactly as they come in to accept/reject (e.g. with the mark recipe), you’d really only need something like this?

examples = []
for text in LOTS_OF_TEXTS:
    for label, regex_patterns.items():
        for match in re.finditer(expression, text):
            start, end = match.span()
            span = {"start": start, "end": end, "label": label}
            task = {"text": text, "spans": [span]}
            examples.append(task)

Topic		Replies	Views
spaCy, prodigy, annotation usage , ner , solved	2	722	February 8, 2019
Custom Tokenization Support for Spacy (and by extension Prodigy). spacy	3	1746	January 24, 2019
Pattern Matching on Custom Attributes usage , spacy , off-topic	2	736	September 22, 2021
How to use customized spaCy model in Prodigy? ner , spacy	6	489	July 3, 2023
Migration from spaCy 2.3 to 3.x + Annotating data in prodigy usage , spacy	1	459	August 29, 2021

Off-track use of Prodigy/Spacy - Custom Regex Pattern Matching and Modeling

Related topics