Hi
I am a new user of Spacy and prodigy,
I have a question, maybe it is a stupid one!!
I am gonna do the same steps that you have did it in this video,
Training a NAMED ENTITY RECOGNITION MODEL with Prodigy and Transfer Learning
However, for the first step, I have created my match patterns with spacy using "PhraseMatcher" because I have a big database that contains exactly all the sentences that I am trying to recognize in any text (text in French). It contains more than 50,000 expressions.
import json import spacy from spacy.matcher import PhraseMatcher nlp = spacy.load("fr_core_news_lg") matcher = PhraseMatcher(nlp.vocab) with open('medicaments_expression_database.json') as json_file: data = json.load(json_file) sentences = data['sentences'] patterns = [nlp.make_doc(text) for text in sentences] matcher.add("medicaments_expressions", None, *patterns)
Is there any way to use these patterns to continue the second step and start labeling data manually with the help of them, the same way as you have used the food_patterns.jsonl in this command :
prodigy ner.manual food_data blank:en ./reddit_r_cooking_sample.jsonl --label INGRED --patterns food_patterns.jsonl