Given the following situation, what is the easiest strategy to get evaluation metrics (Precision, Recall, F-Measure) for NER performance?
- We have some pre-annotated material A (not produced by a spaCy or Prodigy model)
- We use the ner.manual recipe to produce the corrected version B
- How can we measure the performance of A against gold B without applying too many conversion steps?
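To make the question concrete, here is a rough sketch of the kind of direct comparison we have in mind, using exact-match scoring on spans. It assumes that both A and B are available as lists of dicts in the Prodigy task shape (a "text" key plus "spans" with "start", "end" and "label"), and that examples can be paired by their text. The names span_set, prf, a_examples and b_examples are made up for illustration and are not part of spaCy or Prodigy.

```python
def span_set(example):
    """Collect the (start, end, label) triples of one example."""
    return {(s["start"], s["end"], s["label"]) for s in example.get("spans", [])}


def prf(pred_examples, gold_examples):
    """Exact-match precision, recall and F1 of predicted spans against gold spans."""
    # Assumption: the text is unique per example and identical in A and B.
    gold_by_text = {eg["text"]: span_set(eg) for eg in gold_examples}
    tp = fp = fn = 0
    for eg in pred_examples:
        if eg["text"] not in gold_by_text:
            continue  # skip examples that have no corrected counterpart
        pred = span_set(eg)
        gold = gold_by_text[eg["text"]]
        tp += len(pred & gold)   # spans with identical offsets and label
        fp += len(pred - gold)   # spans in A that were removed or changed in B
        fn += len(gold - pred)   # spans in B that A missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1


# Toy data: A gets one span right, one boundary wrong, and misses one entity.
a_examples = [{
    "text": "Apple hired Tim Cook in Cupertino",
    "spans": [
        {"start": 0, "end": 5, "label": "ORG"},
        {"start": 12, "end": 15, "label": "PERSON"},
    ],
}]
b_examples = [{
    "text": "Apple hired Tim Cook in Cupertino",
    "spans": [
        {"start": 0, "end": 5, "label": "ORG"},
        {"start": 12, "end": 20, "label": "PERSON"},
        {"start": 24, "end": 33, "label": "GPE"},
    ],
}]

precision, recall, f1 = prf(a_examples, b_examples)
print(f"P={precision:.3f} R={recall:.3f} F={f1:.3f}")
```

This counts a span as correct only if both offsets and the label match exactly, so a boundary error is penalised as one false positive plus one false negative. Whether that is the right notion of a match for our data is an open question.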
Thanks for hints.