Just released a new nightly v1.11.0a10 that includes the following updates:
- improved support for updating from binary annotations, especially those created with
ner.teach
-
ner.teach
will now also ask about texts with no entities – so if a suggestion doesn't include any suggestions, you can accept it if it has no entities and reject it if it does contain entities of the given label(s) - support for providing
--spancat
datasets for training spaCy v3.1's newSpanCategorizer
inspacy train
(with auto-generated suggester function) - support for validating created spans in
spans.manual
against suggester function - support for custom config or base model in
prodigy train
anddata-to-spacy
- support for providing
--textcat
and--textcat-multilabel
(non-exclusive categories, including binary annotations) separately toprodigy train
anddata-to-spacy
-
sent.teach
andsent.correct
recipes for improving a sentence recognizer and support for--senter
annotations inprodigy train
andata-to-spacy
-
textcat.correct
for correcting an existing text classifier -
"_timestamp"
property added to all created annotations reflecting the time the annotation was submitted in the UI -
progress
command for viewing annotation progress over time - ARM wheels
- use the
-F
flag to pass in one or more comma-separated Python files to import from across all recipes to provide the recipe function, but also custom registered functions for spaC configs (e.g. inprodigy train
) - fixes for various bugs introduced in the previous nightlies
Btw, after downloading an extracting the zip containing the wheel files, you can also run the following to automatically select the best-matching wheel for your platform:
pip install prodigy -f /path/to/wheels