✨ Prodigy nightly: spaCy v3 support, UI for overlapping spans & more

Just released a new nightly v1.11.0a10 that includes the following updates:

  • improved support for updating from binary annotations, especially those created with ner.teach
  • ner.teach will now also ask about texts with no entities – so if a suggestion doesn't include any suggestions, you can accept it if it has no entities and reject it if it does contain entities of the given label(s)
  • support for providing --spancat datasets for training spaCy v3.1's new SpanCategorizer in spacy train (with auto-generated suggester function)
  • support for validating created spans in spans.manual against suggester function
  • support for custom config or base model in prodigy train and data-to-spacy
  • support for providing --textcat and --textcat-multilabel (non-exclusive categories, including binary annotations) separately to prodigy train and data-to-spacy
  • sent.teach and sent.correct recipes for improving a sentence recognizer and support for --senter annotations in prodigy train an data-to-spacy
  • textcat.correct for correcting an existing text classifier
  • "_timestamp" property added to all created annotations reflecting the time the annotation was submitted in the UI
  • progress command for viewing annotation progress over time
  • ARM wheels
  • use the -F flag to pass in one or more comma-separated Python files to import from across all recipes to provide the recipe function, but also custom registered functions for spaC configs (e.g. in prodigy train)
  • fixes for various bugs introduced in the previous nightlies

Btw, after downloading an extracting the zip containing the wheel files, you can also run the following to automatically select the best-matching wheel for your platform:

pip install prodigy -f /path/to/wheels
3 Likes