SpanCat Training Error on Custom Preprocessed Dataset

ryanwesslen · March 3, 2023, 9:32pm

Thanks for your response.

Were your annotations altered with any pre- or post-processing? You should avoid any modifications to your annotations if you want to use prodigy train.

That message is a bit too vague for me to diagnose without more details. Was this all of the error message? If not, can you provide the full stack error message?

The closest I found was relating to tokenization:

I'm wondering if this is a tokenization problem because of some pre- or post-processing you may have done.

Alternatively if not, can you provide a small sample of your data like you did previously?

Also, moving forward, please avoid screen shots of code - you can instead copy/paste it directly. This enables it be searchable for the next user (e.g., now others could search by the same error message and find this post)

Topic		Replies	Views
Unable to use train and run data-to-spacy recipes for spancat on prodigy 1.11.10 solved , spancat	4	929	May 4, 2023
Spancat training from db-in'd dataset not working usage , spancat	8	634	April 22, 2022
Spancat is not trained spancat	12	1155	July 27, 2022
Prodigy textcat train optimization?? usage , textcat , spacy	3	561	March 23, 2020
Span Cat Annotations and Incorrect Predictions spacy , spancat	4	904	June 8, 2023

SpanCat Training Error on Custom Preprocessed Dataset

Related topics