I'm trying to update an existing model with a new entity, but the new annotated dataset I am training on apparently includes token sequences that exceed the model's limit. When training, I get the following error:
```
Token indices sequence length is longer than the specified maximum sequence length for this model (776 > 512). Running this sequence through the model will result in indexing errors.
```
What would be the correct way to limit the length to 512?
This is a warning coming internally from tokenizers, and you don't see actual errors because long sequences are truncated internally before they're passed to the model.
If it happens rarely, you can probably ignore it. If it's frequent, you may want to adjust the stride for the transformer span getter in your config. See: Receiving the warning message 'Token indices are too long' even after validating doc length is under max sequence length · Discussion #9277 · explosion/spaCy · GitHub
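As a rough sketch, the span getter settings usually live in the transformer component's config; the exact section path depends on your pipeline, and the values below are the spacy-transformers defaults, shown here only to indicate which keys to tune (smaller `window`/`stride` values produce shorter overlapping spans):

```ini
[components.transformer.model.get_spans]
@span_getters = "spacy-transformers.strided_spans.v1"
window = 128
stride = 96
```

Lowering `window` (and keeping `stride` below it so spans overlap) keeps each span well under the model's 512-token limit even when individual tokens expand into many wordpieces.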
Thank you for your reply and references!
I'll look into it.