relation is responding very slowly

Hi and sorry about this – it's currently expected that the interface becomes less performant for very long documents with many tokens, and we're working on a rewrite that doesn't have this problem. I think what makes it additionally tricky in your case is that you just end up with more tokens overall, due to the way the characters are segmented.

As a workaround, one thing you can do is use patterns to disable all tokens that you know won't ever be part of a relation – of course, only if that's possible. Punctuation is an obvious candidate, but you might also be able to write other disable patterns based on part-of-speech tags etc.
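To sketch what that could look like: a disable-patterns file is a JSONL file of match patterns, one per line, following spaCy's token pattern syntax. The exact attribute names below are an assumption – check the recipe docs for the format your version expects:

```jsonl
{"pattern": [{"is_punct": true}]}
{"pattern": [{"pos": "DET"}]}
```

The first line disables all punctuation tokens, the second all determiners, so they're not selectable in the relations UI.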

Can you double-check that when you load your custom pipeline with the tokenizer in Python and process a text, the tokens are segmented correctly? If a Doc produced by the model shows the correct tokens, Prodigy should reflect this accordingly in all recipes that use the model for tokenization.
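A minimal sketch of that check – here `spacy.blank("en")` stands in for loading your actual custom pipeline from its path:

```python
import spacy

# Stand-in for your custom pipeline,
# e.g. nlp = spacy.load("./your_pipeline")
nlp = spacy.blank("en")

doc = nlp("Hello, world!")
# Inspect how the text was segmented into tokens –
# these are the tokens Prodigy will present in the UI
print([token.text for token in doc])
```

If the token boundaries printed here look wrong, the problem is in the pipeline's tokenizer rather than in Prodigy.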