It’s really only the models.
In theory, it’s possible for the tokenization to differ in very specific edge cases. But it’s extremely unlikely that this would affect any of the entity spans you’ve annotated: for that to happen, the character offsets of the entities would have to no longer map to valid token boundaries. This is also something you can verify pretty easily yourself: for every span you’ve annotated in a document, Doc.char_span needs to succeed, i.e. return a Span rather than None.
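
Here's a minimal sketch of what that check could look like. It assumes your annotations are stored as (start_char, end_char, label) tuples per text, and the helper name find_misaligned_spans is just made up for illustration. It uses a blank English pipeline so it runs without downloading anything; in practice you'd load whichever pipeline you actually want to check.

```python
import spacy


def find_misaligned_spans(nlp, text, spans):
    """Return the annotated spans whose offsets don't match token boundaries.

    `spans` is assumed to be a list of (start_char, end_char, label) tuples.
    This data format is an assumption for the example, adapt it to yours.
    """
    # make_doc only runs the tokenizer, which is all we need here
    doc = nlp.make_doc(text)
    misaligned = []
    for start, end, label in spans:
        # With the default strict alignment, char_span returns None if the
        # character offsets don't line up with token boundaries
        if doc.char_span(start, end, label=label) is None:
            misaligned.append((start, end, label))
    return misaligned


# Blank pipeline as a stand-in; swap in spacy.load(...) for your own pipeline
nlp = spacy.blank("en")
text = "Apple is opening an office in San Francisco."
good = [(0, 5, "ORG"), (30, 43, "GPE")]  # "Apple", "San Francisco"
bad = [(0, 3, "ORG")]  # "App" ends in the middle of a token

print(find_misaligned_spans(nlp, text, good))
print(find_misaligned_spans(nlp, text, good + bad))
```

If the list comes back empty for all of your documents, the annotated offsets still map to valid token boundaries under that tokenizer.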