Hi! The JSON data specifies both the tokens and the character offsets, so it's mostly a matter of converting the character offsets to token-based tags (e.g. BILUO). spaCy has some handy utilities that can do this for you automatically. See here for a code example: https://prodi.gy/docs/named-entity-recognition#tip-offsets-biluo
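
In case it helps to see what the conversion involves, here's a rough dependency-free sketch. It assumes each token has character-level `start` and `end` offsets and each span has `start`, `end` and `label`; the `spans_to_biluo` helper and the example record are made up for illustration and are not part of spaCy or Prodigy. In practice you'd probably want the spaCy utility from the link above (in spaCy v3 that should be `spacy.training.offsets_to_biluo_tags`), since it works directly on a `Doc`.

```python
def spans_to_biluo(tokens, spans):
    """Convert character-offset spans into one BILUO tag per token.

    Hypothetical helper for illustration only, not a spaCy/Prodigy API.
    """
    tags = ["O"] * len(tokens)
    # Look up token indices by their exact character boundaries.
    starts = {t["start"]: i for i, t in enumerate(tokens)}
    ends = {t["end"]: i for i, t in enumerate(tokens)}
    for span in spans:
        first = starts.get(span["start"])
        last = ends.get(span["end"])
        if first is None or last is None or first > last:
            # Span doesn't line up with token boundaries: skipped in this sketch.
            continue
        label = span["label"]
        if first == last:
            tags[first] = f"U-{label}"      # single-token entity
        else:
            tags[first] = f"B-{label}"      # first token of the entity
            for i in range(first + 1, last):
                tags[i] = f"I-{label}"      # tokens inside the entity
            tags[last] = f"L-{label}"       # last token of the entity
    return tags


# Made-up example record with assumed "tokens" and "spans" keys.
example = {
    "text": "Apple hired Tim Cook",
    "tokens": [
        {"text": "Apple", "start": 0, "end": 5, "id": 0},
        {"text": "hired", "start": 6, "end": 11, "id": 1},
        {"text": "Tim", "start": 12, "end": 15, "id": 2},
        {"text": "Cook", "start": 16, "end": 20, "id": 3},
    ],
    "spans": [
        {"start": 0, "end": 5, "label": "ORG"},
        {"start": 12, "end": 20, "label": "PERSON"},
    ],
}

tags = spans_to_biluo(example["tokens"], example["spans"])
for token, tag in zip(example["tokens"], tags):
    print(token["text"], tag)
```

Spans that don't line up exactly with token boundaries are just skipped here, so you may want to log or inspect those cases in your own data.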