We have a space pipeline that processes messages from a chats. We use a document per message and have
Doc extensions to pass in extra information that is used in our pipeline components, e.g with an identifier for each chat, the names of the participants, who sent the chat message, etc. We then have to separately call
Language.make_doc to create a
Doc instance, add the required data, and then thread the document through a loop over the pipeline components (is there a better way of handling this in Spacy?).
I’m wondering how we can use Prodigy with this pipeline, as I see no way to pass in the information required. I see the Prodigy
meta field, but don’t think that does what we need.