I am trying to build a dependency parser on short phrases which do not necessarily grammatical like natural language style sentence. I have some annotations on the phrases for dependencies which I transferred from similar longer sentences and some from prodi.gy annotations. The way I have them is in pairs of dependencies and does not necessarily guarantee a ROOT for the phrase every time. I have couple of questions --
- With just pairs of dependencies, what is the best way to convert them in the spacy training json format ?
- Would these incomplete sparse training pairs (without necessarily a ROOT) work with spacy parser retraining (I am planning to fine-tune existing spacy parser).