How to read dep.teach dependencies?

carlson · February 13, 2019, 4:00pm

Hello again,
After solving this problem, I have started to label data via dep.teach. However, how the arcs are displayed seem either unintuitive or inconsistent. I understand fully which direction the arcs should be pointing since I’ve manually labelled a lot of data using a custom program utilizing displaCy. But, more than half the label candidates that Prodigy pulls has the correct label, with the opposite arc direction.

Example (Prodigy):

(Terminal)

>>> import spacy
>>> nlp = spacy.load('en_core_web_sm')
loading data for custok
>>> doc = nlp('no free fluid in the pelvis')
>>> print([(t.text,t.head.text,t.dep_) for t in doc])
[('no', 'free fluid', 'negate'), ('free fluid', 'free fluid', 'ROOT'), ('in', 'free fluid', 'prep'), ('the', 'pelvis', '-'), ('pelvis', 'in', 'refer')]

In the above example, the head of the should be pelvis with dep - as the terminal suggest and that is how I’ve pretrained this model. In Prodigy, it suggests the correct label, but the arc is the opposite direction. However, in the below example, it does provide the correct label and the correct arc for free fluid and in that I’ve trained the model to do.

After going through ~500 labels in Prodigy, more than 50% of the label candidates I had to reject because it’s the correct label, but the opposite direction, and I spot check here and there to make sure that my model should have predicted the correct arc. I tried a quick dep.batch-train to see if it would increase my accuracy with how I think I should be accepting/rejecting these examples, and my model accuracy was reducing on every iteration for 10 iterations.

My question then is am I supposed to ignore the arc direction in Prodigy? Or is this expected behaviour of Prodigy to be suggesting incorrect arcs intentionally?

honnibal · February 18, 2019, 2:43pm

Thanks for the report on this. We took a bit of time to track this down, but it’s definitely a bug in the display: when the dependencies are rightward, the arrow is reversed in the front-end, causing the confusing display you’ve been seeing.

We’ve fixed the issue, and are preparing a new release. Hopefully it’ll be up today.

carlson · February 18, 2019, 2:53pm

Thanks for taking the time to look into this!

Topic		Replies	Views
Dep.Teach doesn't use same tokenenization as pretrained model spacy , dep	13	1802	March 10, 2020
Prodigy Output Visualization, Dependencies Structure Training Help usage , ner , spacy , dep	3	989	June 18, 2021
Error: Can't find label with ner.teach ner , done , spacy	4	598	August 27, 2020
Correct procedure for ner.teach usage , ner , spacy	7	571	May 25, 2022
Prodigy not labeling correctly usage , ner	1	512	July 18, 2018

How to read dep.teach dependencies?

Related topics