Model explanation

Model explanation is a hot topic right now, and business users are asking for it. If Prodigy could highlight which words/phrases carry more weight, humans should be able to annotate even faster and more accurately.

I've seen some discussion about having the model return a weight for each token. I wonder whether Prodigy is planning to introduce model explanation out of the box. Or is there a way I can implement this functionality myself? Thanks.

I think this would be a great addition to Prodigy. I have implemented this outside of Prodigy and it really helps to understand what the model has actually learned.

My implementation is for a RoBERTa text classification model and is built on top of PyTorch's Captum library, using GradientShap.

[screenshot: the resulting token-attribution visualization]
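
For readers who want to try something similar: here is a minimal sketch of this kind of setup, not the exact implementation shown above. It feeds word embeddings (rather than token IDs) into the model so GradientShap can work over continuous inputs, attributes the target class logit to them, and reduces the result to one score per subtoken. The model name is a stand-in for whatever fine-tuned checkpoint you actually use, and `target_class` is assumed to be the index of the class you want to explain.

```python
import torch
from captum.attr import GradientShap
from transformers import RobertaForSequenceClassification, RobertaTokenizerFast

MODEL_NAME = "roberta-base"  # placeholder for your fine-tuned checkpoint
tokenizer = RobertaTokenizerFast.from_pretrained(MODEL_NAME)
model = RobertaForSequenceClassification.from_pretrained(MODEL_NAME)
model.eval()


def forward_from_embeddings(inputs_embeds, attention_mask):
    # GradientShap needs a forward function over continuous inputs,
    # so we pass word embeddings instead of token IDs.
    return model(inputs_embeds=inputs_embeds, attention_mask=attention_mask).logits


def token_attributions(text, target_class=1, n_samples=20):
    enc = tokenizer(text, return_tensors="pt")
    input_ids, attention_mask = enc["input_ids"], enc["attention_mask"]
    embed = model.roberta.embeddings.word_embeddings
    inputs_embeds = embed(input_ids)
    # Baseline: the same sequence, but with every token replaced by <pad>.
    baseline_embeds = embed(torch.full_like(input_ids, tokenizer.pad_token_id))

    gs = GradientShap(forward_from_embeddings)
    attrs = gs.attribute(
        inputs_embeds,
        baselines=baseline_embeds,
        target=target_class,
        n_samples=n_samples,
        additional_forward_args=(attention_mask,),
    )
    # Sum over the embedding dimension -> one score per subtoken, then normalize.
    scores = attrs.sum(dim=-1).squeeze(0)
    scores = scores / (scores.abs().max() + 1e-9)
    tokens = tokenizer.convert_ids_to_tokens(input_ids.squeeze(0).tolist())
    return list(zip(tokens, scores.tolist()))
```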

Yes, that's a cool idea! Prodigy should already have all the building blocks for that – you just need to implement the process that computes the weights for the tokens/subtokens or whatever else you want to interpret.

This thread is slightly older, but it has some custom recipes and ideas for visualizing attention during text classification annotation:

We'd love to have this more easily accessible for spaCy models! Otherwise, it really depends on your model, the framework you're using (both for ML and for interpretability) and what you're trying to do, so there's no out-of-the-box answer. But Prodigy should provide the building blocks you need to incorporate model interpretability into your annotation workflow.
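
As one example of such a building block, a small helper can turn the `(subtoken, score)` pairs from the sketch above into shaded HTML markup that Prodigy can render directly. The color scheme and styling here are just illustrative choices.

```python
import html


def scores_to_html(token_scores):
    """Render (subtoken, score) pairs as shaded HTML, one span per subtoken."""
    pieces = []
    for token, score in token_scores:
        token = token.replace("Ġ", " ")  # strip RoBERTa's BPE space marker
        # Green for tokens pushing towards the target class, red for those
        # pushing away; the alpha channel encodes the normalized magnitude.
        rgb = "0, 128, 0" if score >= 0 else "200, 0, 0"
        pieces.append(
            f'<span style="background: rgba({rgb}, {abs(score):.2f}); '
            f'border-radius: 3px">{html.escape(token)}</span>'
        )
    return "".join(pieces)
```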

This looks great! :100: And this seems to already return formatted HTML, right? So I guess you could stream that in using the html interface, or add it as a separate block?
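
A rough sketch of what streaming this in via the html interface could look like as a custom recipe, reusing `tokenizer`, `model`, `token_attributions` and `scores_to_html` from the sketches above (the recipe name, dataset/source arguments and label handling are all assumptions):

```python
import torch
import prodigy
from prodigy.components.loaders import JSONL


@prodigy.recipe(
    "textcat.explain",
    dataset=("Dataset to save annotations into", "positional", None, str),
    source=("JSONL file with a 'text' field per line", "positional", None, str),
)
def textcat_explain(dataset, source):
    """Stream texts with attribution HTML attached, for binary accept/reject."""

    def get_stream():
        for eg in JSONL(source):
            with torch.no_grad():
                enc = tokenizer(eg["text"], return_tensors="pt")
                pred = model(**enc).logits.argmax(dim=-1).item()
            # Attach the predicted label and the rendered explanation markup.
            eg["label"] = model.config.id2label[pred]
            eg["html"] = scores_to_html(
                token_attributions(eg["text"], target_class=pred)
            )
            yield eg

    return {
        "dataset": dataset,
        "stream": get_stream(),
        "view_id": "html",  # or combine with other interfaces via "blocks"
    }
```

Saved together with the helpers in a file such as `recipe.py` (file and dataset names here are just examples), it could be started with `prodigy textcat.explain explained_texts ./texts.jsonl -F recipe.py`.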

(Also, a small detail I want to add to the regular ner/spans interface: if individual spans can take a "color" value, you could easily implement the same visualization just with character offsets and different colour shades depending on the score, without having to assign distinct labels and label colours.)
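
If that per-span "color" value does land in the interface, building the tasks could look roughly like this. This is purely illustrative, since the feature is described above as something still to be added; the `#RRGGBBAA` shading scheme and the helper names are assumptions.

```python
def score_to_shade(score):
    # Map a normalized score in [0, 1] to a yellow highlight whose alpha
    # channel reflects the magnitude (#RRGGBBAA hex notation).
    alpha = int(round(255 * max(0.0, min(1.0, score))))
    return f"#ffd700{alpha:02x}"


def make_task(text, scored_offsets):
    """scored_offsets: iterable of (start_char, end_char, score) tuples."""
    return {
        "text": text,
        "spans": [
            {"start": start, "end": end, "color": score_to_shade(score)}
            for start, end, score in scored_offsets
        ],
    }
```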

Great, I'll try both approaches and post the results here. Thanks!