spaCy supports basic tokenization for Japanese, so you should be able to use Prodigy to collect annotations on Japanese text. There are no pre-trained models available, though, so you'd have to start from scratch.
See here for similar threads and discussion:
First of all, really nice work!
I am curious about support for languages other than English, especially CJK languages.
I couldn’t find any clue about that from the online demo.
Thanks in advance!
I guess this is a mixed spaCy/Prodigy crossover question, but I’m looking into working with Norwegian language models, specifically for text classification tasks. As far as I can see, support for this in spaCy is pretty sparse (it briefly appeared in version 1.9, I think?).
What do I have to add to spaCy to make a minimally working Norwegian model for text classification in Prodigy? I recently saw that you can import fastText vectors, which might take care of a bit of the work.
(There is also the pol…
My team wants to load and re-train some Chinese models through Prodigy, e.g. an NER model and a text classification model. Since spaCy doesn’t provide any basic Chinese models, we are trying to implement these recipes ourselves.
'update': model.update, # update model with annotations
I am still confused about the input, the output and the main processing logic of this bound method, model.update. Is there a template or example I can follow to write a correct one?
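To make the shape of that callback concrete, here is a minimal sketch, assuming the function registered under 'update' is called with a batch of annotated tasks: a list of dicts that each carry the original task fields plus an "answer" of "accept", "reject" or "ignore". ToyTextClassifier and its predict method are hypothetical stand-ins and not part of Prodigy or spaCy. The return value (a count of examples used) is also just an assumption for illustration, and the whitespace split is a placeholder, since Chinese text would need a proper segmenter.

```python
from collections import Counter, defaultdict


class ToyTextClassifier:
    """Hypothetical stand-in for a real model: counts tokens per label."""

    def __init__(self):
        self.token_counts = defaultdict(Counter)  # label -> token frequencies
        self.seen = 0  # number of accepted examples used so far

    def update(self, answers):
        """Receive one batch of annotated tasks (a list of dicts)."""
        used = 0
        for eg in answers:
            # Only learn from examples the annotator explicitly accepted.
            if eg.get("answer") != "accept":
                continue
            # Placeholder tokenization; swap in a real segmenter for Chinese.
            self.token_counts[eg["label"]].update(eg["text"].split())
            used += 1
        self.seen += used
        return used  # assumption: returned only for our own bookkeeping

    def predict(self, text):
        """Return the label whose stored tokens overlap most with the text."""
        tokens = text.split()
        scores = {
            label: sum(counts[t] for t in tokens)
            for label, counts in self.token_counts.items()
        }
        return max(scores, key=scores.get) if scores else None


model = ToyTextClassifier()

# Only the relevant key of the dict a custom recipe returns is shown here.
components = {
    "update": model.update,  # update model with annotations
}

# Simulate one batch of answers coming back from the annotation app.
batch = [
    {"text": "great product", "label": "POSITIVE", "answer": "accept"},
    {"text": "terrible service", "label": "NEGATIVE", "answer": "accept"},
    {"text": "great service", "label": "NEGATIVE", "answer": "reject"},
]
used = components["update"](batch)
```

The point is that the callback is an ordinary function over a list of dicts, so it can be exercised with hand-written batches like the one above before it is wired into a recipe. A real implementation would convert each accepted or rejected example into whatever training format the model expects.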