Extracting word translations

thomas · May 25, 2021, 8:56am

Hi, I want to extract token translations from aligned sentences (DE - EN) with Prodigy. Can you give me some advice on which annotation recipe(s) would work best in this case? In the end, I would like to obtain a trained model that can predict the token translation pairs for given sentence alignments (based on the annotations). Thanks in advance for any help!

ines · May 28, 2021, 12:35am

Hi! Just to make sure I understand the exact use case and annotation task: You have two aligned sentences and you now want to match up the exact tokens? Or do you already have specific spans of text in sentence A that you now want to highlight in sentence B?

Also, do you already have a model you want to use for this task? Are there existing predictions you want to include, or are you just looking to create data that you can then export for training?
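
To make the first reading of the task concrete (matching up exact tokens across two aligned sentences), here is a minimal sketch in plain Python. It is only an illustration under assumptions: the whitespace tokenization, the `extract_token_pairs` helper, and the index-pair alignment format are all hypothetical and not a Prodigy data format or API.

```python
from typing import List, Tuple


def extract_token_pairs(
    src_sentence: str,
    tgt_sentence: str,
    alignment: List[Tuple[int, int]],
) -> List[Tuple[str, str]]:
    """Turn (source index, target index) pairs into (source token, target token) pairs.

    Assumption: tokens are split on whitespace. A real setup would use
    the same tokenizer as the model that is trained later.
    """
    src_tokens = src_sentence.split()
    tgt_tokens = tgt_sentence.split()
    return [(src_tokens[i], tgt_tokens[j]) for i, j in alignment]


# Hypothetical annotated example: each pair links a DE token index to an EN token index.
de = "Das Haus ist klein"
en = "The house is small"
alignment = [(0, 0), (1, 1), (2, 2), (3, 3)]

pairs = extract_token_pairs(de, en, alignment)
print(pairs)
```

Storing indices rather than strings keeps repeated words unambiguous and allows one token to be linked to several tokens in the other sentence.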
