Feature Request: Machine Translation View

plusepsilon · January 18, 2018, 10:48pm

It would be great to have a translation interface where you can type out a completely new sentence in a text box (like in machine translation).

The text box may or may not contain a predicted sentence from an MT model. Data structure could be something like:

{
    "text": "It was a pressure to meet you.",
    "predicted_text": "It was pleasure.",
    "corrected_text": "It was a pleasure to meet you."
}

wpm · January 18, 2018, 10:50pm

This could be useful for all manner of sequence-to-sequence learning.

ines · January 19, 2018, 12:21am

Thanks – I like that idea! We could even generalise this a bit more and make it a general “text input” interface that lets you render any task (text, image, NER) and adds an input box to the card that you can optionally pre-populate with text. This means you could use it for machine translation, but also for image captioning etc.

{
    "text": "It was a pressure to meet you.",
    "text_input_default": "It was pleasure."
}

The default content will then be displayed in the text field, and can be edited by the user. It will then be added to the task as "text_input" (not sure about the exact naming of the keys yet).
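
As a rough sketch of how such tasks might be built: the key names follow the proposal above (which was explicitly not final), and `predict` is a hypothetical stand-in for an MT model, not part of any existing API.

```python
from typing import Callable, Dict, Iterable, Iterator, Optional


def make_text_input_tasks(
    texts: Iterable[str],
    predict: Optional[Callable[[str], str]] = None,
) -> Iterator[Dict[str, str]]:
    """Yield one task per source text, optionally pre-populated.

    `predict` is a hypothetical stand-in for an MT model. The
    "text_input_default" key follows the proposal in this thread and
    is an assumption, not a finalised key name.
    """
    for text in texts:
        task = {"text": text}
        if predict is not None:
            # Pre-populate the input box with the model's prediction.
            task["text_input_default"] = predict(text)
        yield task


# Example with a dummy "model" that always returns the same string.
tasks = list(
    make_text_input_tasks(
        ["It was a pressure to meet you."],
        predict=lambda text: "It was pleasure.",
    )
)
```

Leaving out `predict` gives tasks with an empty input box, which covers the "may or may not contain a predicted sentence" case from the original request.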

You could also very easily convert the annotations to a stream of compare or diff examples. This would let you re-annotate the corrections made by the user:

diff_examples = []

for eg in user_input_examples:
    before = eg['text_input_default']
    after = eg['text_input']
    if before != after:  # user has edited the text
        task = {'input': {'text': eg['text']}, 
                'accept': {'text': after}, 
                'reject': {'text': before}}
        # optional: shuffle accept / reject for less biased evaluation
        diff_examples.append(task)
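
The optional shuffling mentioned in the comment could look roughly like the sketch below. The `A`, `B` and `mapping` keys are illustrative assumptions for this example, not a confirmed task format; the idea is only that the annotator sees two unlabeled options while the mapping records which one was the user's correction.

```python
import random
from typing import Any, Dict, Optional


def shuffle_sides(
    task: Dict[str, Any], rng: Optional[random.Random] = None
) -> Dict[str, Any]:
    """Randomly assign the accept/reject texts to display slots A and B.

    The "A", "B" and "mapping" keys are hypothetical names chosen for
    this sketch.
    """
    rng = rng or random.Random()
    sides = ["accept", "reject"]
    rng.shuffle(sides)  # random order, so position carries no signal
    return {
        "input": task["input"],
        "A": task[sides[0]],
        "B": task[sides[1]],
        # Remember which slot holds which side for later evaluation.
        "mapping": {"A": sides[0], "B": sides[1]},
    }
```

Passing a seeded `random.Random` makes the assignment reproducible, which can help when re-running an evaluation.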

plusepsilon · April 18, 2018, 8:28pm

Pinging to see if this is on the roadmap. Thanks!

jlanday · February 28, 2019, 4:44pm

Also interested in whether there has been any progress on this.

bayethiernodiop · June 17, 2020, 2:56pm

@ines any news here? Or is there a way of doing this by creating a custom recipe?
Thanks in advance.

ines · June 17, 2020, 9:35pm

This should be pretty straightforward now with the blocks UI and a text input block (see "Custom Interfaces" in the Prodigy docs).

You can pre-populate the text box with content, e.g. the text produced by your model. I'm showing a similar(ish) workflow with image captioning annotation in my custom recipes video, btw.
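
A minimal sketch of what the components of such a custom recipe might look like, assuming a `blocks` interface that combines a `text` block with a `text_input` block. The `translation` field name, the `predicted_text` key and both helper functions are hypothetical, and the assumption that a task value stored under the `field_id` key pre-populates the box should be checked against the Prodigy docs.

```python
from typing import Any, Dict, Iterable, Iterator


def add_prediction(
    stream: Iterable[Dict[str, Any]], field_id: str = "translation"
) -> Iterator[Dict[str, Any]]:
    """Copy each example and store the model output under `field_id`.

    Assumption: a value under the field's key is what pre-populates
    the text input box. "predicted_text" is a hypothetical input key.
    """
    for eg in stream:
        task = dict(eg)
        task[field_id] = eg.get("predicted_text", "")
        yield task


def build_recipe_components(
    dataset: str,
    stream: Iterable[Dict[str, Any]],
    field_id: str = "translation",
) -> Dict[str, Any]:
    """Build the dictionary a custom recipe function would return."""
    blocks = [
        {"view_id": "text"},  # shows the source sentence
        {
            "view_id": "text_input",  # editable box for the translation
            "field_id": field_id,
            "field_rows": 3,
            "field_label": "Corrected translation",
        },
    ]
    return {
        "dataset": dataset,
        "stream": add_prediction(stream, field_id),
        "view_id": "blocks",
        "config": {"blocks": blocks},
    }
```

In an actual recipe, this dictionary would be returned from a function registered with the `@prodigy.recipe` decorator; the sketch leaves that out so it runs without Prodigy installed.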
