text quality estimation

codingnoobneedshelp · November 11, 2020, 6:28pm

Hello,

I'm building a parallel corpus where I want to compare German and English sentences.

Basically, I'm trying to reproduce the "Diff" example on the https://prodi.gy/demo page.

As far as I understand, the diff interface only works with the mark, review, and custom recipes.

Which one do I need to use for my use-case?

I have tried the mark recipe with diff as the view_id, but it doesn't seem to be the right recipe.

Thanks

Edit: I think I have figured it out. I assume the "Diff" example uses some kind of translation model and the concept of blocks, right?

ines · November 13, 2020, 1:12am

Hi! From what you describe, it sounds like you might be looking for something like the compare recipe? https://prodi.gy/docs/recipes#compare

It lets you pass in two files containing the two outputs you want to compare, and lets you set a --diff flag to show a visual diff. You can also configure whether the A/B mapping should be randomised, so you can perform an actual A/B evaluation.
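As a rough sketch of what preparing the two input files could look like: the snippet below writes JSONL records with an id, an input and an output. That record layout is how I remember the compare docs describing it, so treat it as an assumption and check it against https://prodi.gy/docs/recipes#compare. The helper names make_records and write_compare_file are made up for this example.

```python
import json
from pathlib import Path


def make_records(sources, outputs):
    """Pair each source sentence with one system's output.

    Assumed layout: {"id": ..., "input": {"text": ...}, "output": {"text": ...}}.
    The ids must match across both files so the outputs can be aligned.
    """
    return [
        {"id": i, "input": {"text": src}, "output": {"text": out}}
        for i, (src, out) in enumerate(zip(sources, outputs))
    ]


def write_compare_file(path, records):
    """Write one JSON object per line (JSONL), keeping umlauts readable."""
    with Path(path).open("w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
```

You would call this once per file, e.g. write_compare_file("a.jsonl", make_records(german, english_a)) and the same for "b.jsonl" with the second set of English sentences, and then run something like `prodigy compare your_dataset a.jsonl b.jsonl --diff` (the dataset and file names here are placeholders).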

Alternatively, you can also write a custom recipe and use the diff interface. The docs show an example of the expected format, which is what your stream needs to yield: https://prodi.gy/docs/api-interfaces#diff You can also put together a custom combined interface, e.g. using diff plus choice for multiple choice options, or whatever else you need: https://prodi.gy/docs/custom-interfaces
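A minimal sketch of the custom-recipe route, assuming the diff interface reads "removed" and "added" keys from each task (that is my recollection of the interface docs, so verify it for your Prodigy version). The function names and option labels are hypothetical. In a real recipe, the returned dict would come from a function decorated with @prodigy.recipe; the decorator is left out here so the snippet runs without Prodigy installed.

```python
def diff_stream(pairs):
    """Yield one diff task per (old_text, new_text) pair.

    Assumption: the diff interface renders "removed" against "added".
    """
    for old_text, new_text in pairs:
        yield {"removed": old_text, "added": new_text}


def diff_components(dataset, pairs):
    """Components a custom recipe would return for a plain diff UI."""
    return {"dataset": dataset, "stream": diff_stream(pairs), "view_id": "diff"}


def diff_choice_components(dataset, pairs, labels):
    """Variant combining diff with multiple choice options via blocks.

    Assumption: diff can be used as a block next to choice; the choice
    block reads its options from each task's "options" list.
    """
    options = [{"id": label, "text": label} for label in labels]

    def stream():
        for task in diff_stream(pairs):
            # Copy the options so tasks do not share one mutable list
            task["options"] = [dict(opt) for opt in options]
            yield task

    return {
        "dataset": dataset,
        "stream": stream(),
        "view_id": "blocks",
        "config": {"blocks": [{"view_id": "diff"}, {"view_id": "choice"}]},
    }
```

The pairs can be any iterable of two strings, e.g. two candidate English renderings of the same German sentence. A diff between a German and an English sentence would highlight nearly every token, so comparing two versions in the same language is probably the more useful setup.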
