Hello @ljvmiranda921, thanks a lot for responding! I forgot to call set_hashes on the input data before saving it to JSONL. After I added that line everything worked, so the problem is solved.
However, while I have you here, I wanted to ask a quick question (I didn't want to start another topic, but I can move it if necessary). I want to embed PDF files in this scheme (to compare two PDF files rather than two strings) and pass dicts with relative paths to the PDF files as input data, like this:
{"pdf1": "file_1.pdf", "pdf2": "file_2.pdf"}
{"pdf1": "file_3.pdf", "pdf2": "file_4.pdf"}
and embed them in the scheme using, e.g.:
html_template = f"""
<embed src="{{pdf1}}" width="600px" height="2100px" />
<br/><br/>
<embed src="{{pdf2}}" width="600px" height="2100px" />
"""
When I use relative file paths for each PDF (the PDFs are currently in the folder from which I start Prodigy), I get the following message instead of the PDF:
{"detail":"Not Found"}
Additionally, when I use a full file path such as: file:///full/file/path/file.pdf
nothing appears. However, when I uploaded my document to Google Drive and used a shareable link (hard-coded in the recipe for both documents), everything displayed properly. It's quite baffling, as I thought switching from strings to embedded PDFs would be quick and easy. Maybe there is something really silly I am missing, so I'd be thankful for a pointer in any direction.
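One possible workaround, sketched here and untested against Prodigy itself: since a link the browser can fetch works, the PDFs could be inlined as base64 `data:` URIs instead of being referenced by local path. This assumes the browser will render a `data:application/pdf` URI inside an `<embed>` tag and that the files are small enough to inline. The helper names `pdf_to_data_uri` and `to_task` are hypothetical.

```python
import base64
from pathlib import Path


def pdf_to_data_uri(path):
    """Read a PDF from disk and return it as a base64 data URI string."""
    raw = Path(path).read_bytes()
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:application/pdf;base64,{encoded}"


def to_task(pdf1_path, pdf2_path):
    """Build one record whose values can go straight into the src attributes."""
    return {
        "pdf1": pdf_to_data_uri(pdf1_path),
        "pdf2": pdf_to_data_uri(pdf2_path),
    }
```

Each record returned by `to_task` would be written to the JSONL file in place of the relative paths, so the template itself would not need to change.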