data-to-spacy for rel_component training

ljvmiranda921 · August 16, 2022, 7:46am

Ok, let us step back for a bit. I realized that since you already have the labeled documents in Prodigy, you can export them to a .jsonl file using the db-out command, then reuse or modify this parse_data.py script to convert the JSONL file into the spaCy format.
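
Before converting anything, it can be worth checking what the export actually contains. Here is a minimal sketch, assuming you exported with something like `prodigy db-out your_dataset > annotations.jsonl` (the dataset and file names are placeholders). The `load_accepted` helper is hypothetical and not part of Prodigy or parse_data.py; it only relies on each line being one JSON object with an `answer` field.

```python
import json
from pathlib import Path


def load_accepted(path):
    """Read a JSONL export and keep only the examples marked as accepted."""
    examples = []
    with Path(path).open(encoding="utf8") as f:
        for line in f:
            line = line.strip()
            if not line:  # tolerate blank lines in the export
                continue
            record = json.loads(line)
            # Assumption: rejected or ignored examples should not be trained on.
            if record.get("answer") == "accept":
                examples.append(record)
    return examples
```

Calling `load_accepted("annotations.jsonl")` then gives you a list of dicts you can inspect before handing the file to the conversion script.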

It errored out because the component expects some labels to be available before it is initialized. You can see this being done in the main function. So you have to run something like:

python scripts/parse_data.py path/to/json path/to/train.spacy path/to/dev.spacy path/to/test.spacy

If you're using your own dataset, you might need to adjust the parsing process. But a good first step would be to try this script out on your own exported JSONL files.
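
To figure out which parts of the parsing might need adjusting, you could first summarize the relations in your export. The sketch below counts relation labels and flattens each relation into a `(head_start, child_start, label)` tuple. It assumes each record has a `relations` list whose items carry `label`, `head_span` and `child_span` with `token_start` offsets, as I would expect from a `rel.manual` export; if your data looks different, change the keys. `summarize_relations` is a hypothetical helper, not something taken from parse_data.py.

```python
from collections import Counter


def summarize_relations(examples):
    """Count relation labels and collect (head_start, child_start, label) tuples."""
    label_counts = Counter()
    triples = []
    for eg in examples:
        # Records without any annotated relations are simply skipped.
        for rel in eg.get("relations", []):
            label = rel["label"]
            # Assumption: spans are identified by the index of their first token.
            head = rel["head_span"]["token_start"]
            child = rel["child_span"]["token_start"]
            label_counts[label] += 1
            triples.append((head, child, label))
    return label_counts, triples
```

The keys of `label_counts` are the relation labels present in your data, which is a quick way to check that the labels are what you expect before training.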
