Errors with pos.teach and pos.batch-train.

ines · July 20, 2018, 6:45pm

The patterns stuff all looks correct. I think the problem is in the data you're annotating, i.e. the Reddit corpus. In the video, we're loading in the pre-extracted data from a directory called train – sorry if this was slightly confusing. See this thread for an explanation:

The fourth argument of your command is the data you're loading in for annotation. So what currently says train needs to be a valid data file. In this case, that means a Reddit comments archive, because you're using --loader reddit:

python3 -m prodigy ner.teach skills_ner en_core_web_lg /path/to/data.bz2 --loader reddit --label EDU --patterns skill_patterns.jsonl

If you haven't done so already, you can download data from the Reddit comments corpus from this page.
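
If you want to double-check the file before starting the server, here's a minimal sketch that peeks at the first few records. It assumes the archive is bz2-compressed, newline-delimited JSON with the comment text under a "body" key, which is how the monthly Reddit comment dumps are laid out as far as I know. The function name preview_reddit_archive is made up for this example and isn't part of Prodigy.

```python
import bz2
import json
from pathlib import Path


def preview_reddit_archive(path, limit=3):
    """Return the comment text of the first `limit` records in a .bz2 archive.

    Assumption: one JSON object per line, with the text under "body".
    """
    path = Path(path)
    if not path.is_file():
        # This is the situation where "train" is not an actual data file.
        raise FileNotFoundError(f"No data file at {path}")
    bodies = []
    # mode="rt" decompresses on the fly and yields decoded text lines.
    with bz2.open(path, mode="rt", encoding="utf8") as archive:
        for line in archive:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            bodies.append(record["body"])
            if len(bodies) >= limit:
                break
    return bodies
```

Calling preview_reddit_archive("/path/to/data.bz2") should give you a short list of comment strings. If it raises FileNotFoundError, a decompression error or a KeyError instead, the path or the file format is most likely what's tripping up the recipe.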
