Hi there,
I'm trying to do binary annotation to classify a new entity type on-top of en_core_web_lg but am having trouble using the --binary flag for training. Here's the output I'm getting:
$ prodigy train ner phone_ents_train en_core_web_lg --binary
Loaded model 'en_core_web_lg'
Using 296 train / 296 eval (split 50%)
Component: ner | Batch size: compounding | Dropout: 0.2 | Iterations: 10
Baseline accuracy: 0.000
=========================== Training the model ===========================
Loss Skip Right Wrong Accuracy
Traceback (most recent call last):
File "C:\Users\613629\Anaconda3\lib\runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "C:\Users\613629\Anaconda3\lib\runpy.py", line 87, in run_code
exec(code, run_globals)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\prodigy_main.py", line 53, in
controller = recipe(args, use_plac=True)
File "cython_src\prodigy\core.pyx", line 321, in prodigy.core.recipe.recipe_decorator.recipe_proxy
File "C:\Users\613629\Projects\prodigy\lib\site-packages\plac_core.py", line 367, in call
cmd, result = parser.consume(arglist)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\plac_core.py", line 232, in consume
return cmd, self.func((args + varargs + extraopts), **kwargs)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\prodigy\recipes\train.py", line 174, in train
losses = annot_model.batch_train(
File "cython_src\prodigy\models\ner.pyx", line 346, in prodigy.models.ner.EntityRecognizer.batch_train
File "cython_src\prodigy\models\ner.pyx", line 438, in prodigy.models.ner.EntityRecognizer._update
File "cython_src\prodigy\models\ner.pyx", line 431, in prodigy.models.ner.EntityRecognizer._update
File "C:\Users\613629\Projects\prodigy\lib\site-packages\spacy\language.py", line 460, in disable_pipes
return DisabledPipes(self, *names)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\spacy\language.py", line 1124, in init
self.extend(nlp.remove_pipe(name) for name in names)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\spacy\language.py", line 1124, in
self.extend(nlp.remove_pipe(name) for name in names)
File "C:\Users\613629\Projects\prodigy\lib\site-packages\spacy\language.py", line 418, in remove_pipe
raise ValueError(Errors.E001.format(name=name, opts=self.pipe_names))
ValueError: [E001] No component 'sentencizer' found in pipeline. Available names: ['ner']
I've tried googling the solution, but I'm pretty lost at this point. Thanks!