Hi,
Here is the output of a training run with verbose logging enabled:
(prodigy-env) (base) gebruiker@xxxxx:~/anaconda3/envs$ PRODIGY_LOGGING=verbose python -m prodigy train /home/gebruiker/Documenten/ --ner dataset,dataset_anon,dataset_combined --base-model en_core_web_lg
08:32:23: INIT: Setting all logging levels to 10
08:32:23: RECIPE: Calling recipe 'train'
Using CPU
========================= Generating Prodigy config =========================
Auto-generating config with spaCy
08:32:30: CONFIG: Using config from global prodigy.json
/home/gebruiker/.prodigy/prodigy.json
08:32:30: DB: Initializing database SQLite
08:32:30: DB: Connecting to database SQLite
Using config from base model
Generated training config
=========================== Initializing pipeline ===========================
Killed
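A bare "Killed" with no Python traceback is often a sign that the Linux kernel's out-of-memory killer stopped the process, although nothing in the log above confirms that this is what happened here. One way to check is to look at the available memory just before starting the training. The following is a minimal sketch, assuming a Linux system that exposes /proc/meminfo; the helper names parse_meminfo and available_gib are hypothetical and not part of Prodigy or spaCy.

```python
from pathlib import Path
from typing import Dict, Optional


def parse_meminfo(text: str) -> Dict[str, int]:
    """Parse /proc/meminfo-style text into {field: integer value}.

    Most fields are reported in kB; a few (such as HugePages_Total)
    are plain counts without a unit.
    """
    values: Dict[str, int] = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def available_gib(meminfo_path: str = "/proc/meminfo") -> Optional[float]:
    """Return MemAvailable in GiB, or None if it cannot be determined."""
    path = Path(meminfo_path)
    if not path.is_file():
        return None
    kib = parse_meminfo(path.read_text()).get("MemAvailable")
    return None if kib is None else kib / (1024 ** 2)


if __name__ == "__main__":
    gib = available_gib()
    print("MemAvailable (GiB):", "unknown" if gib is None else round(gib, 2))
```

If the reported value is small compared with what loading en_core_web_lg plus the training data needs, memory would be a plausible suspect.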
The database is 170 MB. I can open it in VS Code and in the Google SQLite viewer and see the datasets there. Training uses the config.cfg file from the base model, en_core_web_lg, with a batch size of 256.
Running the stats command fails with the following error:
(prodigy-env) (base) gebruiker@xxxxx:~/anaconda3/envs$ python -m prodigy stats -l
Traceback (most recent call last):
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3133, in connect
self._state.set_connection(self._connect())
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3478, in _connect
conn = sqlite3.connect(self.database, timeout=self._timeout,
sqlite3.OperationalError: unable to open database file
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3195, in execute_sql
cursor = self.cursor(commit)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3179, in cursor
self.connect()
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3136, in connect
self._initialize_connection(self._state.conn)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 2970, in __exit__
reraise(new_type, new_type(exc_value, *exc_args), traceback)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 191, in reraise
raise value.with_traceback(tb)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3133, in connect
self._state.set_connection(self._connect())
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3478, in _connect
conn = sqlite3.connect(self.database, timeout=self._timeout,
peewee.OperationalError: unable to open database file
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/gebruiker/anaconda3/lib/python3.9/runpy.py", line 197, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/home/gebruiker/anaconda3/lib/python3.9/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/prodigy/__main__.py", line 61, in <module>
controller = recipe(args, use_plac=True)
File "cython_src/prodigy/core.pyx", line 364, in prodigy.core.recipe.recipe_decorator.recipe_proxy
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/plac_core.py", line 367, in call
cmd, result = parser.consume(arglist)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/plac_core.py", line 232, in consume
return cmd, self.func((args + varargs + extraopts), **kwargs)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/prodigy/recipes/commands.py", line 46, in stats
"total_datasets": len(DB.datasets),
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/prodigy/components/db.py", line 236, in datasets
return [ds.name for ds in datasets]
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 7014, in __iter__
self.execute()
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 1927, in inner
return method(self, database, *args, **kwargs)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 1998, in execute
return self._execute(database)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 2171, in _execute
cursor = database.execute(self)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3210, in execute
return self.execute_sql(sql, params, commit=commit)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3204, in execute_sql
self.commit()
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 2970, in __exit__
reraise(new_type, new_type(exc_value, *exc_args), traceback)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 191, in reraise
raise value.with_traceback(tb)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3195, in execute_sql
cursor = self.cursor(commit)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3179, in cursor
self.connect()
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3136, in connect
self._initialize_connection(self._state.conn)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 2970, in __exit__
reraise(new_type, new_type(exc_value, *exc_args), traceback)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 191, in reraise
raise value.with_traceback(tb)
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3133, in connect
self._state.set_connection(self._connect())
File "/home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/peewee.py", line 3478, in _connect
conn = sqlite3.connect(self.database, timeout=self._timeout,
peewee.OperationalError: unable to open database file
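The "unable to open database file" message comes from SQLite itself and does not say whether the file is missing, unreadable, or at a different path than expected. One way to narrow this down outside of Prodigy is to open the file read-only with Python's built-in sqlite3 module and list its tables. This is a minimal sketch; the function name check_sqlite_file is hypothetical, and the path ~/.prodigy/prodigy.db is an assumption based on the prodigy.json location shown in the log above, so it should be adjusted if the database lives elsewhere.

```python
import os
import sqlite3
from pathlib import Path
from typing import List


def check_sqlite_file(db_path: str) -> List[str]:
    """Open db_path read-only and return the names of its tables.

    Raises FileNotFoundError or PermissionError with a more specific
    message than SQLite's generic "unable to open database file".
    """
    path = Path(db_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"No database file at {path}")
    if not os.access(path, os.R_OK):
        raise PermissionError(f"Cannot read {path}")
    # mode=ro makes sure the file is never created or modified here.
    uri = f"{path.resolve().as_uri()}?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]


if __name__ == "__main__":
    # Assumed location of the Prodigy database; change it if needed.
    try:
        print(check_sqlite_file("~/.prodigy/prodigy.db"))
    except (OSError, sqlite3.Error) as err:
        print("Could not open database:", err)
```

If this lists the tables without an error, the file itself is fine and the problem is more likely in where Prodigy looks for it, for example the working directory or the user running the command.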
(prodigy-env) (base) gebruiker@xxxxx:~/anaconda3/envs$ python -m spacy info
============================== Info about spaCy ==============================
spaCy version 3.4.1
Location /home/gebruiker/anaconda3/envs/prodigy-env/lib/python3.9/site-packages/spacy
Platform Linux-5.15.0-46-generic-x86_64-with-glibc2.35
Python version 3.9.12
Pipelines en_core_web_lg (3.4.0)
I think this is caused by a dependency problem.
Thanks for your help.
Regards, Rahul