How I can export all datasets or how i can list all datasets name in Prodigy
Hi @Mohammad,
You can list all datasets in your Prodigy DB with prodigy stats -ls
command (documentation).
Then, to export a dataset, you can use prodigy db-out {dataset_name} {output_dir}
(documentation
If you have many datasets to export you can run the db-out
command in loop with a simple bash script.
For example, if you store the dataset names (one name per line) in datasets.txt
, you could run:
cat datasets.txt | while read line; do python -m prodigy db-out "$line" my_folder; done
That would export all the datasets listed in datasets.txt
in .jsonl
format to my_folder
directory.