Datasets and using pre-annotated data

ines · May 3, 2019, 3:49pm

If you look at the “Annotation task formats” section in your PRODIGY_README.html, you’ll find the exact JSON format that Prodigy expects for pre-annotated data for the different annotation types (NER, text classification etc.). The format should be pretty straightforward: for each example, you usually have a "text" and then either a "label" or "spans", depending on what you’re annotating. You can then convert your pre-annotated data accordingly. For example, for named entity recognition, you’ll need the text and the start/end character offsets and labels for the entities in that text.

Topic		Replies	Views
How to download the dataset I annotated using the prodigy tool in json format？ Getting Started database	3	1123	March 6, 2023
prelabel data using regex and how to use the active learning functionality and get the model usage , ner , spacy	3	546	October 14, 2021
Use Prodigy purely as an annotating tool? usage , spacy , solved	10	1922	December 12, 2018
Getting Started Questions usage , ner	1	631	November 6, 2018
Data format for label correction task based on pre-labelled dataset Getting Started	5	351	June 24, 2022

Datasets and using pre-annotated data

Related topics