I created a dataset with joint entity and relation annotations using the rel.manual recipe. When I tried to train a model on that dataset, I received the error shown below. I assumed the span labels would produce valid data for the 'ner' component, so I'm unsure what could be causing this issue. I would appreciate any support, thanks!
=========================== Initializing pipeline ===========================
[2022-07-01 20:28:54,391] [INFO] Set up nlp object from config
Components: ner, parser
Merging training and evaluation data for 2 components
- [ner] Training: 686 | Evaluation: 171 (20% split)
✘ Invalid data for component 'ner'
spans -> 0 -> start field required
spans -> 0 -> end field required
Here is the sample data:
{'text': 'The statement credit benefit applies to the Global Entry, TSA Pre Check or NEXUS programs.', 'meta': {'source': 'Aeroplan Card | Chase.com'}, '_input_hash': 1972453691, '_task_hash': -354522162, '_is_binary': False, 'spans': [{'label': 'OtherPrograms'}, {'start': 44, 'end': 56, 'token_start': 7, 'token_end': 8, 'label': 'OtherPrograms'}, {'start': 58, 'end': 71, 'token_start': 10, 'token_end': 12, 'label': 'OtherPrograms'}, {'start': 75, 'end': 80, 'token_start': 14, 'token_end': 14, 'label': 'OtherPrograms'}], 'tokens': [{'text': 'The', 'start': 0, 'end': 3, 'id': 0, 'ws': True, 'disabled': False}, {'text': 'statement', 'start': 4, 'end': 13, 'id': 1, 'ws': True, 'disabled': False}, {'text': 'credit', 'start': 14, 'end': 20, 'id': 2, 'ws': True, 'disabled': False}, {'text': 'benefit', 'start': 21, 'end': 28, 'id': 3, 'ws': True, 'disabled': False}, {'text': 'applies', 'start': 29, 'end': 36, 'id': 4, 'ws': True, 'disabled': False}, {'text': 'to', 'start': 37, 'end': 39, 'id': 5, 'ws': True, 'disabled': False}, {'text': 'the', 'start': 40, 'end': 43, 'id': 6, 'ws': True, 'disabled': False}, {'text': 'Global', 'start': 44, 'end': 50, 'id': 7, 'ws': True, 'disabled': False}, {'text': 'Entry', 'start': 51, 'end': 56, 'id': 8, 'ws': False, 'disabled': False}, {'text': ',', 'start': 56, 'end': 57, 'id': 9, 'ws': True, 'disabled': False}, {'text': 'TSA', 'start': 58, 'end': 61, 'id': 10, 'ws': True, 'disabled': False}, {'text': 'Pre', 'start': 62, 'end': 65, 'id': 11, 'ws': True, 'disabled': False}, {'text': 'Check', 'start': 66, 'end': 71, 'id': 12, 'ws': True, 'disabled': False}, {'text': 'or', 'start': 72, 'end': 74, 'id': 13, 'ws': True, 'disabled': False}, {'text': 'NEXUS', 'start': 75, 'end': 80, 'id': 14, 'ws': True, 'disabled': False}, {'text': 'programs', 'start': 81, 'end': 89, 'id': 15, 'ws': False, 'disabled': False}, {'text': '.', 'start': 89, 'end': 90, 'id': 16, 'ws': False, 'disabled': False}], '_view_id': 'relations', 'relations': , 'answer': 
'accept', '_timestamp': 1656535942}
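In the sample above, the first entry in 'spans' is {'label': 'OtherPrograms'} with no 'start' or 'end', which matches the "spans -> 0" validation messages. As a rough way to locate such entries, here is a minimal sketch in plain Python. It assumes the annotations have already been loaded as a list of dicts shaped like the sample; the function names find_incomplete_spans and drop_incomplete_spans are hypothetical helpers written for illustration, not part of Prodigy or spaCy.

```python
# Keys the 'ner' validation reported as missing in the error above.
REQUIRED_SPAN_KEYS = ("start", "end")


def find_incomplete_spans(examples):
    """Return (example_index, span_index, span) for spans lacking offsets.

    Hypothetical helper: `examples` is assumed to be a list of task dicts
    shaped like the sample data in this post.
    """
    problems = []
    for i, example in enumerate(examples):
        for j, span in enumerate(example.get("spans", [])):
            if any(key not in span for key in REQUIRED_SPAN_KEYS):
                problems.append((i, j, span))
    return problems


def drop_incomplete_spans(example):
    """Return a copy of `example` without spans that lack 'start' or 'end'.

    The input dict is not modified. The 'relations' key is left untouched,
    so any relation pointing at a removed span needs separate handling.
    """
    kept = [
        span
        for span in example.get("spans", [])
        if all(key in span for key in REQUIRED_SPAN_KEYS)
    ]
    return {**example, "spans": kept}
```

Running find_incomplete_spans over the full dataset would show how many examples are affected before deciding whether to re-annotate them or filter the spans out.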
After manually fixing the annotations, I tried training the model again and received this error on the relation side:
ValueError: Could not find gold transition - see logs above.