Not quite sure how to get the json, but let me show you what is stored in the database. I first launched the 2 annotation tasks and saved the results in the tables sentence_boundary
and sentence_boundary2
.
I then ran the review recipe pointing to those 2 tables, saving the outputs in sentence_boundary_master
.
Here is what is stored in the table sentence_boundary
(using the prodigy db python api):
[{'html': 'Scientific methods<span style="background-color: #FFFF00">.</span> Genentech\'s steady focus on research yields a plump bottom line.',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': -1768287807,
'_session_id': None,
'_view_id': 'html',
'answer': 'accept'},
{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -1197153440,
'_session_id': None,
'_view_id': 'html',
'answer': 'accept'}]
Here is what is stored in sentence_boundary2
:
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': -1768287807,
'_session_id': None,
'_view_id': 'html',
'answer': 'reject'},
{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -1197153440,
'_session_id': None,
'_view_id': 'html',
'answer': 'reject'}]
And here is what is stored in sentence_boundary_master
:
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': -1997274244,
'_session_id': None,
'_view_id': 'review',
'answer': 'accept',
'sessions': ['sentence_boundary'],
'versions': [{'html': 'Scientific methods<span style="background-color: #FFFF00">.</span> Genentech\'s steady focus on research yields a plump bottom line.',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': -1997274244,
'_session_id': 'sentence_boundary',
'_view_id': 'html',
'answer': 'accept',
'sessions': ['sentence_boundary'],
'default': True}],
'view_id': 'html'},
{'html': 'Scientific methods<span style="background-color: #FFFF00">.</span> Genentech\'s steady focus on research yields a plump bottom line.',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': 1330951531,
'_session_id': None,
'_view_id': 'review',
'answer': 'accept',
'sessions': ['sentence_boundary2'],
'versions': [{'html': 'Scientific methods<span style="background-color: #FFFF00">.</span> Genentech\'s steady focus on research yields a plump bottom line.',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 18,
'boundary_char': '.',
'source_id': 'PM17380983_0',
'_input_hash': 2086794253,
'_task_hash': 1330951531,
'_session_id': 'sentence_boundary2',
'_view_id': 'html',
'answer': 'reject',
'sessions': ['sentence_boundary2'],
'default': True}],
'view_id': 'html'},
{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -1793334956,
'_session_id': None,
'_view_id': 'review',
'answer': 'accept',
'sessions': ['sentence_boundary'],
'versions': [{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -1793334956,
'_session_id': 'sentence_boundary',
'_view_id': 'html',
'answer': 'accept',
'sessions': ['sentence_boundary'],
'default': True}],
'view_id': 'html'},
{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -555819029,
'_session_id': None,
'_view_id': 'review',
'answer': 'accept',
'sessions': ['sentence_boundary2'],
'versions': [{'html': 'Scientific methods. Genentech\'s steady focus on research yields a plump bottom line<span style="background-color: #FFFF00">.</span>',
'text': "Scientific methods. Genentech's steady focus on research yields a plump bottom line.",
'boundary_index': 83,
'boundary_char': '.',
'source_id': 'PM17380983_1',
'_input_hash': -1930479033,
'_task_hash': -555819029,
'_session_id': 'sentence_boundary2',
'_view_id': 'html',
'answer': 'reject',
'sessions': ['sentence_boundary2'],
'default': True}],
'view_id': 'html'}]