Hi,
I've annotated two identical datasets containing 5,000 texts. I annotated a different non-overlapping entity type on each respective iteration of the 5,000 texts.
I'm now wanting to merge the datasets to have a combined set of 5,000 texts with both entity labels applied.
When I use db-merge, the resulting dataset seems to just be the 5,000 texts x 2 i.e. it's been appended to itself.
I assumed the input hashes might have changed somewhere between annotating the two sets, so I tried db-out'ing the datasets, then db-in'ing them with a new name and applying -R to force reshash and then combining them. This did not work either.
Is there a way to combine my datasets without having to manually add the second entity label to one of the labelled sets?
FYI, I have six entities that I would like to apply in total, but having completed the first two I wanted to test the theory before continuing.
Thanks in advance!
Darren