What does the "id" field in the "tokens" mean?

The input and task hashes are unique IDs that help Prodigy identify annotations that apply to the same input text. My comment on this thread explains this in more detail. You can also find more information on the hashing functions in your PRODIGY_README.html, available for download with the Prodigy package.
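
To make the distinction between the two hashes more concrete, here is a minimal sketch of the idea. It is not Prodigy's actual hashing code: the helper names `sketch_hash` and `add_sketch_hashes`, the use of MD5 and the choice of keys are all assumptions made for illustration. Only the `_input_hash` and `_task_hash` key names are taken from Prodigy's task format. See your PRODIGY_README.html for the real hashing functions.

```python
import hashlib
import json


def sketch_hash(task, keys):
    """Hash only the selected keys of a task dict.

    Illustrative only: Prodigy's own hashing works differently in detail.
    """
    subset = {key: task[key] for key in keys if key in task}
    payload = json.dumps(subset, sort_keys=True).encode("utf8")
    return hashlib.md5(payload).hexdigest()[:8]


def add_sketch_hashes(task):
    """Return a copy of the task with an input hash and a task hash added."""
    task = dict(task)
    # The input hash depends only on the raw input (here: the text).
    task["_input_hash"] = sketch_hash(task, ("text",))
    # The task hash also depends on what is being asked about that input.
    task["_task_hash"] = sketch_hash(task, ("text", "spans", "label"))
    return task


a = add_sketch_hashes({"text": "Apple is a company", "label": "ORG"})
b = add_sketch_hashes({"text": "Apple is a company", "label": "PRODUCT"})

print(a["_input_hash"] == b["_input_hash"])  # True: same input text
print(a["_task_hash"] == b["_task_hash"])    # False: different question about it
```

So two annotations on the same text share an input hash, while the task hash tells apart different questions asked about that text.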

I'm not 100% sure I understand the question correctly. The output data you're exporting with prodigy db-out contains all annotations stored in the database that have been labelled in the web app. So if you load in more texts, annotate them and then save them to a dataset, they should be included in the exported data.
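
If it helps, here is a rough sketch of how you could inspect exported data once you have it as JSONL (one JSON object per line). The records and hash values below are made up for illustration, and `group_by_input` is a hypothetical helper, not part of Prodigy. In practice you would read the lines from your exported file instead of the inline string.

```python
import json
from collections import defaultdict

# Made-up sample records standing in for an exported dataset.
exported = """\
{"text": "Apple is a company", "label": "ORG", "answer": "accept", "_input_hash": 1, "_task_hash": 10}
{"text": "Apple is a company", "label": "PRODUCT", "answer": "reject", "_input_hash": 1, "_task_hash": 11}
{"text": "I like pears", "label": "ORG", "answer": "reject", "_input_hash": 2, "_task_hash": 20}
"""


def group_by_input(jsonl_text):
    """Group annotation records by the input they refer to."""
    groups = defaultdict(list)
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        groups[record["_input_hash"]].append(record)
    return dict(groups)


groups = group_by_input(exported)
for input_hash, records in groups.items():
    answers = [(r["label"], r["answer"]) for r in records]
    print(input_hash, records[0]["text"], answers)
```

Any newly annotated texts that were saved to the dataset would simply show up as additional lines, and therefore additional groups, in the export.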

If you haven't seen it already, you might also want to check out the "First steps" guide. It explains the most important terms and concepts, and shows a simple Prodigy workflow.