I would like to know how experts perform in NLP workflow.
I have a project with pdf files. In each document, I would like to perform a NER extraction on the name and reason for resign. In order to have the training set, I have use my own code to separate the pdf into sentences (with the use of spacy) and put each sentence into prodigy for labeling and training.
My question is
1.) Should I use a long paragraph/page instead of sentence for labeling? As some of the sentences are not complete sentences.
2.) Should I use long paragraph/ page to run with the model (mostly trained by sentences not long paragraph/ page).
Thank you for any comments/ recommendations