Hi
I want to train a spaCy NER model with Prodigy, but I have a conceptual question first. In the end, I am only interested in person names and organization names in a text, which I want spaCy to pick out. When I train spaCy, do I have to label placeholders for names too, or only the actual names in the text?
For example:
1a. Mark Zuckerberg decided to invest in AI.
1b. The Facebook-CEO decided to invest in AI.
In 1a, I would label "Mark Zuckerberg" as a person, but do I also label "Facebook-CEO" as a person? Strictly speaking it refers to a person too, but I am only interested in the names in the end. Will spaCy be confused if I label only the names and not placeholders like "CEO"? Or should I label every expression that refers to a person in the sentence and filter out the non-names when I process the results?
Same problem with the organizations. Example:
2a. Facebook invests in promising startups.
2b. The company invests in promising startups.
Same here. In the end, I need "Facebook" for further processing and not "the company". Will it hurt spaCy's results if I label only the names when I train the model?
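To make the two options concrete, here is a rough sketch of how I imagine the annotations for the four example sentences would look. I am assuming the character-offset format `(text, {"entities": [(start, end, label)]})` and the labels `PERSON` and `ORG`; the `annotate` helper is just something I made up for illustration and is not part of spaCy or Prodigy.

```python
# Hypothetical helper (not a spaCy/Prodigy function): turns substrings into
# character-offset entity annotations for one sentence.
def annotate(text, spans):
    entities = []
    for phrase, label in spans:
        start = text.find(phrase)  # first occurrence only, enough for this sketch
        if start == -1:
            raise ValueError(f"{phrase!r} not found in {text!r}")
        entities.append((start, start + len(phrase), label))
    return (text, {"entities": entities})


# Option A: label only the actual names; placeholders stay unlabeled.
# (Assumption: nothing at all is labeled in 1b and 2b in this sketch.)
names_only = [
    annotate("Mark Zuckerberg decided to invest in AI.", [("Mark Zuckerberg", "PERSON")]),
    annotate("The Facebook-CEO decided to invest in AI.", []),
    annotate("Facebook invests in promising startups.", [("Facebook", "ORG")]),
    annotate("The company invests in promising startups.", []),
]

# Option B: label placeholders as well and filter them out afterwards.
names_and_placeholders = [
    annotate("Mark Zuckerberg decided to invest in AI.", [("Mark Zuckerberg", "PERSON")]),
    annotate("The Facebook-CEO decided to invest in AI.", [("Facebook-CEO", "PERSON")]),
    annotate("Facebook invests in promising startups.", [("Facebook", "ORG")]),
    annotate("The company invests in promising startups.", [("The company", "ORG")]),
]

for text, annotations in names_only:
    print(text, annotations["entities"])
```

My question is basically whether training on something like `names_only` is fine, or whether the model needs something like `names_and_placeholders` to learn properly.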
Thanks for your help.