Very new to nlp and spacy but excited to use these tools.
Looking for some help on how to create a pipeline where I can initially label a bunch of entities (PHONE NUMBER , ADDRESS, ORDER_ID.. others) using regex but then as a second step categorize the docs in a trained model using text.cat.
Few things I am unsure of - if the regex is performed before the ner - does the regex labeled entities influence the output of the statistical model? Is there a preferable pattern or example of how to do this? Finally what I am really wanting is the ability to extract all the entities , tabulate them with their categorical classification - is there a function that does this already ?