As always, very informative! You are right: I only have around 150,000 tokens. Many thanks. Supposing I find another corpus, can I use the Prodigy commands instead of the scripts for pre-processing?
Is there any other way to use sense2vec in combination with NER to expand and improve my entities?