Use custom tokenizer in data-to-spacy recipe

Hi @alvaro.marlo!

Yes, it seems like you can pass the -F for a script with a tokenizer function. It seems to work even though it wasn't originally designed to do this:

Hope this helps!