Size of the raw text in the source file

Hi! I just answered a similar question here:

The main thing that's important is that your training and runtime inputs should match. So if you're training on single pargraphs, your model should also be run on single paragraphs. But if you have control over the preprocessing, that's usually no problem.