feature request: pre-trained model evaluation recipe

Yes, that's a good point!

For NER, here's a custom recipe written by a user – this is pretty much what you're looking for, right?
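
In case it helps while you look at that, here is a rough sketch of the scoring step an NER evaluation usually boils down to. It is not the user's recipe and not a built-in API: the function name `evaluate_spans` and the `(start, end, label)` span format are assumptions made for illustration, and it only counts exact matches on offsets and label.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical span format: (start_char, end_char, label).
Span = Tuple[int, int, str]


def evaluate_spans(
    gold_docs: Iterable[List[Span]], pred_docs: Iterable[List[Span]]
) -> Dict[str, float]:
    """Exact-match precision, recall and F1 over entity spans.

    gold_docs and pred_docs must be aligned: the i-th item of each
    holds the spans for the same text.
    """
    tp = fp = fn = 0
    for gold, pred in zip(gold_docs, pred_docs):
        gold_set, pred_set = set(gold), set(pred)
        tp += len(gold_set & pred_set)   # predicted and correct
        fp += len(pred_set - gold_set)   # predicted but not in gold
        fn += len(gold_set - pred_set)   # in gold but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up example data: two texts, one wrong label in the first.
gold = [[(0, 5, "PERSON"), (10, 16, "ORG")], [(3, 9, "GPE")]]
pred = [[(0, 5, "PERSON"), (10, 16, "GPE")], [(3, 9, "GPE")]]
scores = evaluate_spans(gold, pred)
print(scores)
```

You'd fill `pred` with whatever spans your pre-trained model produces for each text and `gold` with your annotated spans. Exact matching is the strictest option; partial-overlap scoring would need a different comparison.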