Create baseline metrics based on manual NER annotations

It seems like the following post is dealing with the same question:

I'm trying this now.