Demographics Entity Extraction from clinical trail eligibility criteria

Hi! It's difficult to give a definitive answer because the approach that works best will depend on your data, how you break down the demographics you want to extract into categories etc. Maybe in this case, you want to experiment with doing some manual annotation first (maybe with patterns to help you and pre-select entities for you), and then train a separate entity recognizer. The usage guide on NER should be a good place to start:

If you haven't seen it already, also check out the medical tag on the forum for discussion related to training models for biomedical use cases: Topics tagged medical

Also, this is a recent project published by researchers at Oxford, and it's built on top of spaCy and trained on data annotated with Prodigy. They published a detailed blog post and a paper the approaches they chose and the different considerations. So if you haven't seen this yet, it's definitely an interesting read and should be pretty relevant to you.