Synonyms

I hope I understand the question correctly. But I think you probably want to focus on training your model to recognise DRUG (any drug) until it's reasonably good at it. You can then add a rule-based component on top later that normalises them and groups them into subtypes. I actually outlined a very similar approach here: