Entity Linker related questions

Hi!

Great to hear the video tutorial's been useful to you!

What isn't clear to me from the explanation you've given, is the following:

  • Is there additional context to your sentences, or are they really only just containing the tokens "Starship Enterprise ML 09 55799" etc?

  • What do the entities look like in the knowledge base that you want to link to? Is there just one entity "Starship Enterprise", or are there different ones according to the built year / tonnage?

The current implementation of the entity linker really depends on the words in the sentence and compares those to a reference description you've given to each entity in your knowledge base.

Should I pre-process the numbers and replaces them with tokens which are more meaningful to the model?

Well it depends - are the numbers meaningful by themselves? If the built year is important to decide on the final entity ID you would assign, then it would be better to just leave it.

Is there good approach when there is disbalance lots of numbers and fewer words in text examples?

I don't think this matters much. If the numbers are not meaningful, you'd hope that the model will learn to ignore them.

But as always, the proof is in the pudding :wink: