I'm trying to convert a text to json, using my own script, so that I can annotate dependencies. The text is in ancient Greek and has different punctuations. So, I want to split the sentences using spacy' sentencizer in this simple way:
sentencizer = Sentencizer(punct_chars=[".", ";", "·"])
the period and semicolon work (question mark in Greek), but the "·" (raised dot) does not. It is ignored by the sentencizer. Any idea why this is so, and solutions?