Question

Serbian Cyrillic to Latin

  • 26 February 2016
  • 2 replies
  • 5 views

I have OSM data with both Cyrillic and Latin Serbian characters. I want to convert it all to Latin. So, ? ? ? would become ? ? dž. I tried the TextEncoder, AttributeEncoder, and PythonCaller, but without success.

Can somebody help me with this?


2 replies

What is the encoding of the incoming attribute(s) that hold the Cyrillic characters? UTF-8?

I don't know what you did with the PythonCaller, by the way, but you could consider changing the Python Interpreter to 3.4+ if you haven't done so already. Strings in Python 3 are Unicode by default, so the latest Python 3 versions are less problematic with character encoding/decoding.
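
For the PythonCaller route, here is a minimal Python 3 sketch of the character mapping itself. It is plain Python rather than FME API code, the function name `cyrillic_to_latin` is made up for illustration, and wiring it into a PythonCaller is left out. It assumes the attribute value is already a Unicode string. Uppercase digraphs come out in title case (Љ becomes Lj), so all-caps words would need extra handling.

```python
# Lowercase Serbian Cyrillic -> Latin; three letters map to digraphs (lj, nj, dž).
_LOWER = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ",
    "е": "e", "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k",
    "л": "l", "љ": "lj", "м": "m", "н": "n", "њ": "nj", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "ћ": "ć", "у": "u",
    "ф": "f", "х": "h", "ц": "c", "ч": "č", "џ": "dž", "ш": "š",
}

# Add uppercase entries; capitalize() turns "dž" into "Dž", "lj" into "Lj", etc.
_TABLE = dict(_LOWER)
for cyr, lat in _LOWER.items():
    _TABLE[cyr.upper()] = lat.capitalize()

_TRANS = str.maketrans(_TABLE)


def cyrillic_to_latin(text):
    """Replace Serbian Cyrillic letters with Latin ones; other characters pass through."""
    return text.translate(_TRANS)
```

Text that is already Latin is returned unchanged, so the same function can be applied to mixed OSM values.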

Hi @aleksandar,

your data must be in a Unicode encoding such as UTF-8 to preserve both Cyrillic and Latin Serbian characters. And I guess you would like it to be saved in Win-1250.

I would suggest replacing ? ? ? with ? ? dž using the StringPairReplacer first. After this, it should be possible to save the data in Win-1250 encoding without any extra steps (i.e. the TextEncoder or AttributeEncoder shouldn't be needed anymore, as the Writer will deal with the encoding).
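
To see whether a value is ready for a Win-1250 writer, one can check which characters the codec cannot represent. A rough Python 3 sketch follows; `unencodable_chars` is a hypothetical helper, not part of FME, and `cp1250` is Python's codec name for Windows-1250.

```python
def unencodable_chars(text, encoding="cp1250"):
    """Return the sorted distinct characters of text that the encoding cannot represent."""
    bad = set()
    for ch in text:
        try:
            ch.encode(encoding)
        except UnicodeEncodeError:
            # This character has no byte in the target code page.
            bad.add(ch)
    return sorted(bad)
```

An empty list means the string should encode cleanly; any Cyrillic letters that were not replaced show up in the result.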
