I have OSM data with both Cyrillic and Latin Serbian characters. I want to transliterate it all to Latin. So, ? ? ? would become ? ? dž. I tried TextEncoder, AttributeEncoder, and PythonCaller, but without success.
Can somebody help me with this?
What is the encoding of the incoming attribute(s) that hold the Cyrillic characters? UTF-8?
I don't know what you did with the PythonCaller, by the way, but consider switching the Python Interpreter to 3.4+ if you haven't already. Python 3 treats all strings as Unicode, so it is less problematic with character encoding/decoding.
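If you go the Python route, here is a minimal sketch of the transliteration itself in plain Python 3. The names `CYR_TO_LAT` and `to_latin` are made up for illustration, and the table is the standard Serbian Cyrillic to Latin mapping, not something taken from your data. I have left out the PythonCaller wiring; the idea would be to call a function like this on each attribute value.

```python
# Standard Serbian Cyrillic -> Latin mapping, lowercase only.
# Uppercase entries are derived from it when the table is built.
CYR_TO_LAT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ", "е": "e",
    "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k", "л": "l", "љ": "lj",
    "м": "m", "н": "n", "њ": "nj", "о": "o", "п": "p", "р": "r", "с": "s",
    "т": "t", "ћ": "ć", "у": "u", "ф": "f", "х": "h", "ц": "c", "ч": "č",
    "џ": "dž", "ш": "š",
}


def build_table(mapping):
    """Build a str.translate table covering lowercase and uppercase letters."""
    table = {}
    for cyr, lat in mapping.items():
        table[ord(cyr)] = lat
        # Uppercase digraphs are written in title case: Љ -> Lj, Џ -> Dž.
        table[ord(cyr.upper())] = lat.capitalize()
    return table


_TABLE = build_table(CYR_TO_LAT)


def to_latin(text):
    """Replace Serbian Cyrillic letters with Latin; other characters pass through."""
    return text.translate(_TABLE)


if __name__ == "__main__":
    print(to_latin("Novi Сад, Београд"))
```

One limitation of this sketch: a word written entirely in capitals gets title-case digraphs (Љ becomes Lj, not LJ), so all-caps names may need an extra step.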
Hi @aleksandar,
your data must be in a Unicode encoding (UTF-8, for example) to preserve both Cyrillic and Latin Serbian characters. And I guess you would like it to be saved in Win-1250.
I would suggest replacing ? ? ? with ? ? dž using StringPairReplacer first. After this it should be possible to save data in Win-1250 encoding without any extra steps (i.e. TextEncoder or AttributeEncoder shouldn't be needed anymore as the Writer will deal with the encoding).
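To check that nothing Cyrillic is left after the replacement, a quick test outside FME could look like the sketch below. It assumes Python 3 and uses Python's `cp1250` codec as the stand-in for Win-1250; the function name `not_in_win1250` is hypothetical.

```python
def not_in_win1250(text):
    """Return the distinct characters in text that Windows-1250 cannot represent."""
    bad = []
    for ch in text:
        try:
            ch.encode("cp1250")
        except UnicodeEncodeError:
            if ch not in bad:
                bad.append(ch)
    return bad


if __name__ == "__main__":
    # Latin Serbian letters fit in Windows-1250, so the list is empty.
    print(not_in_win1250("džak, đak, čaša"))
    # Cyrillic letters do not fit, so each one is reported.
    print(not_in_win1250("џак"))
```

An empty list means the value should survive being written as Win-1250; any character that is reported still needs a replacement pair.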