Solved

Problem with Swedish characters when writing to Esri geodatabase

  • January 12, 2016
  • 4 replies
  • 101 views

keoe
Contributor

I have a workspace where I read a shape file, clip the data, and write it to an Esri geodatabase with the Esri Geodatabase (ArcSDE Geodb) writer. The attributes in the shape file contain Swedish characters. The characters are correct when I read the shape file in ArcGIS, but when I write the data to the geodatabase the Swedish characters are replaced. How can I handle that? I have tried changing the character encoding on the shape file writer to iso-8859-1 (which contains the Swedish characters).

I'm attaching screenshots from ArcGIS when reading the shape file (how it should look) and from ArcGIS when reading the geodatabase.

4 replies

david_r
Celebrity
  • Best Answer
  • January 12, 2016

It's always difficult to debug these issues remotely, but I agree it is probably an encoding issue.

Two things to try:

  • On the shape file reader, also try the "Windows Latin1 ANSI (1252)" encoding.
  • Just before the writer, insert a Logger or an Inspector and look at the characters and the encoding that FME reports.

David


keoe
Contributor
  • Author
  • January 12, 2016

I tried the Windows Latin1 ANSI (1252) encoding. The Logger I inserted before the writer gives me:

Attribute(encoded: windows-1252): `Lan' has value `™sterg”tlands l„n'

FME reports the same encoding, but I get the same attribute value as before.


david_r
Celebrity
  • January 12, 2016

What happens if you set the Shape file reader encoding to Unicode UTF-8?

Basically, you need to experiment with the reader encodings until it looks right in the Logger.

In most cases, it is either Latin1 or Unicode. If your dataset is really old it might be worth trying "DOS Nordic (ibm865)" as well.
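
The same trial and error can be done outside FME on the value from the Logger. The following is a minimal Python sketch, not an FME feature: it assumes the logged text is the result of decoding the file's bytes as windows-1252, reverses that step, and retries a few candidate encodings. The function name `try_decodings` and the candidate list are illustrative, and the codec names are Python's, not FME's labels.

```python
# Value copied from the Logger output, written with escapes so that the
# encoding of this script cannot garble it again:
# U+2122 (trade mark sign), U+201D (right double quote), U+201E (low double quote).
LOGGED = "\u2122sterg\u201dtlands l\u201en"

# Candidate source encodings (Python codec names; an assumption, adjust as needed).
CANDIDATES = ["utf-8", "latin-1", "cp1252", "cp865"]


def try_decodings(garbled, assumed="cp1252", candidates=CANDIDATES):
    """Undo the assumed wrong decode, then retry each candidate encoding."""
    raw = garbled.encode(assumed)  # back to the bytes stored in the file
    results = {}
    for name in candidates:
        try:
            results[name] = raw.decode(name)
        except UnicodeDecodeError:
            results[name] = None  # the bytes are not valid in this encoding
    return results


if __name__ == "__main__":
    for name, text in try_decodings(LOGGED).items():
        print(f"{name:8} {text!r}")
```

Only one candidate should produce a readable Swedish place name; that is the encoding to select on the reader.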


keoe
Contributor
  • Author
  • January 12, 2016

Thank you for your help! It works fine with DOS Nordic.
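
For anyone wondering why DOS Nordic was the answer, the garbled value in the Logger can be reproduced in a few lines of Python. This is only an illustrative sketch: it assumes the intended attribute value is "Östergötlands län" and that Python's `cp865` and `cp1252` codecs correspond to FME's "DOS Nordic (ibm865)" and "Windows Latin1 ANSI (1252)". The helper `misread` is a made-up name.

```python
# Assumed intended value, "Östergötlands län", written with escapes.
EXPECTED = "\u00d6sterg\u00f6tlands l\u00e4n"


def misread(text, stored_as="cp865", read_as="cp1252"):
    """Encode text as it is stored, then decode it with the wrong code page."""
    return text.encode(stored_as).decode(read_as)


# Bytes as a DOS Nordic file would store them: the Swedish letters
# become 0x99, 0x94 and 0x84.
print(EXPECTED.encode("cp865"))

# The same bytes interpreted as windows-1252 give the value seen in the Logger.
print(misread(EXPECTED))

# Windows-1252 stores the same letters as different bytes (0xD6, 0xF6, 0xE4),
# which is why the reader setting matters.
print(EXPECTED.encode("cp1252"))
```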