FME encoded text (internals)

Question

Not really a question, but feel free to add comments or thoughts below.

We are implementing pre-commit hooks and centralized Python linters in our git workflows, and for this we need to extract all Python code from workspace files pushed to git. Since we do not want to require a full FME installation for this, we’re parsing the .fmw files as text files using Python and extracting all the code blocks within, before passing them on to the linter as if they were stand-alone Python scripts.
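A minimal sketch of the last step described above, where each extracted block is treated as a stand-alone script. It uses `ast.parse` from the standard library purely as a stand-in for a real linter. `check_block` and the label format are hypothetical names, and the source is assumed to be already decoded.

```python
import ast


def check_block(source: str, label: str) -> list[str]:
    """Return a list of syntax problems found in one extracted code block."""
    try:
        # Parse only; nothing from the workspace is executed.
        ast.parse(source, filename=label)
    except SyntaxError as exc:
        return [f"{label}:{exc.lineno}: {exc.msg}"]
    return []
```

A real hook would hand the same decoded text to whichever linter the team has standardized on. The label only needs to identify which workspace and transformer the block came from.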

Because code blocks are encoded in a somewhat non-standard way, I’m sharing a Python function that converts FME-encoded strings to regular text without any dependencies: no FME installation (which would allow using FMESession.decodeFromFMEParsableText) and no third-party libraries.

It can decode FME-encoded strings in either of these formats:

import&lt;space&gt;fme&lt;lf&gt;import&lt;space&gt;fmeobjects&lt;lf&gt;import&lt;space&gt;json&lt;lf&gt;&lt;lf&gt;&lt;lf&gt;class&lt;space&gt;

or

<opencurly><quote>automation.id<quote>:<quote>839b0966-82ed-4fdf-8539-d95e72edf52e<quote><comma><quote>job.timeSubmitted<quote>...
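Judging from the two examples, the first format is the second one with its angle brackets XML-escaped. A small sketch, using only the standard library, of how `xml.sax.saxutils.unescape` reduces the first form to the second so that one tag table can serve both (the sample string is shortened from the example above):

```python
from xml.sax.saxutils import unescape

escaped = "import&lt;space&gt;fme&lt;lf&gt;import&lt;space&gt;json"

# unescape() turns &lt; / &gt; / &amp; back into < / > / &
plain = unescape(escaped)

print(plain)  # import<space>fme<lf>import<space>json
```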

Here’s the code. It can be imported as a module or run as a script from the command line:

from xml.sax.saxutils import unescape
import re


def decode_from_fme_parsable_text(encoded: str | None) -> str:
    """
    Decodes strings encoded in FME internal format using
    proprietary XML-like tags, and possibly also using
    encoded tag characters.

    :param encoded: encoded input string
    :return: decoded input string, empty if input is not a string
    """

    if not isinstance(encoded, str):
        return ""
    else:
        decoded = (
            unescape(encoded)
            .replace("<lt>", "<")
            .replace("<gt>", ">")
            .replace("<quote>", '"')
            .replace("<amp>", "&")
            .replace("<backslash>", "\\")
            .replace("<solidus>", "/")
            .replace("<apos>", "'")
            .replace("<dollar>", "$")
            .replace("<at>", "@")
            .replace("<space>", " ")
            .replace("<comma>", ",")
            .replace("<openparen>", "(")
            .replace("<closeparen>", ")")
            .replace("<openbracket>", "[")
            .replace("<closebracket>", "]")
            .replace("<opencurly>", "{")
            .replace("<closecurly>", "}")
            .replace("<semicolon>", ";")
            .replace("<cr>", "\r")
            .replace("<lf>", "\n")
            .replace("<tab>", "\t")
            .replace("<bell>", "\a")
            .replace("<backspace>", "\b")
            .replace("<verttab>", "\v")
            .replace("<formfeed>", "\f")
        )

        # Decode extended characters beyond 7-bit ASCII (<uXXXX>, case-insensitive)
        decoded = re.sub(
            "<u([0-9a-f]{4})>",
            lambda match: chr(int(match.group(1), 16)),
            decoded,
            flags=re.I,
        )

        return decoded


if __name__ == "__main__":
    str_encoded = input("FME encoded string: ")
    print("Result:")
    print(decode_from_fme_parsable_text(str_encoded))
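One edge case worth knowing about: the chained `.replace()` calls re-scan text that an earlier call has already decoded. Going by the function's own tag table, a literal `<space>` in the source would be encoded as `<lt>space<gt>`. That first decodes to `<space>` and is then turned into a blank by the later `<space>` replacement. Below is a sketch of a single-pass variant that avoids this by never looking at its own output. `decode_single_pass`, `FME_TAGS` and the helper names are my own, it assumes the same tag table as above, and it skips the non-string guard for brevity.

```python
import re
from xml.sax.saxutils import unescape

# Same tag table as the function above, held as data.
FME_TAGS = {
    "lt": "<", "gt": ">", "quote": '"', "amp": "&", "backslash": "\\",
    "solidus": "/", "apos": "'", "dollar": "$", "at": "@", "space": " ",
    "comma": ",", "openparen": "(", "closeparen": ")", "openbracket": "[",
    "closebracket": "]", "opencurly": "{", "closecurly": "}",
    "semicolon": ";", "cr": "\r", "lf": "\n", "tab": "\t", "bell": "\a",
    "backspace": "\b", "verttab": "\v", "formfeed": "\f",
}

# One pattern matching either a <uXXXX> escape or one of the named tags.
_TAG_RE = re.compile(
    r"<(?:u([0-9a-fA-F]{4})|(" + "|".join(FME_TAGS) + r"))>"
)


def _replace_tag(match: re.Match) -> str:
    hex_digits, name = match.groups()
    if hex_digits is not None:
        return chr(int(hex_digits, 16))
    return FME_TAGS[name]


def decode_single_pass(encoded: str) -> str:
    """Decode all tags in one left-to-right pass over the unescaped input."""
    return _TAG_RE.sub(_replace_tag, unescape(encoded))
```

With this variant, `decode_single_pass("<lt>space<gt>")` gives back the literal text `<space>`, while ordinary input such as `import&lt;space&gt;fme` still decodes to `import fme`.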

Source: http://docs.safe.com/fme/2013sp1/pdf/FMEQuickTranslator.pdf, pages 48-49.

I’m assuming this hasn’t changed much since 2013, but feel free to correct me :-)
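One way to test that assumption against your own workspaces is to scan encoded strings for tag-like tokens that the table does not cover. This is only a sketch. `find_unknown_tags` is a hypothetical helper, and it assumes that any newer tags would follow the same `<name>` shape and that literal angle brackets are always encoded as `<lt>` and `<gt>`.

```python
import re
from xml.sax.saxutils import unescape

# Tag names handled by the decoder above (taken from the 2013 documentation).
KNOWN_TAGS = {
    "lt", "gt", "quote", "amp", "backslash", "solidus", "apos", "dollar",
    "at", "space", "comma", "openparen", "closeparen", "openbracket",
    "closebracket", "opencurly", "closecurly", "semicolon", "cr", "lf",
    "tab", "bell", "backspace", "verttab", "formfeed",
}

# <uXXXX> escapes are handled separately, so they are not "unknown".
_UNICODE_TAG = re.compile(r"u[0-9a-f]{4}", re.I)


def find_unknown_tags(encoded: str) -> set[str]:
    """Return tag-like names in the input that the table does not cover."""
    names = re.findall(r"<([A-Za-z0-9]+)>", unescape(encoded))
    return {
        name for name in names
        if name not in KNOWN_TAGS and not _UNICODE_TAG.fullmatch(name)
    }
```

An empty result across a representative set of workspaces would suggest the 2013 table is still sufficient for them. Anything it reports is a candidate for a new `.replace()` entry.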

If this is useful to anyone, I’d love your feedback below.

nielsgerrits · Answer

Very useful to document this. I know I looked up my notes because I needed this right after the User Conf in Bonn😀
