Question

How to filter attribute on Bytes vs String Data type

1 year ago
July 17, 2023
10 replies
64 views

+10

thijsknapen
Contributor
154 replies

Say I have the following data;

for which I can see the following information in the Feature Information window;

feature 1; text (bytes): 48656C6C6F
feature 2; text (string: UTF-8): World

Is there a way that I can filter the features on the basis of the indicated data type (i.e. 'bytes' vs 'string (UTF-8)'?

lgrie
Contributor
19 replies
1 year ago
July 17, 2023

You could use a TestFilter.

If you select "Type Is" as an operator you can filter all strings.

lgrie
Contributor
19 replies
1 year ago
July 17, 2023

lgrie wrote:

You could use a TestFilter.

If you select "Type Is" as an operator you can filter all strings.

You could also use "Encodable In" and select "UTF-8" in the options.

+10

thijsknapen
Author
Contributor
154 replies
1 year ago
July 17, 2023

lgrie wrote:

You could use a TestFilter.

If you select "Type Is" as an operator you can filter all strings.

Hi @lgrie

Thanks for the response. I already tried those two options. Unfortunately this doesn't work (both features pass these tests);

lgrie
Contributor
19 replies
1 year ago
July 17, 2023

thijsknapen wrote:

Hi @lgrie

Thanks for the response. I already tried those two options. Unfortunately this doesn't work (both features pass these tests);

Can you provide a sample dataset ?

Does the string attribut contain any numbers ? if not, you could use a RegEx to filter.

+10

thijsknapen
Author
Contributor
154 replies
1 year ago
July 17, 2023

thijsknapen wrote:

Hi @lgrie

Thanks for the response. I already tried those two options. Unfortunately this doesn't work (both features pass these tests);

Sure, it's now added to the main ticket/question.

lgrie
Contributor
19 replies
1 year ago
July 17, 2023

thijsknapen wrote:

Hi @lgrie

Thanks for the response. I already tried those two options. Unfortunately this doesn't work (both features pass these tests);

I cannot open it, sorry.

+10

thijsknapen
Author
Contributor
154 replies
1 year ago
July 17, 2023

thijsknapen wrote:

Hi @lgrie

Thanks for the response. I already tried those two options. Unfortunately this doesn't work (both features pass these tests);

Hmm, strange. Why not?

Did you use the 'FME Feature Store (FFS)' reader?

If I re-download the file (zipped FFS), I can successfully read/inspect it. (on FME 2022.1.0.0 - Build 22618 - WIN64)

+39

ebygomm
Influencer
3306 replies
1 year ago
July 17, 2023

Not sure how reliable this method is, it works for your test data.

In FME, copy the attribute to a new value, use the AttributeEncoder with Incoming Attribute parameter set to "Use Bytes", tester to check if the encoded attribute is different from the original attribute

Python

import fme
import fmeobjects
 
def FeatureProcessor(feature):
    data = feature.getAttribute("text")
    try:
        data = data.decode()
        feature.setAttribute("datatype","bytes")
    except (UnicodeDecodeError,AttributeError):
        feature.setAttribute("datatype","string")

+10

thijsknapen
Author
Contributor
154 replies
1 year ago
August 2, 2023

ebygomm wrote:

Not sure how reliable this method is, it works for your test data.

Python

import fme
import fmeobjects
 
def FeatureProcessor(feature):
    data = feature.getAttribute("text")
    try:
        data = data.decode()
        feature.setAttribute("datatype","bytes")
    except (UnicodeDecodeError,AttributeError):
        feature.setAttribute("datatype","string")

Hi @ebygomm ,

Bit late, but thanks for the reply! That's a creative solution that will definitly work in most cases.

That said, in my usecase I am a bit hesitant to clone the attribute, as the encoded attributes (the bytes), can be quite sizeable (your Python solution may help there).

Nothing to do with your solution, but I still feel it's quite odd that the Feature Information window the data type of the attributes, whereas it's not possible to fetch/use that information in Workbench.

If for instance I would have the same value '48656C6C6F', but once as 'bytes' and once as 'string: UTF-8', it seems that they are indistinguishable for Transformers/functions in Workbench, whereas in the Feature Information window you can see what is what. I admit this is probably a theoretical case, but wouldn't it be much easier to be able to leverage the information that is seemingly stored on some level by FME?

1 Attachments

Sample_Dataset_FT_with_String_and_Bytes_features_v2.zip

+10

thijsknapen
Author
Contributor
154 replies
1 year ago
August 2, 2023

ebygomm wrote:

Not sure how reliable this method is, it works for your test data.

Python

import fme
import fmeobjects
 
def FeatureProcessor(feature):
    data = feature.getAttribute("text")
    try:
        data = data.decode()
        feature.setAttribute("datatype","bytes")
    except (UnicodeDecodeError,AttributeError):
        feature.setAttribute("datatype","string")

Update, I created the following idea; AC Idea: Formalize 'bytes' as a Data Type (safe.com)

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

How to filter attribute on Bytes vs String Data type

1 Attachments

10 replies

1 Attachments

Reply

Helpful Members This Week

Recently Solved Questions

How to restart a REST Server in ArcGIS Server?

Remove last CR/LF from a CSV

1019 error with change detector and polygons

Where is the "Show Bookmark Navigator" option in FME 2024.2?

How to dynamically write new or update existing ArcGIS Online Feature Layers.

Community Stats

Latest FME

Cookie policy

Cookie settings

1 Attachments

1 Attachments

Reply

Related Topics

Extracting a nested dynamically-sized array within a JSON fileicon

Extracting values from a Double Nested JSON array valuesicon

Extracting nested multipick questions from a JSON form and writing to CSV.icon

Need help Fragmenting JSON File with Nested Objects and Arraysicon

DataVirtualizationJSONListicon

Helpful Members This Week

Recently Solved Questions

How to restart a REST Server in ArcGIS Server?

Remove last CR/LF from a CSV

1019 error with change detector and polygons

Where is the "Show Bookmark Navigator" option in FME 2024.2?

How to dynamically write new or update existing ArcGIS Online Feature Layers.

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings