Question

TextEncoder failing to encode Roman numbers and special characters

  • 4 January 2024

Hi all,

 

I need to extract the hex code points of several letters using the TextEncoder.

However, the transformer fails to encode Roman numbers and some special characters. As the screenshot below shows, it successfully encodes Bengali letters and some special characters, but leaves the others unencoded.

Does anyone have an idea how to solve this, or know of any workarounds? Thanks!

 

image


1 reply


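The documentation, under the "Unicode" heading, says: "All characters with standard ASCII representations are not encoded." In this context, that seems to mean 7-bit ASCII, which would explain why the Roman characters and common symbols pass through unchanged while the Bengali letters are encoded.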

If you want to encode these characters as well, you can encode them to HEX and optionally prefix the result with "\\u00" to get the corresponding Unicode code point. For example, "A" encodes to 41 in HEX, which corresponds to code point U+0041.
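To illustrate the idea outside of the transformer, here is a minimal Python sketch. It is not the TextEncoder's own logic, and the function names `hex_code_point` and `to_escapes` are made up for this example. It assumes you just want the code point of each character as four hex digits, whether or not the character is ASCII.

```python
def hex_code_point(ch: str) -> str:
    # Code point of a single character as (at least) four uppercase hex digits.
    return f"{ord(ch):04X}"


def to_escapes(text: str) -> str:
    # One backslash-u escape per character (hypothetical helper for illustration).
    return "".join("\\u" + hex_code_point(ch) for ch in text)


# For a 7-bit ASCII character, the HEX of its single byte padded with "00"
# is the same value as its code point.
assert "00" + "A".encode("ascii").hex().upper() == hex_code_point("A")

print(hex_code_point("A"))   # 0041
print(to_escapes("IV"))      # \u0049\u0056
print(hex_code_point("ক"))   # 0995 (Bengali letter KA)
```

Note that characters outside the Basic Multilingual Plane have code points longer than four hex digits, so the fixed "00" prefix only works for the ASCII case described above.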
