ToText deprecated

Enric_Herrera · June 9, 2024, 4:30pm

u.name is a string property of a Class represented by “u”

Porting to the new API, the old code was:

elseif u.name.toText = "Blücher" then

Xojo does not accept changing it to:

elseif u.name.toString = "Blücher" then

If comparing the strings directly:

elseif u.name = "Blücher" then

the strings don’t match. This issue occurs after saving/loading from an SQLite database. Using toText again and ignoring the deprecated warning makes it work. This is likely an encoding conflict with the “ü” character probably…

TimStreater · June 9, 2024, 4:36pm

Perhaps not surprising as u.name is already a String.

How is the value of u.name set? What encoding shows up if you examine u.name in the debugger?

Enric_Herrera · June 9, 2024, 4:39pm

UTF-8

TimStreater · June 9, 2024, 5:03pm

Have you compared the bytes in your string with the bytes in u.name, by eye? In the debugger?

One might be:

U+00FC ü c3 bc LATIN SMALL LETTER U WITH DIAERESIS

the other might be:

U+0075 u 75 LATIN SMALL LETTER U
U+0308 ̈ cc 88 COMBINING DIAERESIS

so, two bytes (c3 bc) vs. three bytes (75 cc 88).

(Assuming this is what is meant by combining).

Enric_Herrera · June 9, 2024, 5:11pm

But, what does ToText that make this work?

u.name = "Blûcher" // no work
u.name.toText =  "Blûcher"  // works

TimStreater · June 9, 2024, 5:13pm

Text is not the same as String. The compiler probably converts your “Blücher” to Text automatically.

Anyway - have you compared them in the debugger?

Enric_Herrera · June 9, 2024, 5:17pm

the debugger shows the name correctly

TimStreater · June 9, 2024, 5:22pm

I suggested to you to compare the bytes.

But as I suspected, the bytes involved are 75 cc 88. These are the bytes from the database. Which will not compare to c3 bc.

Anyone know how to force this to work?

TimStreater · June 9, 2024, 6:01pm

Basically your problem seems to be that the data in the database uses non-composed characters, whereas Xojo is assuming composed characters.

The ü is not composed, in the database - it’s a simple u followed by a composing umlaut (so, two code points, 3 bytes), which will appear the same on-screen as the single code point ü (two bytes). How did the data get into the database?

Eric_Williams · June 9, 2024, 6:15pm

This may be your answer:

Enric_Herrera · June 9, 2024, 6:40pm

If I display the u.name, for example:

mylabel.text = u.name
it is showed “Blücher” ok, is when comparing that it not works

Enric_Herrera · June 9, 2024, 6:50pm

var  source as string  = "Blücher"
if u.name = source then
  break
end if

in the debbuger
Source shows in Text as “Blücher”
In Binary as

u.name show in Text as “Blücher”
but in binary

Eric_Williams · June 9, 2024, 6:53pm

Right, that’s exactly what Tim has been explaining: those two strings appear the same when printed to the screen, but the underlying bytes are different, and thus the string compare function isn’t behaving as expected.

Go follow that link that I posted and download the Unicode module. Normalize both of your strings and then compare them; you should find that they compare properly.

Eric_Williams · June 9, 2024, 6:54pm

Honestly I think this is a flaw in how strings are compared, whether that’s in the Unicode specification; the OS’s implementation; or Xojo’s internal implementation. Decomposed characters should equate to their composed representations. I don’t know why it is done this way.

Enric_Herrera · June 9, 2024, 6:59pm

Where is the Unicode module?, please.

Eric_Williams · June 9, 2024, 7:07pm

Enric_Herrera · June 10, 2024, 6:49am

May it be considered a bug?
Made the following test::

var enc as TextEncoding = u.name.Encoding
var  source as string  = "Blücher"
var enc2 as TextEncoding = source.Encoding

source = source.DefineEncoding(enc)

if u.name = source then
  break
end if

enc and enc2 show the same on the debbuger

Pasted Graphic

But the string comparetion fails

kevin_g · June 10, 2024, 6:55am

The STRING data type has never implemented all of the Unicode rules.

You could try the STRING Compare function to see if that helps but you may find that you need to log an enhancement request.

FYI. The TEXT data type should compare them correctly but that data type is no longer recommended.

Emile_Schwarz · June 10, 2024, 6:59am

From the documentation:

Enric_Herrera · June 10, 2024, 7:09am

This works

elseif u.name.Compare("Blücher") = 0 then