Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode Strings cannot handle high-plane characters #9

Open
HalosGhost opened this issue Apr 21, 2016 · 0 comments
Open

Unicode Strings cannot handle high-plane characters #9

HalosGhost opened this issue Apr 21, 2016 · 0 comments

Comments

@HalosGhost
Copy link

This may very well be due to how tcl handles strings underneath L.

Here is an initial PoC:

#!/usr/bin/L

puts("🐼");

Running the above file produces the following output:

ð¼

That is, characters with UTF-32 codepoints beyond 0xffff are out-of-range and are not displayed correctly.

It seems like this might be because tcl appears to use UTF-16 (or what it refers to as “double-byte” representation) internally.

As a result, anything high-plane is quite difficult to do in L.

@HalosGhost HalosGhost changed the title Unicode Strings are not sensibly indexable Unicode Strings cannot handle high-plane characters Apr 22, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant