Recent posts

Welcome to Pelles C forum.
Log in
Sign up

July 12, 2025, 11:24:20 AM

News:

Download Pelles C here: http://www.smorgasbordet.com/pellesc/

Main Menu

Home
Search

Pelles C forum
► Recent posts

Pages 1 2 3 4 5 ... 10

#21

Assembly discussions / Re: Unicode strings in Assembl...

Last post by TimoVJL - July 04, 2025, 04:01:14 PM

I just analyzed things with pope.exe and TLPEView.exe
Just checking object file .data section, something is really wrong.
So use those tools to check how things went.

#22

Assembly discussions / Re: Unicode strings in Assembl...

Last post by Vortex - July 04, 2025, 03:27:32 PM

Hello LeraUnu,

Quoteset the encoding of the source files to UTF-16LE and everything is ok.

That didn't work for me. Tested with PellesC V13. I receive the uncorrect text display.

#23

Assembly discussions / Re: Unicode strings in Assembl...

Last post by LeraUnu - July 04, 2025, 03:13:01 PM

Hi TimoVJL,

Thank you for your answer.
Unfortunately I don't know how to generate an object file with the correct string.
In a C project I can define UNICODE and _UNICODE symbols, set the encoding of the source files to UTF-16LE and everything is ok.
In asm I don't know how ...

#24

Assembly discussions / Re: Unicode strings in Assembl...

Last post by TimoVJL - July 04, 2025, 07:50:48 AM

In UTF8 file

Code Select

pFile      00 01 02 03 04 05 06 07  08 09 0A 0B 0C 0D 0E 0F    Value           
00000000    EF BB BF 6D 65 73 73 61  67 65 20 64 77 20 27 54    ï»¿message dw 'T
00000010    68 69 73 20 61 20 55 4E  49 43 4F 44 45 20 74 65    his a UNICODE te
00000020    73 74 20 C4 83 2C 20 C3  A2 2C 20 C3 AE 2C 20 C8    st Äƒ, Ã¢, Ã®, È
00000030    99 2C 20 C8 9B 27 2C 30  0D 0A     ™, È›',0..

In UTF16 file

Code Select

pFile      00 01 02 03 04 05 06 07  08 09 0A 0B 0C 0D 0E 0F    Value           
00000000    FF FE 6D 00 65 00 73 00  73 00 61 00 67 00 65 00    ÿþm.e.s.s.a.g.e.
00000010    20 00 64 00 77 00 20 00  27 00 54 00 68 00 69 00     .d.w. .'.T.h.i.
00000020    73 00 20 00 61 00 20 00  55 00 4E 00 49 00 43 00    s. .a. .U.N.I.C.
00000030    4F 00 44 00 45 00 20 00  74 00 65 00 73 00 74 00    O.D.E. .t.e.s.t.
00000040    20 00 03 01 2C 00 20 00  E2 00 2C 00 20 00 EE 00     ...,. .â.,. .î.
00000050    2C 00 20 00 19 02 2C 00  20 00 1B 02 27 00 2C 00    ,. ...,. ...'.,.
00000060    30 00 0D 00 0A 00     0.....

In object file

Code Select

pFile      00 01 02 03 04 05 06 07  08 09 0A 0B 0C 0D 0E 0F    Value           
000000EB    48 00 65 00 6C 00 6C 00  6F 00 00 00 54 00 68 00    H.e.l.l.o...T.h.
000000FB    69 00 73 00 20 00 61 00  20 00 55 00 4E 00 49 00    i.s. .a. .U.N.I.
0000010B    43 00 4F 00 44 00 45 00  20 00 74 00 65 00 73 00    C.O.D.E. .t.e.s.
0000011B    74 00 20 00 C4 00 83 00  2C 00 20 00 C3 00 A2 00    t. .Ä.ƒ.,. .Ã.¢.
0000012B    2C 00 20 00 C3 00 AE 00  2C 00 20 00 C8 00 99 00    ,. .Ã.®.,. .È.™.
0000013B    2C 00 20 00 C8 00 9B 00  00 00     ,. .È.›...

#25

Assembly discussions / Re: Unicode strings in Assembl...

Last post by LeraUnu - July 03, 2025, 06:41:05 PM

Hi Vortex,

I know it is an old topic...
If I change the message to display:

Code Select

message dw 'This a UNICODE test ă, â, î, ș, ț',0

the message box doesn't display correctly.
Even if I change the encoding of the source file to UTF-16LE
nothing happens.

Can you help me?

Thank you!

#26

Beginner questions / Re: wcslen on MinGW-w64

Last post by Vortex - June 29, 2025, 01:03:34 PM

Hi dimmed,

Any chance to learn why specifically you selected this string?

#27

Beginner questions / Re: wcslen on MinGW-w64

Last post by Pelle - June 28, 2025, 03:20:34 PM

When using:

Code Select

wcslen((wchar_t*)str))

at the very least, make sure str is a properly terminated wide string (ignoring all other problems with this approach for now).

Append one (or two) '\0' to str, like so:

Code Select

const char* str = "\x41\x00\xa9\x03\x03\x26\x2d\x4e\x3d\xd8\x02\xde\0";

#28

Beginner questions / Re: wcslen on MinGW-w64

Last post by TimoVJL - June 28, 2025, 12:27:48 PM

Code Select

#include <stdio.h>
#include <wchar.h>

int main(void)
{
    const char* str = "\x41\x00\xa9\x03\x03\x26\x2d\x4e\x3d\xd8\x02\xde";
    printf("wcslen(str) = %zu\n", wcslen((wchar_t*)str));
    return 0;
}

x86

Code Select

wcslen(str) = 15x64

Code Select

wcslen(str) = 9
This version using -Ze gives 6 in both cases

Code Select

#define WIN32_LEAN_AND_MEAN
#include <windows.h>
#include <stdio.h>
#include <wchar.h>

#pragma comment(lib, "user32.lib")

int main(void)
{
    const char* str = "\x41\x00\xa9\x03\x03\x26\x2d\x4e\x3d\xd8\x02\xde";
    printf("wcslen(str) = %zu\n", wcslen((wchar_t*)str));
	MessageBoxW(0, (wchar_t*)str, L"test", MB_OK);
    return 0;
}

#29

Beginner questions / Re: wcslen on MinGW-w64

Last post by Vortex - June 28, 2025, 10:24:49 AM

Hi dimmed,

Could you be more specific as your question is not directly related to Pelles C ?

#30

Beginner questions / wcslen on MinGW-w64

Last post by dimmed - June 28, 2025, 05:56:18 AM

I'm the groggily user on Github. I tried to use a library named libutf in the past:

https://github.com/holepunchto/libutf/issues/1

https://github.com/holepunchto/libutf/issues/2

The developer was not easy to work with. In the end the thread closed without anything settled.

I still think I was right about wcslen on MinGW-w64.

Pages 1 2 3 4 5 ... 10