escape sequence in a string

Started by jamiebreeze, October 02, 2019, 10:10:25 AM

Previous topic - Next topic

jamiebreeze

Can anyone help me about the following code? The result i got is wierd

#include <stdio.h>

int main(int argc, char *argv[])
{
unsigned char *p;
p = "\xA0\xFF";
printf("%x, %x\n", *p, *(p+1));

    return 0;
}


bitcoin


jamiebreeze

#2
I supposed the result would be "a0, ff", but "3f, 3f" is what i get, seems abnormal?

John S. Kent

You are trying to put 2 characters in a single character.  Assuming ASCII encoding, 0xA0 has a character with no glyph. 0xFF has the glyph that has 2 dots over a 'ÿ'. The display value 0x3f has the glyph '?'.

My guess is that compiler is trying to tell you that it can't display what you are asking for.

All this has to do with language localization & character sets being used.  You need to read about the c Standard closely for how c handles language encoding differences. Actually a complex subject.

This is my 1st posting on the form. I hope this is helpful; otherwise my apologies.

TimoVJL

#4
what Pelles C version, 9 ?
as 3fh is '?', what is your language/ codepage ?

for me it works.unsigned char *p = (unsigned char *)"\xA0\xFF";
give us a zipped project or just an object file, if it still fails.

optimized version don't even create string, just put parameters_main:
  [00000000] 68FF000000             push              FF
  [00000005] 68A0000000             push              A0
  [0000000A] 6800000000             push              @9
  [0000000F] E800000000             call              _printf
EDIT:
using ntleas
Building test_ch.obj.
C:\code\PellesC\_Forum\test_ch.c(6): warning #2223: Unable to convert character '\u00a0' to codepage 932; using default character.
C:\code\PellesC\_Forum\test_ch.c(6): warning #2223: Unable to convert character '\u00ff' to codepage 932; using default character.
Done.

_main:
  [00000000] 6A3F                   push              3F
  [00000002] 6A3F                   push              3F
  [00000004] 6800000000             push              @11
  [00000009] E800000000             call              _printf
  [0000000E] 83C40C                 add               esp,C
  [00000011] 31C0                   xor               eax,eax
  [00000013] C3                     ret               

another way to do byte arrayunsigned char s[] = {0xA0,0xFF,0};

test ntleas.exe with test_ch.c
A is App parametersx64\ntleas.exe pocc.exe Atest_ch.c
May the source be with you

jamiebreeze

Thank you to John and TimoVJL, but the problem still exists.
My system language is Chinese, but for 'codepage', I dont know?
I've tried several hex values to figure out why, and no matter which value I used, once it's big enough, it will be '3f'.
I'm going to test my orignal code on another computer to confirm the version problem (oh yes mine is Pelles C 9.00.9 installed on a Win10-64bit system).
Thank you again Timo for trying my codes on your machine.

John S. Kent

I only know English. So I can only give advice for an
English Windows environment.

The Code Page can be set in Windows from the Control Panel.
Select "Region & Language"
Select the "Advanced" tab of the dialog.
In the "Language for non-Unicode programs" frame
Select the Change System Local.
The selection you make is the "Code Page"
I see 5 selectable Code Pages.
Undoubtedly 1 of these is selected.

This is just the beginning.

You will need to read extensively about UNICODE & Wide-character handling in
the c language.  The are several data types & many functions specific to these
in the c language. This is well beyond my experience.

John S. Kent

I said: I see 5 selectable Code Pages.
I should have said  I see 5 select-able Code Pages for Chinese
You would have choose the correct region for you.

jamiebreeze

Thanks for your update, Timo.
And yes John, I'm just a beginner here but you get me a simple solution for the problem, thanks. (Though its hard to understand your 'only know English'.)