Endianness conversion macros

**BrownB** · 12-10-2004

Hello everybody, I'm just asking to the C programming gurus if these 2 macros are right or not.

MSB2LSBW is intended to exchange the byte order of a word.
MSB2LSBDW is intended to exchange the byte order of a Double Word.

Code:

#define MSB2LSBW( x )   ( x << 8 | (char)x )

#define MSB2LSBDW( x )  (\
			  ( ( x & 0x000000FF ) << 24 ) \
			| ( ( x & 0x0000FF00 ) << 8 ) \
			| ( ( x & 0x00FF0000 ) >> 8 ) \
			| ( ( x & 0xFF000000 ) >> 24 ) \
			 )

Thank you for any suggestion!
BrownB

**CornedBee** · 12-10-2004

Put parentheses around every x and they should be fine.

**Nyda** · 12-10-2004

Originally Posted by BrownB

Code:

#define MSB2LSBW( x )   ( x << 8 | (char)x )

#define MSB2LSBDW( x )  (\
			  ( ( x & 0x000000FF ) << 24 ) \
			| ( ( x & 0x0000FF00 ) << 8 ) \
			| ( ( x & 0x00FF0000 ) >> 8 ) \
			| ( ( x & 0xFF000000 ) >> 24 ) \
			 )

MSB2LSBW moves the lower 8 bits into the MSB, then fills the lower 8 bits with whatever the typecast return (the original lower 8 bits). You duplicated your lowbyte and killed the highbyte.
Do it as above in the other function and move those bits down from the highbyte.

There's another catch. What happens if the number is signed? How would you get rid of that effect (in both macros) ?

edit:

Originally Posted by BrownB

just asking to the C programming gurus if these 2 macros are right or not.

Ooops, just realized I shouldn't have replied. Sorry

**CornedBee** · 12-10-2004

I just noticed yet another thing. Endian-ness should be a matter of the byte stream. Yes, your macros flip the endian-ness, but at the cost of ruining the number. This thing should be, in my opinion, be done when you convert to/from a byte stream.

**BrownB** · 12-10-2004

MSB2LSBW moves the lower 8 bits into the MSB, then fills the lower 8 bits with whatever the typecast return (the original lower 8 bits). You duplicated your lowbyte and killed the highbyte.

This was one my thought: I wan't shure about the byte keeped after the cast: so (char) x will keep the lower value byte, not the lower addressed byte. Ok!

There's another catch. What happens if the number is signed? How would you get rid of that effect?

I don't understand: isn't the signed format maintained? I'm just swapping bytes, not bits: what's wrong?

**CornedBee** · 12-10-2004

What's wrong is that this way, your macros get even more implementation-defined than they already are. It will work on MSVC++, but I make no guarantees about anything else.

**BrownB** · 12-10-2004

I modified those macros:

Code:

#define BE2LEW( x )   ( ( (x) << 8 ) | ( (x) & 0xFF00 ) >> 8 )

#define BE2LEDW( x )  (\
			   (   (x) << 24 ) \
			 | ( ( (x) & 0x0000FF00 ) << 8 ) \
			 | ( ( (x) & 0x00FF0000 ) >> 8 ) \
			 | (   (x) >> 24 ) \
			 )

I need to send some data from a Big Endian machine to a Little Endian machine, and before doing that I need to swap the byte order of the word and double word data.

I still can't understand: I'm going to use those macros on the Big Endian machine to exchange Little Endian formatted data, so those will not be compiled with MSVC++ but with gcc.

I've not a deep sight inside this problem you exposed: please can you be more clear ?

BrownB

**BrownB** · 12-10-2004

This is better, I hope..

Code:

#define BE2LEW( x )    ( (x) << 8 ) | ( (x) >> 8 )

**Nyda** · 12-10-2004

Originally Posted by BrownB

I don't understand: isn't the signed format maintained? I'm just swapping bytes, not bits: what's wrong?

You're not technically swapping, you're shifting them downward. Shifting the MSB of a signed positive or unsigned is fine, but if you attempt to shift down a signed positive you will have the sign bit duplicated throughout the upper 3 bytes, so that

Code:

int a= 0x80000000;
int b= a >> 31;

would make b== 0xFFFFFFFF whereas

Code:

unsigned int a= 0x80000000;
unsigned int b= a >> 31;

would make b== 0x1.

edit -- Just to clarify: Of course you'd have the sign bit of a signed postive duplicated too, but since that is 0 it is exactly what you expect in your code.

edit2: missed the 0x...

**BrownB** · 12-10-2004

Oh...really? I was thinking that the "refill" was made only by zeros, not ones...I keep going on learning!

Thank you very much!
BrownB

**CornedBee** · 12-10-2004

Big and Little endian aren't the only ways in which bit pattern representations of numbers can differ. The most common other way is the way negative numbers are represented. The only requirement that C sets is that all-zeroes are 0. Everything else is up to the implementation (implementation refers to the way a specific compiler behaves on a specific system). This means that, apart from offset, pretty much every system is permitted: sign bit, 1's complement, 2's complement, perhaps others that I don't know about. The result is that bit shifts can only reliably executed on unsigned numbers. For all other numbers, well, let's look at some cases.
2's complement is the most common negative represenatation for integers. It's used on x86. Let's put -16 in a byte:
11110000
1's complement:
11101111
Sign bit:
10010000

In this case, left-shifting is more interesting. The 2's complement version has no problem:
<< 1 = 11100000 = -32
<< 2 = 11000000 = -64
<< 3 = 10000000 = -128
<< 4 = overflow
The arithmetic pattern of every shift multiplying by 2 is preserved with a naive shift.

1's complement:
<< 1 = 11011110 = -33
Already the rule is violated. To preserve it, you need to shift 1s in if the number is negative.
<< 1 = 11011111 = -32
<< 2 = 10111111 = -64
<< 3 = overflow

Sign bit:
<< 1 = 00100000 = 32
Again the rule is violated. The shift must be done ignoring the sign bit
<< 1 = 10100000 = -32
<< 2 = 11000000 = -64
<< 3 = overflow

Therefore, signed numbers are highly unreliable to shift. You never really know what the resulting bit pattern is. This makes your code implementation-dependent.
Of course, that's exactly what it is supposed to be, right? After all, it should transfer between different implementations. This means you need to know your implementations. The x86 will use 2's complement and LE. Your other machine uses BE. What negative system? Unless it's using 2's complement too, you have to do conversions yourself on the signed numbers, and more than just flipping endians. That one only suffices for unsigneds.

Interestingly enough, since you're only shifting in complete bytes, the signed shift problem that Nyda pointed out does not apply to you. The reason for that is that the bytes' integrity is preserved on full-byte shifts. Hard to explain, easy to figure out when you think about it.
However, this preservation only works for x's complement numbers, sign bit numbers do not preserve!

**quzah** · 12-10-2004

Therefore, unsigned numbers are highly unreliable to shift. You never really know what the resulting bit pattern is.

You meant signed, right?

Quzah.

**CornedBee** · 12-10-2004

Did, yeah. Post will be edited.

**Nyda** · 12-10-2004

Originally Posted by CornedBee

Interestingly enough, since you're only shifting in complete bytes, the signed shift problem that Nyda pointed out does not apply to you. The reason for that is that the bytes' integrity is preserved on full-byte shifts. Hard to explain, easy to figure out when you think about it.
However, this preservation only works for x's complement numbers, sign bit numbers do not preserve!

That's not true. His code was shifting the MSB down to the LSB. Afterwards this was OR'ed together with other shift operation results. The final result should therefore always have been -1 in 2's complement for negative integers.
Basically I tried to hint that he would have to apply his bitmask *after* shifting the MSB, but I guess this was not understood.

**CornedBee** · 12-10-2004

Oh, I thought he did (apply the mask after the shift). Never mind me then.

Thread: Endianness conversion macros

Thread Tools

Search Thread

Display

Endianness conversion macros

Similar Threads

Screwy Linker Error - VC2005

Dikumud

Header File Question(s)

Do I have a scanf problem?

Creation of Menu problem