Question concerning struct hack

**Edelweiss** · 08-22-2011

Hi, I am trying to do a struct hack

Code:

typedef struct{
   uint8_t a;
   uint8_t b[1];
}c;

Then I want to my array b to have size of 4 bytes instead so i did this

Code:

c test;
uint8_t* pValue;
pValue = (uint8_t*)malloc(4);

//Assign value to where pValue is pointing to
*pValue = 0xffff;

//memcpy to test.b 
memcpy(test.b,pValue,4);

Will this work out? if i do a size of the struct c now, will it reflect the additional bytes that i allocate?

**Maz** · 08-23-2011

No. If you have not specifically allocated larger space for struct test, it will only be the size it's type specifies. Thus your memcpy will write the rest of the data (3 bytes) to unallocated memory.

You could do

Code:

c *test=calloc(1,sizeof(c)+3);
if(NULL!=test)
{
    /* Now you have pointer to struct type c, but there's actually more space allocated */
}

NOTE: If you use some other data types but chars, the struct padding may bite your but. Eg, data is typically aligned to 4 byte boundaries, and if you declare for exampl a struct

Code:

struct foo
{
    short data1;
    int    data2:
}

it typically (NOT WITH ALL ARCHITECTURES THOUGH) will be in memory as follows:

2 bytes for data1
2 badding bytes to align data correctly
4 bytes for integer data2

There is compiler specific pragmas for example to do packing of data

(gcc has #pragma pack and #pragma pop)

But I would not encourage using these. And in any case, if you go to the road of such "hacking", you need to be carefull with alignment and little/big endian formats.

For example, if we run following code in big and little endian machines, we'll have different results (can you tell me why, if you can't, avoid doing such a "hacks"

)

Code:

char *x;
short y;

y=1;

x = (char *) &y;

printf("%d",(int) (*x));

**Salem** · 08-23-2011

You do something like this

Code:

c * test = malloc( sizeof *test + myArraySize * sizeof(test->b[0]) );
for ( i = 0 ; i < myArraySize ; i++ ) {
  test->b[i] = 0;
}

> Will this work out? if i do a size of the struct c now, will it reflect the additional bytes that i allocate?
No, because declaring a literal struct instance will ONLY ever have a 1-element array associated with it.
If you want it to be a variable length, then you must allocate a block with that amount of space.

**Edelweiss** · 08-23-2011

Salem, what is the sizeof *test referring to? is it the same as sizeof(c)?

**Maz** · 08-23-2011

Yes.

test is a pointer to data type of c. so sizeof(test) would be size of pointer, and sizeof(*test) should be size of the type (c in this case)

**Edelweiss** · 08-23-2011

hmm okie my code goes like this

Code:

c* pValue;
pValue= (c*)malloc(2 +  sizeof(uint8_t));
pValue->a= 1
uint16_t setValue= 0x1234;
memcpy(c->b,(uint8_t*)&setValue,sizeof(setValue));

the malloc will allocate 3 bytes in total. because the first byte refers to the size of a and the rest of the 2 bytes refer to the size of the setValue which takes the size of 2 bytes.

Is this correct?

**Salem** · 08-23-2011

Well it's technically correct in this instance, but if your struct was more complicated (more members, different types, with the internal padding that implied), then it would just be horrible.

The allocation I showed you simply does not care about how many members the struct has, or what type the Flexible Array Member is. All you need to know at the point of the malloc call is the name of the FAM - that's all.

Also, read the FAQ on why casting malloc is a bad idea in a C program.
Are you trying to suppress "cannot convert void*" error messages by any chance? If so, then stop using a C++ compiler to compile C code.

**~~CommonTater~~** · 08-23-2011

Originally Posted by Edelweiss

Hi, I am trying to do a struct hack

Code:

typedef struct{
   uint8_t a;
   uint8_t b[1];
}c;

Then I want to my array b to have size of 4 bytes instead so i did this

Code:

c test;
uint8_t* pValue;
pValue = (uint8_t*)malloc(4);

//Assign value to where pValue is pointing to
*pValue = 0xffff;

//memcpy to test.b 
memcpy(test.b,pValue,4);

Will this work out? if i do a size of the struct c now, will it reflect the additional bytes that i allocate?

At the risk of asking a painfully stupid question... why not just change the structure definition?

Code:

typedef struct{
   uint8_t a;
   uint8_t b[4];
}c;

And please don't use singe letter names in typedefs... name it to define the type you are creating, not the next letter in the alphabet...

**Salem** · 08-23-2011

> At the risk of asking a painfully stupid question... why not just change the structure definition?
Perhaps we're just discussing "proof of concept" at this stage, with a nice simple example.

**Edelweiss** · 08-23-2011

lol ohhh because in my implementation the size of b may varies according to the variable i want to store inside, hence asking about the concept behind struct hack.

Code:

c * test = malloc( sizeof *test + myArraySize * sizeof(test->b[0]) );

Some doubts about the example that u gave, the sizeof(c) is 2 bytes, so if i want my array to have size of 4 bytes, myArraySize should be 3 or 4? Because I am assuming that size of test includes that 1 byte for a and 1 byte for b array.

**~~CommonTater~~** · 08-23-2011

Originally Posted by Edelweiss

lol ohhh because in my implementation the size of b may varies according to the variable i want to store inside, hence asking about the concept behind struct hack.

Then you should either
A) Make one struct big enough to hold the biggest data set ( a union might be a plan, here)
or
B) Make multiple structs specific to each data set.

I know everyone here is approching this as a theoretical exercise but trying to implement this in reality would be a total nightmare.

**AndrewHunter** · 08-23-2011

Originally Posted by Edelweiss

Code:

c * test = malloc( sizeof *test + myArraySize * sizeof(test->b[0]) );

Is there any reason you are still casting malloc after you have been told not to? I will make it easy for you: FAQ-Casting Malloc() <DONT DO IT>

**Maz** · 08-24-2011

Originally Posted by CommonTater

Then you should either
A) Make one struct big enough to hold the biggest data set ( a union might be a plan, here)
or
B) Make multiple structs specific to each data set.

I know everyone here is approching this as a theoretical exercise but trying to implement this in reality would be a total nightmare.

Actually no. This kind of implementations are quite common in interfaces which are meant to be generic. The netlink socket interface for data exchange between linux kernel and userspace is a good example. The data is exchanged in a buffer which is structurized as follows:

stars with message header, where one field states the "type" of the message, and other states the total lenght. (Theres also some fields not really relevant here).
Then theres a type specific struct
and at the end there is appended a set of optional attributes. Each attribute being a struct with number telling attribute type, lenght of attribute, and actual data payload.

So basically, this whole construct is dynamic, and it contains dynamic structs. And to ease handling of all this there is a set of macros written, handling the correct alignment etc.

So it is not rare at all to have structures with dynamic arrays. Actually, that is quite standard way to exhange data. (between kernel and userspace, between processes, between computers..) Almost all network protocols flowing on top of ethernet / IP consist of such a structures. Still, even though this is common, it still has plenty of pitfalls. The hardest parts for me is alignment (padding bytes) and little/big endian conversions when exchanging data between different systems / casting values.

**quzah** · 08-24-2011

Originally Posted by Salem

You do something like this

Code:

c * test = malloc( sizeof *test + myArraySize * sizeof(test->b[0]) );
for ( i = 0 ; i < myArraySize ; i++ ) {
  test->b[i] = 0;
}

This is much harder to pull off with just a pointer:

Code:

#include<stdio.h>
#include<stdlib.h>
#include<string.h>

int main( void )
{
    struct foo { char *bar; } *baz = malloc( 100 + sizeof *baz );
    baz->bar = (char*)((&baz->bar) +1);
    strcpy( baz->bar, "hello world" );
    printf( "%s!\n", baz->bar );
    free( baz );
    return 0;
}

Ugly, and requires a cast for your pointer to "work right". It seems like I should be able to just point at the address of the pointer itself, because that's basically what making it an array would be doing, but it doesn't like that at all.

Quzah.

Thread: Question concerning struct hack

Thread Tools

Search Thread

Display

Question concerning struct hack

Similar Threads

Interesting Linux hack

are strings a hack in c?