Originally Posted by
ali.franco95
After dynamically allocating my arrays, I think I have seen some speed gains in my code, down from 13sec to 11sec. Am I imagining this (it could be that I am not listening to music while executing the program), or is there something to it? I just noticed it; I was not looking to optimise my code.
Here's something I learned recently here; it has to do specifically with the linux kernel, but since it concerns fundamental issues, it probably applies in some form to all modern operating systems.
11-13 seconds is a long time and implies the total size of these arrays is very large. What happens when you do this:
Code:
int *array_x = malloc(10000*sizeof(int));
Is that the kernel assigns the program enough virtual address space to cover the array. This is not actual memory yet. There are a few reasons for doing it that way:
1) So that the OS can juggle numerous large applications simultaneously which, taken together, might exhaust all of the real physical memory.
2) So that the OS can provide contiguous addresses (which the C standard requires for arrays, e.g.) even when no single physical chunk that large is available, by assembling small, non-contiguous blocks behind the scenes.
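You can actually watch this happen on linux by comparing the process's virtual size (VmSize) to its resident, physical size (VmRSS) at each stage. A minimal sketch, assuming /proc/self/status is available; the array size here is only chosen to make the difference obvious:
Code:
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define N 100000000UL               /* ~400 MB worth of ints */

/* print the VmSize and VmRSS lines from /proc/self/status */
static void show_mem(const char *label)
{
    char line[256];
    FILE *f = fopen("/proc/self/status", "r");
    if (!f) return;
    printf("--- %s ---\n", label);
    while (fgets(line, sizeof line, f))
        if (!strncmp(line, "VmSize:", 7) || !strncmp(line, "VmRSS:", 6))
            fputs(line, stdout);
    fclose(f);
}

int main(void)
{
    show_mem("before malloc");

    int *array_x = malloc(N * sizeof(int));
    if (!array_x) return 1;
    show_mem("after malloc");       /* VmSize jumps; VmRSS barely moves */

    for (unsigned long i = 0; i < N; i += 1024)
        array_x[i] = 1;             /* touch one int per 4096-byte page */
    show_mem("after writing");      /* now VmRSS jumps too */

    free(array_x);
    return 0;
}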
If you now examine the uninitialized contents of array_x, it is all zeros. On linux at least, these zeros are fake: a read of untouched heap memory still does not involve any real physical memory (the kernel can answer it from a single shared, read-only zero page).
Physical memory (actual RAM) only comes into play when you write something into the array. At that point, the kernel finds enough physical memory to cover the part written into, and no more. I.e., if the first thing you do is this:
Code:
array_x[1001] = 666;
array_x[7123] = 3000000;
The kernel will come up with two page-sized blocks of RAM; a page is the smallest unit of memory the kernel will deal with (4096 bytes is common). Index 1001 is byte offset 4004 (page 0 of the array) and index 7123 is byte offset 28492 (page 6), so two distinct pages get faulted in. At this point, presuming sizeof(int) == 4, array_x represents 40000 bytes of virtual address space, but only 8192 bytes of real memory.
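If you want to check that arithmetic on your own machine rather than assume 4096-byte pages, POSIX lets you ask for the real page size (so this sketch is linux/unix only):
Code:
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    long page = sysconf(_SC_PAGESIZE);      /* typically 4096 */
    printf("page size: %ld bytes\n", page);
    /* byte offset of each index, divided by the page size */
    printf("array_x[1001] -> page %ld\n", (long)(1001 * sizeof(int)) / page);
    printf("array_x[7123] -> page %ld\n", (long)(7123 * sizeof(int)) / page);
    return 0;
}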
Every time a new page is needed to cover a write, there is some processor overhead (a minor page fault). I believe this is part of why malloc is traditionally considered an expensive call; historically, all of this would happen at once (which would probably be more efficient time-wise, but very inefficient RAM-wise). Instead, it now happens a page at a time, when allocated space is written into for the first time.
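You can count those faults on linux; getrusage() reports the process's running total of minor page faults. A rough sketch; the counter is noisy because everything else the process does also faults pages in, but the delta around the two writes should typically be 2:
Code:
#include <stdio.h>
#include <stdlib.h>
#include <sys/resource.h>

/* current count of minor page faults for this process */
static long minor_faults(void)
{
    struct rusage ru;
    getrusage(RUSAGE_SELF, &ru);
    return ru.ru_minflt;
}

int main(void)
{
    int *array_x = malloc(10000 * sizeof(int));
    if (!array_x) return 1;

    long before = minor_faults();
    array_x[1001] = 666;            /* first touch of one page */
    array_x[7123] = 3000000;        /* first touch of another */
    long after = minor_faults();

    printf("minor faults caused by the two writes: %ld\n", after - before);
    free(array_x);
    return 0;
}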
Part of the cost here is that the kernel actually zeroes the whole page before your write lands in it.* That is also why calloc() is nearly free on linux (and, I believe, on other modern OSes) for memory that comes fresh from the kernel: it is zeroed anyway. Note that this only holds for fresh pages; memory that malloc recycles within your own process is not rezeroed, so calloc() still has a job there. One reason for the kernel-level zeroing is security: if memory were left in the state its previous owner left it in, you could get access to all kinds of information you maybe should not have access to just by allocating large chunks of memory and then reading the uninitialized data.
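A quick way to see the difference, with the usual caveat that reading uninitialized memory is undefined behaviour in C, so the first result depends on the allocator (glibc typically behaves as the comments say):
Code:
#include <stdio.h>
#include <stdlib.h>

int main(void)
{
    int *p = malloc(4096);          /* small: served from the process heap */
    if (!p) return 1;
    p[100] = 42;
    free(p);

    /* the allocator will typically hand that same chunk straight back */
    int *q = malloc(4096);
    int *r = calloc(1024, sizeof(int));
    if (!q || !r) return 1;

    printf("%d\n", q[100]);         /* often prints 42: recycled, not rezeroed */
    printf("%d\n", r[100]);         /* always 0: calloc guarantees it */

    free(q);
    free(r);
    return 0;
}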
So this might be your issue. There are ways to determine that (the page fault counter above, for instance) and to alleviate the problem, but they will be platform-specific.
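On linux, for example, one fix is to pay the whole faulting cost up front, outside the part of the program you are timing: either touch every page yourself right after the malloc (a plain memset() works), or ask the kernel to populate the mapping immediately. A sketch of the latter; mmap() with MAP_POPULATE is linux-specific:
Code:
#define _GNU_SOURCE                 /* for MAP_POPULATE on some setups */
#include <stddef.h>
#include <sys/mman.h>

/* allocate n ints with every page faulted in immediately,
   instead of one minor fault per page at first write;
   release with munmap(p, n * sizeof(int)), not free() */
int *alloc_prefaulted(size_t n)
{
    void *p = mmap(NULL, n * sizeof(int), PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS | MAP_POPULATE, -1, 0);
    return p == MAP_FAILED ? NULL : p;
}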
* it could be that this zeroing actually happens when the previous owner frees the page.