Vector vs. array.

**matsp** · 06-21-2008

Just a quick analyzis of the vector vs. array code generated shows:

Code:

// array code
;	COMDAT ?sum@A@@UAEHXZ
_TEXT	SEGMENT
?sum@A@@UAEHXZ PROC NEAR				; A::sum, COMDAT
; _this$ = ecx

	push	ebx
	push	esi
	xor	eax, eax
	push	edi
	add	ecx, 32					; 00000020H
	mov	esi, 128				; 00000080H
$L11447:
	mov	edx, 16					; 00000010H
$L11451:
	mov	ebx, DWORD PTR [ecx-24]
	mov	edi, DWORD PTR [ecx-28]
	add	edi, ebx
	add	edi, DWORD PTR [ecx-20]
	add	edi, DWORD PTR [ecx-16]
	add	edi, DWORD PTR [ecx-12]
	add	edi, DWORD PTR [ecx-8]
	add	edi, DWORD PTR [ecx-4]
	add	edi, DWORD PTR [ecx]
	add	eax, edi
	add	ecx, 32					; 00000020H
	dec	edx
	jne	SHORT $L11451
	dec	esi
	jne	SHORT $L11447
	pop	edi
	pop	esi
	pop	ebx
	ret	0
?sum@A@@UAEHXZ ENDP					; A::sum
_TEXT	ENDS

// vector code:
_TEXT	SEGMENT
?sum@V@@UAEHXZ PROC NEAR				; V::sum, COMDAT
; _this$ = ecx
	push	esi
	mov	esi, DWORD PTR [ecx+8]
	push	edi
	xor	eax, eax
	add	esi, 4
	mov	edi, 128				; 00000080H
	npad	1
$L11413:
	mov	ecx, DWORD PTR [esi]
	add	ecx, 8
	mov	edx, 16					; 00000010H
	npad	6
$L11417:
	add	eax, DWORD PTR [ecx-8]
	add	eax, DWORD PTR [ecx-4]
	add	eax, DWORD PTR [ecx]
	add	eax, DWORD PTR [ecx+4]
	add	eax, DWORD PTR [ecx+8]
	add	eax, DWORD PTR [ecx+12]
	add	eax, DWORD PTR [ecx+16]
	add	eax, DWORD PTR [ecx+20]
	add	ecx, 32					; 00000020H
	dec	edx
	jne	SHORT $L11417
	add	esi, 16					; 00000010H
	dec	edi
	jne	SHORT $L11413
	pop	edi
	pop	esi
	ret	0
?sum@V@@UAEHXZ ENDP					; V::sum

The main difference, as I see it, is the vector has to do another indirection in the outer loop, vs. the array just doing a simple decrement operation in the second loop. I would actually expect the compiler to merge the two loops, since loading edx with 16 then using edi to loop the outer loop seems excessive, why not just load edx with 16 * 128?

The above code is with "size" set to 128 rather than 100 as in the posted code - I doubt it makes much difference in the overall code generated, I changed to 128 to see if gcc would do some better code generation - currenly, the MS compiler beats the gcc version by running BOTH loops quicker than one of the gcc functions.

--
Mats

**Elysia** · 06-21-2008

So in essence, the compiler is able to apply further optimizations to the vector loop than the array loop.
Clever compiler. Or is it clever code?

**matsp** · 06-21-2008

Originally Posted by Elysia

So in essence, the compiler is able to apply further optimizations to the vector loop than the array loop.
Clever compiler. Or is it clever code?

The other way around, the array is a more compact loop than the vector [with the version of compiler I've been using, at least].

--
Mats

**grumpy** · 06-21-2008

Originally Posted by iMalc

I'm with dwks:
A vector can be as fast as an array, not faster. Why? because a vector uses an array internally. Anything that proves otherwise has to be an invalid test, and there are so many ways for it to be invalid.

I'd actually argue the reverse: contradiction of a belief or theory by a counter-example is evidence that belief or theory is incorrect (or, at best, incomplete).

**laserlight** · 06-22-2008

I'd actually argue the reverse: contradiction of a belief or theory by a counter-example is evidence that belief or theory is incorrect (or, at best, incomplete).

A certain Albert Einstein reportedly stated that: "If the facts don't fit the theory, change the facts."

But yes, I think the point is that we need to be sure that the counter-example is indeed a counter-example.

**grumpy** · 06-22-2008

Originally Posted by laserlight

A certain Albert Einstein reportedly stated that: "If the facts don't fit the theory, change the facts."

It was quite a time ago that I read the story of that, but I seem to recall the context was a swipe at occurrences of inconvenient facts being dismissed because of entrenched beliefs.

Originally Posted by laserlight

But yes, I think the point is that we need to be sure that the counter-example is indeed a counter-example.

Sure. The way to check that is to explain what occurred, and make sure the supporting examples and counter-examples are evaluated on an equal footing, not to dismiss occurrences by default.

**Daved** · 06-23-2008

I apologize if this has been stated already. matsp, why are you comparing a statically sized 2D array with a vector of vectors that is created in a loop in the class constructor? It's not a terribly fair comparison in the first place, but it still could be useful and interesting if you used the vector's constructor to initialize it instead of a loop.

I'm wondering what the results of using std::tr1::array (or boost::array) would be here.

Edit: Oh... you're not timing initialization, just access.

**matsp** · 06-23-2008

Originally Posted by Daved

Edit: Oh... you're not timing initialization, just access.

Correct, I'm only timing the access of the contents of both the array and the vector.

I'd be interested to see the code of those who can achieve faster vector than array access, since I can't really see how that can be done - the only thing I can think of is:
1. the array access is less than optimal due to some sort of alignment issues.
2. the array access is done via for example mul operations because the array sizes is not a factor 2^n. But since the 2D array access in Visual Studio .Net translates to a 1D array access, I can't really see how that happens.

--
Mats

Thread: Vector vs. array.

Thread Tools

Search Thread

Display

Similar Threads

allocation and reallocation of memory dynamically of an integer array

Array coping into another array?or function returning array

[question]Analyzing data in a two-dimensional array

Unknown Memory Leak in Init() Function

Quick question about SIGSEGV