Right the intrinsic instruction is _mm_mul_ps, the benefit is being able to working on 4 floats as a time, of course ymmv : )
Type: Posts; User: Danielm103
Right the intrinsic instruction is _mm_mul_ps, the benefit is being able to working on 4 floats as a time, of course ymmv : )
how about using SIMD?
a quick hack with VS
#include "stdafx.h"
#include <iostream>
#include <vector>
//0.0965613