How "slow" is it now?
How did you measure it?
How much "faster" does it need to be for success?
Which OS and compiler (including versions) are you using?
Have you tried building with the compiler's optimisation flags enabled?