If all you want is some speedup from the extra cores, rather than a specific amount of speedup achieved in a specific way, you might want to try an auto-parallelizing compiler. GCC has a basic option for this, -ftree-parallelize-loops=N (available since GCC 4.3), and Sun's Sun Studio compiler has one too. Check out its -xautopar option.
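To give an idea of what such a compiler looks for, here is a minimal sketch of a loop that is a plausible candidate for automatic parallelization: every iteration is independent of the others. The function name saxpy and the build lines in the comments are illustrative assumptions, not something taken from a particular project. Whether the loop really gets parallelized depends on the compiler version, the optimization level, and the loop's trip count.

```c
#include <stddef.h>

/*
 * Illustrative build lines (assumptions; check your compiler's manual):
 *   Sun Studio: cc  -xO3 -xautopar -xloopinfo -c saxpy.c
 *   GCC:        gcc -O2 -ftree-parallelize-loops=4 -c saxpy.c
 * With Sun Studio, -xloopinfo should report which loops were parallelized.
 */

/* Computes y[i] = a * x[i] + y[i] for i in [0, n).
 * No iteration reads a value written by another iteration, and the
 * restrict qualifiers (C99) promise that x and y do not overlap, so the
 * compiler is free to split the index range across threads. */
void saxpy(size_t n, float a, const float *restrict x, float *restrict y)
{
    for (size_t i = 0; i < n; i++) {
        y[i] = a * x[i] + y[i];
    }
}
```

The source itself contains nothing specific to threading, so it builds and behaves the same as ordinary serial code when the flag is left off. Loops with dependencies between iterations, such as a running sum that feeds into the next element, are generally left serial by these compilers.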