
Not sure why there's a performance difference between the function and non-function versions, particularly considering this looks inlinable.

Two theories on why gcc is doing better. Maybe it's unrolling the loop and avoiding a bunch of jumps. Alternatively, it could be using SSE, but that theory is much less likely, because I would expect it to be 4 times faster, not twice as fast. gc doesn't use SSE instructions yet.
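
To make the unrolling theory concrete, here is a rough sketch of what that transformation looks like when done by hand in Go. This is not the benchmark from the thread; sumSimple and sumUnrolled are made-up names and the summing loop is just an assumed stand-in. The unrolled version does one loop test per four elements instead of one per element, which is the kind of saving an unrolling compiler would get.

```go
package main

import "fmt"

// sumSimple adds one element per iteration, so the loop condition
// is checked once for every element.
func sumSimple(xs []int) int {
	total := 0
	for i := 0; i < len(xs); i++ {
		total += xs[i]
	}
	return total
}

// sumUnrolled is a hand-unrolled version (hypothetical, for illustration):
// it handles four elements per iteration, so the loop condition is
// checked roughly a quarter as often.
func sumUnrolled(xs []int) int {
	total := 0
	i := 0
	// Main loop: four elements at a time.
	for ; i+4 <= len(xs); i += 4 {
		total += xs[i] + xs[i+1] + xs[i+2] + xs[i+3]
	}
	// Tail loop: whatever is left when len(xs) is not a multiple of 4.
	for ; i < len(xs); i++ {
		total += xs[i]
	}
	return total
}

func main() {
	xs := []int{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
	fmt.Println(sumSimple(xs), sumUnrolled(xs)) // prints "55 55"
}
```

Both functions return the same result; whether the unrolled one is actually faster depends on the compiler and the CPU, so it would need to be benchmarked rather than assumed.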


