zlacker

[parent] [thread] 0 comments
1. Locke1+(OP)[view] [source] 2010-07-08 09:19:17
If they wrote this in pure assembly as a personal project: congratulations and good on them. However, if they actually believe that this is a performance benefit I have to say that they know just enough about hardware optimization to be dangerous.

One of the great things about writing the core in C and then either optimizing for the compiler or writing the inner loop in the target ISA is that one can adapt with the CPU architecture. Anyone who has done development for true high performance architectures knows that you need to optimize for memory architecture as much as CPU architecture. When the new Xeons come out with double the L2 and a larger cache line and X cores in NUMA layout, writing everything in the Pentium 3 ISA is going to look pretty stupid.

[go to top]