zlacker

1. pclmul+(OP) 2024-01-17 06:26:04
Do you know that for a fact? For all calls of clamp? I have definitely used min and max when they are true 50/50s and I assume clamp also gets some similar use.
replies(1): >>fooker+vc
2. fooker+vc 2024-01-17 08:07:27
>>pclmul+(OP)
Modern compilers generally generate code on the assumption that branches are highly predictable.

If your use case does not follow that pattern and you really care about performance, you have to pull out something like inline assembly.

Consider software like ffmpeg, which has to do this for the sake of performance.
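
To make the trade-off concrete, here is a rough C++ sketch of a clamp written two ways: a plain version where the compiler chooses between jumps and conditional moves, and a mask-based version that avoids data-dependent jumps at the source level. The names clamp_plain, clamp_masked, select_i32 and check_clamps_agree are made up for illustration. This is not how ffmpeg does it, and nothing guarantees the compiler keeps either form as written, so check the generated assembly and benchmark before relying on it.

```cpp
#include <cassert>
#include <cstdint>

// Plain form: the compiler decides whether this becomes jumps or conditional moves.
inline std::int32_t clamp_plain(std::int32_t x, std::int32_t lo, std::int32_t hi) {
    if (x < lo) return lo;
    if (x > hi) return hi;
    return x;
}

// Hypothetical helper: returns a when cond is true, otherwise b, via a bit mask.
// The unsigned-to-signed conversion is modular on mainstream compilers
// (and guaranteed to be so from C++20 onward).
inline std::int32_t select_i32(bool cond, std::int32_t a, std::int32_t b) {
    const std::uint32_t mask = 0u - static_cast<std::uint32_t>(cond); // all ones or all zeros
    const std::uint32_t ua = static_cast<std::uint32_t>(a);
    const std::uint32_t ub = static_cast<std::uint32_t>(b);
    return static_cast<std::int32_t>((ua & mask) | (ub & ~mask));
}

// Mask-based form: two selects, no data-dependent jumps in the source.
// Assumes lo <= hi, like the plain version.
inline std::int32_t clamp_masked(std::int32_t x, std::int32_t lo, std::int32_t hi) {
    x = select_i32(x < lo, lo, x);
    x = select_i32(x > hi, hi, x);
    return x;
}

// Sanity check: both forms agree over a small range of inputs.
inline void check_clamps_agree(std::int32_t lo, std::int32_t hi) {
    for (std::int32_t x = -1000; x <= 1000; ++x) {
        assert(clamp_plain(x, lo, hi) == clamp_masked(x, lo, hi));
    }
}
```

Whether the masked form is actually faster depends on the data: with well-predicted branches the plain form can win, while with genuinely 50/50 inputs the branch-free form tends to be more stable.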
