zlacker

[return to "How to Use AVX512 in Golang"]
1. sylwar+b76[view] [source] 2023-01-23 14:03:47
>>signa1+(OP)
Not really related to golang, but AVX512 is THE sweet spot for vector instructions, since 512 bits = 64 bytes = the x86_64 cache line size.
2. zamada+ku6[view] [source] 2023-01-23 15:50:41
>>sylwar+b76
Dumb question but do you have to do anything manually to ensure it’s actually aligned to a single cache line or does it just happen because of the way things work?
3. stagge+2z6[view] [source] 2023-01-23 16:06:38
>>zamada+ku6
Not a dumb question. In theory, yes, you want it to be aligned for performance reasons, and you can do this with one of the _aligned_malloc()/aligned_alloc() variants. In reality though, for any non-trivial algorithm, you may not always be able to enforce this.
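For illustration, here is a minimal C sketch of the aligned-allocation approach mentioned above, assuming a C11 toolchain that provides aligned_alloc() and the 64-byte cache line size quoted earlier in the thread. The names CACHE_LINE, is_cache_aligned, and alloc_cache_aligned_floats are hypothetical helpers made up for this example, not part of any library.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumption: 64-byte cache lines, as stated for x86_64 in the thread. */
#define CACHE_LINE 64

/* Hypothetical helper: returns 1 if p starts on a cache-line boundary. */
static int is_cache_aligned(const void *p) {
    return ((uintptr_t)p % CACHE_LINE) == 0;
}

/* Hypothetical helper: allocate room for `count` floats starting on a
 * 64-byte boundary. Release the result with free(). */
static float *alloc_cache_aligned_floats(size_t count) {
    size_t bytes = count * sizeof(float);

    /* C11 aligned_alloc expects the size to be a multiple of the
     * alignment, so round up to a whole number of cache lines. */
    size_t padded = (bytes + CACHE_LINE - 1) / CACHE_LINE * CACHE_LINE;
    if (padded == 0) {
        padded = CACHE_LINE;
    }

    float *p = aligned_alloc(CACHE_LINE, padded);

    /* Sanity check: a successful allocation must be aligned. */
    assert(p == NULL || is_cache_aligned(p));
    return p;
}
```

With a buffer from alloc_cache_aligned_floats, every 16-float (64-byte) block starts on its own cache line, so a full-width 512-bit load at a multiple of 16 elements does not straddle two lines. On Windows toolchains without aligned_alloc(), _aligned_malloc() paired with _aligned_free() would play the same role.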