zlacker

[parent] [thread] 1 comments
1. kllrno+(OP)[view] [source] 2023-10-14 19:14:25
https://developer.nvidia.com/blog/how-access-global-memory-e...

SIMT still expects coalesced memory access that's close together otherwise performance falls off a cliff

replies(1): >>the_sv+UK1
2. the_sv+UK1[view] [source] 2023-10-15 14:32:49
>>kllrno+(OP)
Yes, but not all thread in the block need to. As long as you fill a cache line you’re good.
[go to top]