zlacker

[parent] [thread] 2 comments
1. startu+(OP)[view] [source] 2023-03-05 16:36:47
At 1080p yes, tensor cores are not used. But at 4k majority of the pixels are filled by tensor cores (DLSS), so these FP8 ops are used.

Of course the card linked above is a server card, not a desktop or workstation card optimized for rendering.

What is that Megatron chat in the advertisement? Does it refer to a loser earth destroying character from Transformers? Rockfart?

replies(2): >>dahart+w6 >>tysam_+e31
2. dahart+w6[view] [source] 2023-03-05 17:08:44
>>startu+(OP)
Oh yeah excellent point, I should not draw lines between graphics and ML — graphics has will continue to see more and more ML applications. I hope none of my coworkers see this.

I guess Megatron is a language model framework https://developer.nvidia.com/blog/announcing-megatron-for-tr...

3. tysam_+e31[view] [source] 2023-03-05 23:01:49
>>startu+(OP)
Megatron is a Large Language Model -- unfortunately it seems they really undertrained it for the parameter counts it had, so it was more a numbers game of "hey, look how big this model is!" when they first released it.

Many modern models are far more efficient for inference IIRC, though I guess it remains a good exercise in "how much can we fit through this silicon?" engineering. :D

[go to top]