zlacker

[return to "OrangePi 6 Plus Review"]
1. andy99+Ta[view] [source] 2025-12-27 14:42:36
>>ekianj+(OP)
When something has an 30 TOPS NPU, what are the implications? Do NPUs like this have some common backend that ggml/llama.cpp targets? Is it proprietary and only works for some specific software? Does it have access to all the system RAM and at what bandwidth?

I know the concept has been around for a while but no idea if it actually means anything. I assume that people are targeting ones in common devices like Apple, but what about here?

◧◩
2. ekianj+8b[view] [source] 2025-12-27 14:44:49
>>andy99+Ta
It needs specific support, and for example llama.cpp would have support for some of them. But that comes with limitations in how much RAM they can allocate. But when they work, you see a flat CPU usage and the NPU does everything for inference.
[go to top]