zlacker

1. why_on+(OP) 2023-09-12 21:49:36
You can keep the KV cache from previous generations around, which significantly lowers the cost of processing any prompt that shares a prefix with them.
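To make that concrete, here is a rough sketch of the bookkeeping in plain Python. It is a toy under stated assumptions, not how any particular inference server does it: `project` and `PrefixKVCache` are hypothetical names, and the "projection" is a stand-in that only depends on the token and its position, whereas real K/V entries come from the model's attention layers. The point it shows is that only the tokens after the shared prefix need new K/V work.

```python
def project(token_id, position, dim=4):
    """Hypothetical stand-in for computing one token's key/value vectors."""
    key = [float((token_id * 31 + position * 7 + i) % 13) for i in range(dim)]
    value = [float((token_id * 17 + position * 3 + i) % 11) for i in range(dim)]
    return key, value


class PrefixKVCache:
    """Toy cache that keeps K/V entries for the most recent token sequence."""

    def __init__(self):
        self.tokens = []   # token ids whose K/V entries are cached
        self.keys = []
        self.values = []
        self.computed = 0  # total number of per-token projections performed

    def extend(self, prompt):
        """Make the cache cover `prompt`; return how many tokens were reused."""
        # Length of the longest prefix shared by the cached tokens and the prompt.
        shared = 0
        for old, new in zip(self.tokens, prompt):
            if old != new:
                break
            shared += 1

        # Entries past the shared prefix no longer match, so drop them.
        del self.tokens[shared:]
        del self.keys[shared:]
        del self.values[shared:]

        # Only the non-shared suffix needs fresh K/V computation.
        for position in range(shared, len(prompt)):
            key, value = project(prompt[position], position)
            self.tokens.append(prompt[position])
            self.keys.append(key)
            self.values.append(value)
            self.computed += 1
        return shared


cache = PrefixKVCache()
reused_first = cache.extend([1, 2, 3, 4])         # cold start, nothing to reuse
reused_second = cache.extend([1, 2, 3, 4, 5, 6])  # previous turn is a prefix
print(reused_first, reused_second, cache.computed)
```

In a chat-style workload each new request is usually the previous conversation plus a few new tokens, so most of the prompt falls into the reused part.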