Article URL: https://daniel.haxx.se/blog/2025/05/16/leeks-and-leaks/
Comments URL: https://news.ycombinator.com/item?id=44003447
Points: 72
# Comments: 4
Létrehozva
10h
|
2025. máj. 16. 12:40:18
Jelentkezéshez jelentkezzen be
EGYÉB POSTS Ebben a csoportban

I discovered that in LLM inference, keys and values in the KV cache have very different quantization sensitivities. Keys need higher precision than values to maintain quality.
I patched llama.cp
Article URL: https://clojurescript.org/news/2025-05-16-release




Article URL: https://bobacollection.staxmuseum.org/
Comments URL: https://news.y