Article URL: https://textquery.app/
Comments URL: https://news.ycombinator.com/item?id=43897129
Points: 50
# Comments: 19
Creato
11d
|
5 mag 2025, 19:20:10
Accedi per aggiungere un commento
Altri post in questo gruppo

Hey everyone!
Over the past two years I threw myself back into full-time engineering with a simple goal: write code that gives back to the community. After a lot of late-night FOMO (“AI w


Article URL: https://cacm.acm.org/news/the-collapse-of-gpt/

I discovered that in LLM inference, keys and values in the KV cache have very different quantization sensitivities. Keys need higher precision than values to maintain quality.
I patched llama.cp
Article URL: https://clojurescript.org/news/2025-05-16-release
