26× Faster Inference with Layer-Condensed KV Cache for Large Language Models

Article URL: https://arxiv.org/abs/2405.10637

Comments URL: https://news.ycombinator.com/item?id=40416657

Points: 22

# Comments: 1

https://arxiv.org/abs/2405.10637

Établi 24d | 20 mai 2024 à 18:30:34

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

Bibliography keys: It's as easy as [1], [2], [3]

Bibliography keys: It's as easy as [1], [2], [3]

Article URL: http://blog.cr.yp.to/20240612-bibkeys.html

Comments URL: https:

13 juin 2024 à 19:10:17 | Hacker news

SSH agent extensions as an arbitrary RPC mechanism

SSH agent extensions as an arbitrary RPC mechanism

Article URL: https://mjg59.dreamwidth.org/69646.html

Comments URL: https://news

13 juin 2024 à 19:10:16 | Hacker news

Show HN: Pathway – Build Mission Critical ETL and RAG in Python (NATO, F1 Used)

Show HN: Pathway – Build Mission Critical ETL and RAG in Python (NATO, F1 Used)

Hi HN data folks,

I am excited to share Pathway, a Python data processing framework we built for ETL and RAG pipelines.

https://github.com/pathw

13 juin 2024 à 19:10:15 | Hacker news

IntelliJ GitHub Plugin leaking credentials

IntelliJ GitHub Plugin leaking credentials

Article URL: https://blog.jetbrains.com/security/2024/06/up

13 juin 2024 à 19:10:14 | Hacker news

Postgres 17: Streaming I/O for sequential scans and ANALYZE

Postgres 17: Streaming I/O for sequential scans and ANALYZE

Article URL: https://pganalyze.com/blog/5mins-postgres-17-streaming-io

Comments URL:

13 juin 2024 à 19:10:13 | Hacker news

Ted Chiang has won the PEN/Faulkner Foundation's short story prize

Ted Chiang has won the PEN/Faulkner Foundation's short story prize

Article URL: https://lithub.com/ted-chiang-has-won-the-pen-faulkner-foundations-short-story-prize/

13 juin 2024 à 19:10:12 | Hacker news

Spectrum of Covid-19: From Asymptomatic Organ Damage to Long Covid Syndrome

Spectrum of Covid-19: From Asymptomatic Organ Damage to Long Covid Syndrome

Article URL: https://whn.global/scientific/spectrum-of-covid-19-from-asymptomati

13 juin 2024 à 19:10:10 | Hacker news

Techie