Local LLM inference – impressive but too hard to work with

Article URL: https://medium.com/@aazo11/local-llm-inference-897a06cc17a2

Comments URL: https://news.ycombinator.com/item?id=43753890

Points: 8

# Comments: 5

https://medium.com/@aazo11/local-llm-inference-897a06cc17a2

созданный 11d | 21 апр. 2025 г., 19:10:21

Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Images of Soviet Venus lander falling to Earth suggest its parachute may be out

Images of Soviet Venus lander falling to Earth suggest its parachute may be out

Article URL: https://www.leonarddavid.com/old-soviet-venus-descent-craft-nearing-earth-reentry/

2 мая 2025 г., 21:50:06 | Hacker news

VR Design Unpacked: The Secret to Beat Saber's Fun Isn't What You Think

VR Design Unpacked: The Secret to Beat Saber's Fun Isn't What You Think

Article URL: https://www.roadtovr.com/beat-saber-instructed-motion-until-you-fall-inside-xr-design/

2 мая 2025 г., 21:50:05 | Hacker news

OneText (YC W23) Is Hiring a DevOps/DBA Lead Engineer

OneText (YC W23) Is Hiring a DevOps/DBA Lead Engineer

Comments URL: https://news.ycombinator.com/item?id=43874534

Points: 0

# Comments: 0

https://news.ycombinator.com/ite

2 мая 2025 г., 21:50:04 | Hacker news

Building Burstables: CPU slicing with cgroups

Building Burstables: CPU slicing with cgroups

Article URL: https://www.ubicloud.com/blog/building-burstables-cpu-slicing-with-cgroups

Comments URL

2 мая 2025 г., 19:30:16 | Hacker news

Toma (YC W24) Is Hiring Engs #3-4 (AI for Automotive)

Toma (YC W24) Is Hiring Engs #3-4 (AI for Automotive)

Article URL: https://www.ycombinator.com/companies/toma/jobs

Comments URL:

2 мая 2025 г., 19:30:14 | Hacker news

The History of Album Art

The History of Album Art

Article URL: https://matthewstrom.com/writing/album-art/

Comments URL: http

2 мая 2025 г., 19:30:12 | Hacker news

Show HN: Blast – Fast, multi-threaded serving engine for web browsing AI agents

Show HN: Blast – Fast, multi-threaded serving engine for web browsing AI agents

Hi HN!

BLAST is a high-performance serving engine for browser-augmented LLMs, designed to make deploying web-browsing AI easy, fast, and cost-manageable.

The goal with BLAST is to ultimately a

2 мая 2025 г., 19:30:11 | Hacker news

Techie