What I’m asking HN:
What does your actually useful local LLM stack look like?
I’m looking for something that provides you with real value — not just a sexy demo.
---
After a recent internet outage, I realized I need a local LLM setup as a backup — not just for experimentation and fun.
My daily (remote) LLM stack:
- Claude Max ($100/mo): My go-to for pair programming. Heavy user of both the Claude web and desktop clients.
- Windsurf Pro ($15/mo): Love the multi-line autocomplete and how it uses clipboard/context awareness.
- ChatGPT Plus ($20/mo): My rubber duck, editor, and ideation partner. I use it for everything except code.
Here’s what I’ve cobbled together for my local stack so far:

Tools
- Ollama: for running models locally
- Aider: Claude-code-style CLI interface
- VSCode w/ continue.dev extension: local chat & autocomplete
Models
- Chat: llama3.1:latest
- Autocomplete: Qwen2.5 Coder 1.5B
- Coding/Editing: deepseek-coder-v2:16b
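In case it helps anyone replicating this, my continue.dev setup is roughly the config below. This is a sketch from memory, not my exact file — the model names just mirror the list above, and newer Continue releases have moved from `config.json` to `config.yaml`, so check their docs for the current schema:

```json
{
  "models": [
    {
      "title": "Llama 3.1 (local)",
      "provider": "ollama",
      "model": "llama3.1:latest"
    },
    {
      "title": "DeepSeek Coder V2 (local)",
      "provider": "ollama",
      "model": "deepseek-coder-v2:16b"
    }
  ],
  "tabAutocompleteModel": {
    "title": "Qwen2.5 Coder 1.5B",
    "provider": "ollama",
    "model": "qwen2.5-coder:1.5b"
  }
}
```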
Things I’m not worried about:
- CPU/Memory (running on an M1 MacBook)
- Cost (within reason)
- Data privacy / being trained on (not trying to start a philosophical debate here)
I am worried about:
- Actual usefulness (i.e. “vibes”)
- Ease of use (tools that fit with my muscle memory)
- Correctness (not benchmarks)
- Latency & speed
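On the latency point: the quickest sanity check I know is timing a single blocking call against Ollama’s REST API. A minimal sketch, assuming Ollama’s default port 11434 and its documented `/api/generate` endpoint (the prompt and model name here are just placeholders):

```python
import json
import time
import urllib.request

# Ollama's default local endpoint for one-shot completions.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_payload(model: str, prompt: str) -> dict:
    """JSON body for a non-streaming /api/generate call."""
    return {"model": model, "prompt": prompt, "stream": False}

def timed_generate(model: str, prompt: str, url: str = OLLAMA_URL) -> tuple[str, float]:
    """Return (completion text, wall-clock seconds) for one blocking call."""
    req = urllib.request.Request(
        url,
        data=json.dumps(build_payload(model, prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    start = time.perf_counter()
    with urllib.request.urlopen(req) as resp:
        text = json.loads(resp.read())["response"]
    return text, time.perf_counter() - start

if __name__ == "__main__":
    text, secs = timed_generate("llama3.1:latest", "Say hi in one word.")
    print(f"{secs:.2f}s: {text!r}")
```

Running it a few times per model gives a rough feel for warm vs. cold latency without any tooling in the way.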
Right now: I’ve got it working. I could make a slick demo. But it’s not actually useful yet.
---
Who I am
- CTO of a small startup (5 amazing engineers)
- 20 years of coding (since I was 13)
- Ex-big tech
Comments URL: https://news.ycombinator.com/item?id=44572043
Points: 18
# Comments: 2
Created: 9h ago | Jul 15, 2025, 16:50:26
