Article URL: https://simonwillison.net/2025/Aug/8/surprise-deprecation-of-gpt-4o/
Comments URL: https://news.ycombinator.com/item?id=44839842
Points: 199
# Comments: 165
https://simonwillison.net/2025/Aug/8/surprise-deprecation-of-gpt-4o/

Comments URL: https://news.ycombinator.com/item?id=44839681
Points: 47
# Comments: 32
https://www.latimes.com/california/story/2025-08-08/mystery-plane-thief
Article URL: https://instavm.io/blog/building-my-offline-ai-workspace
Comments URL: https://news.ycombinator.com/item?id=44840013
Points: 185
# Comments: 53

Article URL: https://www.macrumors.com/2025/07/10/no-m5-macbook-pro-2025/
Comments URL: https://news.ycombinator.com/item?id=44840281
Points: 27
# Comments: 18
https://www.macrumors.com/2025/07/10/no-m5-macbook-pro-2025/

Article URL: https://github.com/alurm/json2dir
Comments URL: https://news.ycombinator.com/item?id=44840307
Points: 21
# Comments: 5

Article URL: https://stayathomemacro.substack.com/p/job-growth-has-slowed-sharply-the
Comments URL: https://news.ycombinator.com/item?id=44840428
Points: 56
# Comments: 47
https://stayathomemacro.substack.com/p/job-growth-has-slowed-sharply-the


Article URL: https://www.dbos.dev/blog/why-postgres-durable-execution
Comments URL: https://news.ycombinator.com/item?id=44840693
Points: 24
# Comments: 10
Sam said yesterday that chatgpt handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or painfully slow speeds.
Sure, they have huge GPU clusters, but there must be more going on - model optimizations, sharding, custom hardware, clever load balancing, etc.
What engineering tricks make this possible at such massive scale while keeping latency low?
Curious to hear insights from people who've built large-scale ML systems.
<

Article URL: https://inference.net/blog/what-s-left-is-distillation
Comments URL: https://news.ycombinator.com/item?id=44840746
Points: 18
# Comments: 1