Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs

Article URL: https://www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus/

Comments URL: https://news.ycombinator.com/item?id=44819968

Points: 33

# Comments: 1

https://www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus/

Établi 7h | 7 août 2025, 05:10:08

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

About AI

Article URL: https://priver.dev/blog/ai/about-ai/

Comments URL: https://news.ycomb

7 août 2025, 12:10:10 | Hacker news

New AI Coding Teammate: Gemini CLI GitHub Actions

New AI Coding Teammate: Gemini CLI GitHub Actions

Article URL: https://blog.google/technology/developers/introducing-gemini-cli-github-actions/

7 août 2025, 12:10:09 | Hacker news

"I closed MPEG on 2 Jun '20 when I left because obscure forces had hijacked it."

"I closed MPEG on 2 Jun '20 when I left because obscure forces had hijacked it."

Article URL: https://leonardo.chiariglione.org/

Comments URL: https://news.ycombinat

7 août 2025, 12:10:07 | Hacker news

Schools are using AI surveillance to protect students. Sometimes arresting them

Schools are using AI surveillance to protect students. Sometimes arresting them

Article URL: https://apnews.com/article/ai-school-surveillance-gaggle-goguardian

7 août 2025, 12:10:07 | Hacker news

The Emperor's New Trade Deal – Paul Krugman

The Emperor's New Trade Deal – Paul Krugman

Article URL: https://paulkrugman.substack.com/p/the-emperors-new-trade-deal

Comments URL:

7 août 2025, 12:10:06 | Hacker news

"AI hype" is the true AI product

"AI hype" is the true AI product

Article URL: https://hardresetmedia.substack.com/p/machine-learning-expert-ai-hype-is

Comments URL:

7 août 2025, 12:10:05 | Hacker news

OpenAI's new GPT-5 models announced early by GitHub

OpenAI's new GPT-5 models announced early by GitHub

Article URL: https://www.theverge.com/news/752091/openai-gpt-5-model-announcement-github-leak

7 août 2025, 09:40:13 | Hacker news

Techie