Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs

Article URL: https://www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus/

Comments URL: https://news.ycombinator.com/item?id=44819968

Points: 33

# Comments: 1

https://www.baseten.co/blog/sota-performance-for-gpt-oss-120b-on-nvidia-gpus/

Created 4h | Aug 7, 2025, 5:10:08 AM

Login to add comment

Other posts in this group

We replaced passwords with something worse

We replaced passwords with something worse

Article URL: https://blog.danielh.cc/blog/passwords

Comments URL: https://news.y

Aug 7, 2025, 7:30:06 AM | Hacker news

How ChatGPT spoiled my semester (2024)

How ChatGPT spoiled my semester (2024)

Article URL: https://benborgers.com/chatgpt-semester

Comments URL: https://news

Aug 7, 2025, 7:30:05 AM | Hacker news

Researchers Uncover RCE Attack Chains in HashiCorp Vault and CyberArk Conjur

Researchers Uncover RCE Attack Chains in HashiCorp Vault and CyberArk Conjur

Article URL: https://www.csoonline.com/article/4035274/resear

Aug 7, 2025, 7:30:04 AM | Hacker news

Cracking the Vault: How we found zero-day flaws in HashiCorp Vault

Cracking the Vault: How we found zero-day flaws in HashiCorp Vault

Article URL: https://cyata.ai/blog/cracking-the-vaul

Aug 7, 2025, 7:30:04 AM | Hacker news

Mac history echoes in current Mac operating systems

Mac history echoes in current Mac operating systems

Article URL: http://tenfourfox.blogspot.com/2025/08/mac-history-echoes-in-mac-operating.html

Co

Aug 7, 2025, 5:10:09 AM | Hacker news

FDA approves eye drops that fix near vision without glasses

FDA approves eye drops that fix near vision without glasses

Article URL: https://newatlas.com/aging/age-related-near-sighted-drops-vizz/

Comments URL:

Aug 7, 2025, 5:10:08 AM | Hacker news

Show HN: Rust framework for advanced file recognition and identification

Show HN: Rust framework for advanced file recognition and identification

Alternative to magic.h and infer. Zero dependencies. Fully extensible. Works in no_std, async, and embedded contexts.

Comments URL:

Aug 7, 2025, 5:10:07 AM | Hacker news

Techie