Show HN: We made our own inference engine for Apple Silicon

We wrote our inference engine in Rust, and it is faster than llama.cpp in all use cases. Your feedback is very welcome. It is written from scratch with the idea that you can add support for any kernel and platform.

Comments URL: https://news.ycombinator.com/item?id=44570048

Points: 72

# Comments: 23

https://github.com/trymirai/uzu

Created 7h ago | 15.07.2025, 16:50:31

