I spent the last few days building out a nicer ChatGPT-like interface for using Mistral 7B and Llama 3 fully within the browser (no dependencies or installs).
I’ve used MLC AI’s WebLLM project for a while to interact with LLMs in the browser when handling sensitive data, but I found their UI quite lacking for serious use, so I built a much better interface around WebLLM.
I’ve been using it as a therapist and coach. And it’s wonderful knowing that my personal information never leaves my local computer.
It should work on desktop Chrome or Edge. Other browsers are adding WebGPU support as well - see the GitHub repo for details on getting it to work in other browsers.
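A page can check for WebGPU up front before trying to load a model. A minimal sketch (the `hasWebGPU` helper name is mine, not from the project, and it takes the navigator object as a parameter so it can be exercised outside a browser):

```javascript
// Returns true if the given navigator-like object exposes the WebGPU API.
// Browsers with WebGPU support expose it as navigator.gpu.
function hasWebGPU(nav) {
  return typeof nav === "object" && nav !== null && "gpu" in nav;
}

// In a real page you would call it with the global navigator, e.g.:
//   if (!hasWebGPU(navigator)) { /* show an "unsupported browser" notice */ }
```

This only detects that the API exists; actually acquiring a GPU adapter (`navigator.gpu.requestAdapter()`) can still fail on unsupported hardware.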
Note: after you send the first message, the model will be downloaded to your browser cache. That can take a while depending on the model and your internet connection. On subsequent page loads, the model is loaded from the IndexedDB cache, which is much faster.
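The download-then-cache flow amounts to roughly this sketch (a simplification I wrote for illustration: the Map stands in for the browser's IndexedDB store, and `fetchWeights` is a hypothetical stand-in for the real shard download; WebLLM's actual caching code differs):

```javascript
// Illustrative first-load vs cached-load behavior.
// cache: a Map standing in for the browser's IndexedDB store.
// fetchWeights: async function simulating the slow network download.
async function loadModel(modelId, cache, fetchWeights) {
  if (cache.has(modelId)) {
    // Subsequent page loads: read the weights straight from the cache.
    return { weights: cache.get(modelId), fromCache: true };
  }
  // First message ever: download the weights, then persist them
  // so the next page load skips the network entirely.
  const weights = await fetchWeights(modelId);
  cache.set(modelId, weights);
  return { weights, fromCache: false };
}
```

This is why only the very first load is slow: the expensive step is behind a cache check.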
The project is open source (Apache 2.0) on GitHub. If you like it, I’d love contributions, particularly around making the first load faster.
GitHub: https://github.com/abi/secret-llama Demo: https://secretllama.com
Comments URL: https://news.ycombinator.com/item?id=40252569
Points: 46
# Comments: 10