Hi HN! I was mesmerized by the Claude Computer Use reveal last week and was specifically impressed by how well it navigated websites. This motivated me to create Cerebellum, a library that lets an LLM take control of a browser.
Here is a demo of Cerebellum in action, performing the goal “Find a USB C to C cable that is 10 feet long and add it to cart” on amazon.com:
https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC
Currently, it uses Claude 3.5 Sonnet’s newly released computer use ability, but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.
Checkout the MIT licensed repo on github (https://github.com/theredsix/cerebellum) or install the library from npm (https://www.npmjs.com/package/cerebellum-ai)
Looking for feedback from the HN community, especially on: What browser tasks would you use an LLM to complete? Thanks again for taking a look!
Comments URL: https://news.ycombinator.com/item?id=42007491
Points: 10
# Comments: 4
Login to add comment
Other posts in this group
Article URL: https://telcontar.net/Misc/GUI/RISCOS/
Comments URL: https://news.y
Article URL: https://fabiensanglard.net/2168/index.html
Comments URL: https:

Hey HN! We’ve just open-sourced model2vec-rs, a Rust crate for loading and running Model2Vec static embedding models with zero Python dependency. This allows you to embed text at (very) high throu

Buckaroo is my open source project. It is a dataframe viewer that has the basic features we expect in a modern table - scroll, search, sort. In addition there are summary stats, and histograms ava

Article URL: https://muse.jhu.edu/article/2832
Comments URL: https://news.ycombinator

I built this project as a way to learn more about NLP by applying it to something weird and unsolved.
The Voynich Manuscript is a 15th-century book written in an unknown script. No one’s been ab