Hi HN! I was mesmerized by the Claude Computer Use reveal last week and was specifically impressed by how well it navigated websites. This motivated me to create Cerebellum, a library that lets an LLM take control of a browser.
Here is a demo of Cerebellum in action, performing the goal “Find a USB C to C cable that is 10 feet long and add it to cart” on amazon.com:
https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC
Currently, it uses Claude 3.5 Sonnet’s newly released computer use ability, but the ultimate goal is to crowdsource a high-quality set of browser sessions to train an open-source local model.
Check out the MIT-licensed repo on GitHub (https://github.com/theredsix/cerebellum) or install the library from npm (https://www.npmjs.com/package/cerebellum-ai).
I'm looking for feedback from the HN community, especially on one question: what browser tasks would you use an LLM to complete? Thanks again for taking a look!
Comments URL: https://news.ycombinator.com/item?id=42007491
Points: 10
# Comments: 4