Hi HN! I was mesmerized by the Claude Computer Use reveal last week and was specifically impressed by how well it navigated websites. This motivated me to create Cerebellum, a library that lets an LLM take control of a browser.
Here is a demo of Cerebellum in action, performing the goal “Find a USB C to C cable that is 10 feet long and add it to cart” on amazon.com:
https://youtu.be/xaZbuaWtVkA?si=Tq9lE6BXv9wjZ-qC
Currently, it uses Claude 3.5 Sonnet’s newly released computer use ability, but the ultimate goal is to crowdsource a high quality set of browser sessions to train an open source local model.
Checkout the MIT licensed repo on github (https://github.com/theredsix/cerebellum) or install the library from npm (https://www.npmjs.com/package/cerebellum-ai)
Looking for feedback from the HN community, especially on: What browser tasks would you use an LLM to complete? Thanks again for taking a look!
Comments URL: https://news.ycombinator.com/item?id=42007491
Points: 10
# Comments: 4
Войдите, чтобы добавить комментарий
Другие сообщения в этой группе

Article URL: https://mistral.ai/news/le-chat-enterprise
Comments URL: https:
Article URL: https://pganalyze.com/blog/postgres-18-async-io

Kindle Comic Converter optimizes comics and manga for eink readers like Kindle, Kobo, ReMarkable, and more. Pages display in fullscreen without margins, with proper fixed layout support. Its main

