Hi HN!
BLAST is a high-performance serving engine for browser-augmented LLMs, designed to make deploying web-browsing AI easy, fast, and cost-manageable.
The goal with BLAST is to ultimately achieve google search level latencies for tasks that currently require a lot of typing and clicking around inside a browser. We're starting off with automatic parallelism, prefix caching, budgeting (memory and LLM cost), and an OpenAI-Compatible API but have a ton of ideas in the pipe!
Website & Docs: https://blastproject.org/ https://docs.blastproject.org/
MIT-Licensed Open-Source: https://github.com/stanford-mast/blast
Hope some folks here find this useful! Please let me know what you think in the comments or ping me on Discord.
— Caleb (PhD student @ Stanford CS)
Comments URL: https://news.ycombinator.com/item?id=43872761
Points: 43
# Comments: 16
Connectez-vous pour ajouter un commentaire
Autres messages de ce groupe

Article URL: https://250bpm.substack.com/p/accountability-sinks
Article URL: https://oregoncapitalchronicle.com/2025/05/02/t

Article URL: https://www.vice.com/en/article/the-totalitarian-buddhist-who-beat-sim-city/
Comments

Article URL: https://arxiv.org/abs/2504.18920
Comments URL: https://news.ycombinator.c

Article URL: https://github.com/c1570/Connomore64
Comments URL: https://news.ycomb