How Hebbia is building AI for in-depth research

A New York-based AI startup called Hebbia says it’s developed techniques that let AI answer questions about massive amounts of data without merely regurgitating what it’s read or, worse, making up information.

To help generative AI tools answer questions beyond the information in their training data, AI companies have recently used a technique called retrieval-augmented generation, or RAG. When users ask a question, RAG-powered AI typically uses a search-engine-style system to locate relevant information it has access to, whether that’s on the web or in a private database. 

Then, that information is fed to the underlying AI model along with the user’s query and any other instructions, so it can use it to formulate a response.

The problem, says Hebbia CEO George Sivulka, is that RAG can get too bogged down in keyword matches to focus on answering a user’s actual question. For instance, if an investor asks a RAG-powered system whether a particular company looks like a good investment, the search process might surface parts of the business’s financial filings using that kind of language, like favorable quotes from the CEO, rather than conducting an in-depth analysis based on criteria for picking a stock.

“Traditional RAG is good at answering questions that are in the data, but it fails for questions that are about the data,” Sivulka says.

It’s a problem that surfaces in general-purpose AI-powered search engines, which can confidently regurgitate satire, misinformation, or off-topic information that matches a query, as well as special-purpose tools. One recent test of a legal AI tool, where the system was asked to find notable opinions by a made-up judge, found it highlighted a case involving a party with a similar name. 

And, according to Hebbia, questions that require an analysis of a big data set that goes beyond finding relevant documents often can’t be answered by RAG alone.

Hebbia, says Sivulka, has approached the problem with a technique the company calls iterative source decomposition. That method identifies relevant portions of a data set or collection of documents using an actual AI model rather than mere keyword or textual similarity matching, feeding what it finds into a nested network of AI models that can analyze portions of the data and intermediate results together. Then, the system can ultimately come up with a comprehensive answer to a question.

“Technically, what it’s doing is running a LLM over every token that matters and then using that to feed that into another model, that feeds it into another model, and so it recurses all the way up to the top,” Sivulka says.

Hebbia’s nested processing techniques also help overcome limitations with AI context windows, which limit the amount of information that can be provided to a language model in one query, he says. 

Hebbia announced a $130 million Series A funding round in July and claims clients like the U.S. Air Force, law firm Gunderson Dettmer, and private equity firms Charlesbank and Cinven. Sivulka says various clients harness the company’s technology to answer complex questions about financial data for potential investments or to search for valuable information buried deep in voluminous legal discovery data sets, among numerous other use cases. The tool can also be configured to notify users when new data enables it to draw new conclusions, such as when new financial filings by a particular company appear online, Sivulka says. 

Demand is high enough that Hebbia, which uses models from big providers like OpenAI and Anthopic, has developed its own software to maximize the number of queries it can send to different models. Its system even takes into account varying rate limits for, say, access to GPT offered by Microsoft and by OpenAI directly. 

And, says Sivulka, demand continues to increase, as customers use Hebbia’s software to consider larger data sets than previously would have been possible in making decisions. The company says it’s handled 250 million queries from users so far this year, compared to 100 million last year.

“If you’re an investor and you’re building a case to invest or not to invest, you can look at—and corroborate your assumptions with—way more data,” Sivulka says.

https://www.fastcompany.com/91307847/how-hebbia-is-building-ai-for-in-depth-research?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 4mo | Mar 31, 2025, 10:40:03 AM


Login to add comment

Other posts in this group

RushTok is back. TikTok still can’t get enough of sorority recruitment

The internet’s favorite programming is back on: #RushTok season is officially upon us. 

If this is your first time tuning in, “rush” is the informal name for the recruitment process

Aug 7, 2025, 7:10:02 AM | Fast company - tech
Instagram launches map feature. It looks a lot like Snap Map

Location sharing among friends, family, and significant others has quietly become the norm in recent years.

Now Instagram is looking for a piece of the action with the launch of a

Aug 7, 2025, 12:10:05 AM | Fast company - tech
WhatsApp removes 6.8 million accounts linked to scam centers

WhatsApp has taken down 6.8 million accounts that were “linked to criminal scam centers” target

Aug 6, 2025, 9:40:06 PM | Fast company - tech
Google wants you to be a citizen data scientist

For more than a decade, enterprise teams bought into the promise of business intelligence platforms delivering “decision-making at the speed of thought.” But most discovered the opposite: slow-mov

Aug 6, 2025, 7:30:04 PM | Fast company - tech
Apple to invest another $100 billion in the U.S.

President Donald Trump on Wednesday is expected to celebrate at the White House a commitment by

Aug 6, 2025, 7:30:03 PM | Fast company - tech
Character.AI launches social feed to let users interact, create, and share with AI personas

Character.AI is going social, adding an interactive feed to its mobile apps. 

Rolled out on Monday, the new social feed may initially look similar

Aug 6, 2025, 5:10:05 PM | Fast company - tech
Exclusive: Google Gemini adds AI tutoring, heating up the fight for student users

Just in time for the new school year, Google has introduced a tool called Guided Learning within its Gemini chatbot. Unlike tools that offer instant answers, Guided Learning breaks down complex pro

Aug 6, 2025, 5:10:04 PM | Fast company - tech