Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable. https://stackoverflow.blog/2024/05/24/would-you-board-a-plane-safety-tested-by-genai/
Chcete-li přidat komentář, přihlaste se
Ostatní příspěvky v této skupině
![Say goodbye to "junior" engineering roles](https://www.cdn5.niftycent.com/a/D/y/Y/X/m/l/say-goodbye-to-junior-engineering-roles.webp)
On today’s episode we chat with Kirimgeray Kirimli, a director at Flatiron Software and CEO of Snapshot Reviews, a tool that measure developer productivity based on activity from Github, Jira, standup
![The real 10x developer makes their whole team better](https://www.cdn5.niftycent.com/a/k/K/q/x/l/O/the-real-10x-developer-makes-their-whole-team-better.webp)
Single individuals make less of a difference to the success or failure of a technology project than you might think (and that’s a good thing). https://stackoverflow.blog/2024/06/19/the-real-10x-devel
![Enterprise 2024.4: Demonstrating and improving community impact](https://www.cdn5.niftycent.com/a/D/y/Y/X/w/9/enterprise-2024-4-demonstrating-and-improving-community-impact.webp)
In the latest Stack Overflow for Teams Enterprise release, you'll see reporting capabilities and insights that help demonstrate community impact. Microsoft customers can also rejoice: OverflowAI now i
![Making ETL pipelines a thing of the past](https://www.cdn5.niftycent.com/a/D/Z/3/X/J/K/making-etl-pipelines-a-thing-of-the-past.webp)
On today’s episode we chat with Cassandra Shum, VP of Field Engineering at RelationalAI, about her company’s efforts to create what it calls the industry’s first coprocessor for data clouds and langua
![The world’s most popular web framework is going AI native](https://www.cdn5.niftycent.com/a/e/5/p/0/G/2/the-world-s-most-popular-web-framework-is-going-ai-native.webp)
On today’s episode we chat with Jared Palmer, VP of AI at Vercel, who says the company has three key goals. First, support AI native web apps like ChatGPT and Claude. Second, use GenAI to make it easi
![A peek behind the curtain with Stack Overflow’s sales engineers](https://www.cdn5.niftycent.com/a/D/y/Y/R/j/O/a-peek-behind-the-curtain-with-stack-overflow-s-sales-engineers.webp)
In this episode, Alexa Montelibano and Tiago Torre, sales engineers at Stack Overflow, take you behind the scenes to show how customer feedback shapes our products, including OverflowAI. Alexa and Tia
![Generative AI Is Not Going To Build Your Engineering Team For You](https://www.cdn5.niftycent.com/a/D/Z/3/J/l/b/generative-ai-is-not-going-to-build-your-engineering-team-for-you.webp)
It’s easy to generate code, but not so easy to generate good code. https://stackoverflow.blog/2024/06/10/generative-ai-is-not-going-to-build-your-engineering-team-for-you/