Would you board a plane safety-tested by GenAI?

Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable. https://stackoverflow.blog/2024/05/24/would-you-board-a-plane-safety-tested-by-genai/

Created 1y | May 24, 2024, 5:50:06 AM


Login to add comment

Other posts in this group

Learn like a lurker: Gen Z’s digital-first lifestyle and the future of knowledge

As a generation characterized as "digital natives," the way Gen Z interacts with and consumes knowledge is rooted in their desire for instant gratification and personalization. How will this affect th

Jun 18, 2025, 2:50:09 PM | StackOverflow blog
After 30 years, Java is still brewing up new features

It’s Java’s 30th anniversary! Ryan welcomes back Georges Saab,  Senior VP of Development for the Java Platform Group and Chair of the OpenJDK Governing Board, to reflect on Java’s changes over the las

Jun 17, 2025, 6:30:08 AM | StackOverflow blog
“We’re not worried about compute anymore”: The future of AI models

Ryan Donovan and Ben Popper sit down with Jamie de Guerre, SVP of Product at Together AI, to discuss the evolving landscape of AI and open-source models. They explore the significance of infrastructur

Jun 13, 2025, 5:10:08 AM | StackOverflow blog
Better vibes and vibe coding with Gemini 2.5

Ryan and Ben welcome Tulsee Doshi and Logan Kilpatrick from Google's DeepMind to discuss the advanced capabilities of the new Gemini 2.5, the importance of feedback loops for model improvement and red

Jun 10, 2025, 5:20:08 AM | StackOverflow blog
Banking on a serverless world

Kathleen Vignos, VP of Software Engineering at Capital One, sits down with Ryan to explore shifting to 100% serverless architecture in enterprise, deploying talent for better customer experience, and

Jun 6, 2025, 6:20:07 AM | StackOverflow blog