Show HN: Luminal – Open-source, search-based GPU compiler

Hi HN, I’m Joe. My friends Matthew, Jake and I are building Luminal (https://luminalai.com/), a GPU compiler for automatically generating fast GPU kernels for AI models. It uses search-based compilation to achieve high performance.

We take high level model code, like you'd have in PyTorch, and generate very fast GPU code. We do that without using LLMs or AI - rather, we pose it as a search problem. Our compiler builds a search space, generates millions of possible kernels, and then searches through it to minimize runtime.

You can try out a demo in `demos/matmul` on mac to see how Luminal takes a naive operation, represented in our IR of 12 simple operations, and compiles it to an optimized, tensor-core enabled Metal kernel. Here’s a video showing how: https://youtu.be/P2oNR8zxSAA

Our approach differs significantly from traditional ML libraries in that we ahead-of-time compile everything, generate a large search space of logically-equivalent kernels, and search through it to find the fastest kernels. This allows us to leverage the Bitter Lesson to discover complex optimizations like Flash Attention entirely automatically without needing manual heuristics. The best rule is no rule, the best heuristic is no heuristic, just search everything.

We’re working on bringing CUDA support up to parity with Metal, adding more flexibility to the search space, adding full-model examples (like Llama), and adding very exotic hardware backends.

We aim to radically simplify the ML ecosystem while improving performance and hardware utilization. Please check out our repo: https://github.com/luminal-ai/luminal and I’d love to hear your thoughts!


Comments URL: https://news.ycombinator.com/item?id=44963135

Points: 28

# Comments: 4

https://github.com/luminal-ai/luminal

Creado 2d | 20 ago 2025, 17:30:45


Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

Show HN: Changefly ID + Anonymized Identity and Age Verification

Hey HN! I’m Lukas Dickie the founder of Changefly and I’m truly excited to share with you our latest release of Changefly ID with Anonymized Identity & Age Verification.

By putting privacy first

22 ago 2025, 2:10:15 | Hacker news
From GPT-4 to GPT-5: Measuring Progress in Medical Language Understanding [pdf]

I recently worked on running a thorough healthcare eval on GPT-5. The results show a (slight) regression in GPT-5 performance compared to GPT-4 era models.

I found this to be an interesting find

22 ago 2025, 2:10:09 | Hacker news