ETH Zurich and EPFL to release an LLM developed on public infrastructure




Other posts in this group

jank is C++
11 Jul 2025, 20:10:34 | Hacker News
Show HN: RULER – Easily apply RL to any agent

Hey HN, Kyle here, one of the co-founders of OpenPipe.

Reinforcement learning is one of the best techniques for making agents more reliable, and has been widely adopted by frontier labs. However

11 Jul 2025, 20:10:33 | Hacker News