Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

Article URL: https://limit-of-rlvr.github.io/

Comments URL: https://news.ycombinator.com/item?id=43760625

Points: 12

# Comments: 3

https://limit-of-rlvr.github.io/

Vytvořeno 16d | 22. 4. 2025 13:40:21

Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

Microservices are a tax your startup probably can't afford

Article URL: https://nexo.sh/posts/microservices-for-startups/

Comments URL:

8. 5. 2025 17:10:09 | Hacker news

20 years to give away virtually all my wealth

Article URL: https://www.gatesnotes.com/home/home-page-topic/reader/n20-years-to-giv

8. 5. 2025 17:10:08 | Hacker news

Huawei unveils laptop running self-developed HarmonyOS as Windows licence expire

Article URL: https://www.scmp.com/tech/big-tech/ar

8. 5. 2025 17:10:07 | Hacker news

Progress toward fusion energy gain as measured against the Lawson criteria

Article URL: https://www.fusionenergybase.co

8. 5. 2025 17:10:06 | Hacker news

Show HN: Checking Pope's election results with smoke test script for chimney

This Playwright test script uses AI to test if there's smoke coming out of the Sistine Chapel chimney and whether that smoke is white. The test only passes if the smoke is white.

Currently, set

8. 5. 2025 17:10:05 | Hacker news

High tariffs become 'real' with our first $36K bill

Article URL: https://blog.adafruit.com/2025/05/08/high-tariffs-become-real-with-our-first-36k-bill/

8. 5. 2025 17:10:05 | Hacker news

Show HN: Hypermode Model Router Preview – OpenRouter Alternative

Article URL: https://hypermode.com/blog/introducing-model-router

Comments URL:

8. 5. 2025 17:10:04 | Hacker news

Techie