Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

Article URL: https://limit-of-rlvr.github.io/

Comments URL: https://news.ycombinator.com/item?id=43760625

Points: 12

# Comments: 3

Created 1mo ago | Apr 22, 2025, 13:40:21

Other posts in this group

Peer Programming with LLMs, for Senior+ Engineers

Article URL: https://pmbanugo.me/blog/peer-programming-with-llms

Comments URL:

May 24, 2025, 20:10:21 | Hacker News

I used o3 to find a remote zeroday in the Linux SMB implementation

Article URL: https://sean.heelan.io/2025/05

May 24, 2025, 20:10:20 | Hacker News

Use ramoops for logging under Linux (2021)

Article URL: https://embear.ch/posts/using-ramoops/

Comments URL: https://news.y

May 24, 2025, 20:10:19 | Hacker News

Exposed Industrial Control Systems and Honeypots in the Wild [pdf]

Article URL: https://gsmaragd.github.io/publications/EuroSP2025-ICS/EuroSP2025-ICS.pdf

Comments URL:

May 24, 2025, 20:10:19 | Hacker News

Lone coder cracks 50-year puzzle to find Boggle's top-scoring board

Article URL: https://www.ft.com/content/0ab64ced-1ed1-466d-acd3-78510d10c3a1

Comments URL:

May 24, 2025, 20:10:17 | Hacker News

Please Fund More Science (2020)

Article URL: https://blog.samaltman.com/please-fund-more-science

Comments URL:

May 24, 2025, 20:10:16 | Hacker News

What I discovered when I asked Amazon to tell me everything Alexa had heard

Article URL: https://www.theguardian.com/technology/2025/ma

May 24, 2025, 20:10:16 | Hacker News

Techie