DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL

Article URL: https://arxiv.org/abs/2501.12948

Comments URL: https://news.ycombinator.com/item?id=42823568

Points: 41

# Comments: 8


Created 6mo ago | 25 Jan 2025, 19:40:10

