Reinforcement Learning from Human Feedback (RLHF) in Notebooks

Article URL: https://github.com/ash80/RLHF_in_notebooks

Comments URL: https://news.ycombinator.com/item?id=44481066

Points: 6

# Comments: 0

https://github.com/ash80/RLHF_in_notebooks

Établi 4h | 6 juil. 2025, 15:10:03

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

Building a Mac app with Claude code

Building a Mac app with Claude code

Article URL: https://www.indragie.com/blog/i-shipped-a-macos-app-built-entirely-by-claude-code

<

6 juil. 2025, 17:20:12 | Hacker news

Hannah Cairo has solved the Mizohata-Takeuchi conjecture

Hannah Cairo has solved the Mizohata-Takeuchi conjecture

Article URL: https://english.elpais.com/science-tech/20

6 juil. 2025, 17:20:10 | Hacker news

Huawei cloned Qwen and DeepSeek models, claimed as own

Huawei cloned Qwen and DeepSeek models, claimed as own

Article URL: https://dilemmaworks.substack.com/p/whistleblower-huawei-cloned-and-renamed

Comments U

6 juil. 2025, 17:20:10 | Hacker news

Metriport (YC S22) is hiring engineers to improve healthcare data exchange

Metriport (YC S22) is hiring engineers to improve healthcare data exchange

Article URL: https://www.ycombinator.com/companies/metriport/jobs/Rn2Je8M-software-engineer

Comm

6 juil. 2025, 17:20:07 | Hacker news

Get the location of the ISS using DNS

Get the location of the ISS using DNS

Article URL: https://shkspr.mobi/blog/2025/07/get-the-location-of-the-iss-using-dns/

Comments URL:

6 juil. 2025, 15:10:05 | Hacker news

Stop killing games and the industry response

Stop killing games and the industry response

Article URL: https://blog.kronis.dev/blog/stop-killing-games

Comments URL:

6 juil. 2025, 15:10:04 | Hacker news

Two and a Half Years in GameDev

Two and a Half Years in GameDev

Article URL: https://smyachenkov.com/posts/two-and-half-years-in-gamedev/

Comments URL:

6 juil. 2025, 15:10:04 | Hacker news

Techie