Direct Nash Optimization: Teaching language models to self-improve

Article URL: https://arxiv.org/abs/2404.03715

Comments URL: https://news.ycombinator.com/item?id=39972800

Points: 30

# Comments: 6


Posted 2mo ago | April 8, 2024 at 21:10:11


Other posts from this group

Python's many command-line utilities

Article URL: https://www.pythonmorsels.com/cli-tools/

Comments URL: https://ne

June 4, 2024 at 00:20:13 | Hacker News

Scientists should use AI as a tool, not an oracle

Article URL: https://www.aisnakeoil.com/p/scientists-should-use-ai-as-a-tool

Comments URL:

June 4, 2024 at 00:20:08 | Hacker News

If English was written like Chinese (1999)

Article URL: https://zompist.com/yingzi/yingzi.htm

Comments URL: https://news.yco

June 3, 2024 at 22:10:07 | Hacker News

Sam Altman, Lately

Article URL: http://oftheclock.com/sam-altman-lately

Comments URL: https://news

June 3, 2024 at 22:10:06 | Hacker News

Crooks threaten to leak 3B personal records 'stolen from background check firm'

Article URL: https://www.theregister.com/2024/06/03/usdod_data_dump/

Comments URL:

June 3, 2024 at 22:10:05 | Hacker News

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Article URL: https://arxiv.org/abs/2405.20233

Comments URL: https://news.ycombinator.c

June 3, 2024 at 22:10:05 | Hacker News

SnapMagic (YC S15), the AI copilot for electronics, is hiring a PM

Article URL: https://careers.snapmagic.com/o/technical-project-manager

Comments URL:

June 3, 2024 at 22:10:04 | Hacker News
