Built RL for long-horizon agents – tested on 32x H100s but too poor to train

Article URL: https://github.com/Danau5tin/terminal-bench-rl

Comments URL: https://news.ycombinator.com/item?id=44721791

Points: 31

# Comments: 2

https://github.com/Danau5tin/terminal-bench-rl

Creado 1d | 29 jul 2025, 12:20:15

Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

PanamaPlaylists – Leaked Tech CEOs Spotify Profiles

PanamaPlaylists – Leaked Tech CEOs Spotify Profiles

Article URL: https://panamaplaylists.com/

Comments URL: https://news.ycombinator.com/item?

30 jul 2025, 13:50:43 | Hacker news

From XML to JSON to CBOR

From XML to JSON to CBOR

Article URL: https://cborbook.com/introduction/from_xml_to_json_to_cbor.html

Comments URL:

30 jul 2025, 13:50:42 | Hacker news

Moneybadger and Peach Payments partner to enable Bitcoin payments

Moneybadger and Peach Payments partner to enable Bitcoin payments

Article URL: https://bitcoinke.io/2025/07/moneybadger-peach-payments-partnership/

Comments URL:

30 jul 2025, 13:50:41 | Hacker news

U.S. intelligence intervened with DOJ to push HPE-Juniper merger

U.S. intelligence intervened with DOJ to push HPE-Juniper merger

Article URL: https://www.axios.com/2025/07/30/merger-hpe-juniper-networks-national-security

Comm

30 jul 2025, 13:50:39 | Hacker news

Figma S-1, the Figma OS, Figma's AI Potential

Figma S-1, the Figma OS, Figma's AI Potential

Article URL: https://stratechery.com/2025/figma-s-1-the-figma-os-figmas-ai-potential/

Comments URL:

30 jul 2025, 13:50:36 | Hacker news

SensorLM: Learning the Language of Wearable Sensors

SensorLM: Learning the Language of Wearable Sensors

Article URL: https://research.google/blog/sensorlm-learning-the-language-of-wearable-sensors/

30 jul 2025, 13:50:35 | Hacker news

How email tracking works behind the scenes

How email tracking works behind the scenes

Article URL: https://buttondown.com/blog/email-tracking-pixels-bugs

Comments URL:

30 jul 2025, 13:50:33 | Hacker news

Techie