Built RL for long-horizon agents – tested on 32x H100s but too poor to train

Article URL: https://github.com/Danau5tin/terminal-bench-rl

Comments URL: https://news.ycombinator.com/item?id=44721791

Points: 31

# Comments: 2

Created 1d | Jul 29, 2025, 12:20:15 PM

Other posts in this group

PanamaPlaylists – Leaked Tech CEOs Spotify Profiles

Article URL: https://panamaplaylists.com/

Comments URL: https://news.ycombinator.com/item?

Jul 30, 2025, 1:50:43 PM | Hacker news

From XML to JSON to CBOR

Article URL: https://cborbook.com/introduction/from_xml_to_json_to_cbor.html

Jul 30, 2025, 1:50:42 PM | Hacker news

Moneybadger and Peach Payments partner to enable Bitcoin payments

Article URL: https://bitcoinke.io/2025/07/moneybadger-peach-payments-partnership/

Jul 30, 2025, 1:50:41 PM | Hacker news

U.S. intelligence intervened with DOJ to push HPE-Juniper merger

Article URL: https://www.axios.com/2025/07/30/merger-hpe-juniper-networks-national-security

Jul 30, 2025, 1:50:39 PM | Hacker news

Figma S-1, the Figma OS, Figma's AI Potential

Article URL: https://stratechery.com/2025/figma-s-1-the-figma-os-figmas-ai-potential/

Jul 30, 2025, 1:50:36 PM | Hacker news

SensorLM: Learning the Language of Wearable Sensors

Article URL: https://research.google/blog/sensorlm-learning-the-language-of-wearable-sensors/

Jul 30, 2025, 1:50:35 PM | Hacker news

How email tracking works behind the scenes

Article URL: https://buttondown.com/blog/email-tracking-pixels-bugs

Jul 30, 2025, 1:50:33 PM | Hacker news
