Show HN: Beating Pokemon Red with RL and 10M Parameters

Hi everyone!

After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!

Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

# Comments: 26

https://drubinstein.github.io/pokerl/

Created 6mo ago | 5 Mar 2025, 20:20:12
