Hi everyone!
After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26