Showing 1-1 of 1 projects
A PyTorch implementation of the Proximal Policy Optimization (PPO) algorithm for training an AI agent to play Super Mario Bros.
Get weekly updates on trending AI coding tools and projects.