Showing 1-1 of 1 projects
A Jupyter Notebook-based benchmark for evaluating the quality of large language models by having them play Street Fighter 3.
Get weekly updates on trending AI coding tools and projects.