Pinned
-
sokobench (Public)
Evaluation framework designed to test the 2D spatial awareness and planning capabilities of large language models (LLMs). SokoBench measures a model's ability to map symbolic state representations to effective spatial a…
Python
-
atcoder-cot (Public)
Code to generate the dataset found at https://huggingface.co/datasets/Nan-Do/atcoder_cot
Python

