reflexion-human-eval/benchmarks
elleven11 30c6c5d2e9 sample of 30 1 year ago
Name                          Last commit                        Updated
.DS_Store                     Leetcode Hard: Python3 Benchmark   2 years ago
humaneval-py.jsonl.gz         .                                  2 years ago
humaneval-py_sample30.jsonl   sample of 30                       1 year ago
humaneval-rs.jsonl            .                                  2 years ago
leetcode-hard-py.jsonl        Leetcode Hard: Python3 Benchmark   2 years ago
mbpp-py.jsonl                 .                                  2 years ago
mbpp-rs.jsonl                 validate rs                        2 years ago