reflexion-human-eval/programming_runs
cassanof 49485efc8f undo 9 months ago
benchmarks better messages 9 months ago
executors upd 9 months ago
generators undo 9 months ago
human-eval start v2 12 months ago
lazzzy@404c06a5bf reinit submodules 12 months ago
root add logs 12 months ago
dataset_random_sample.py start v2 12 months ago
evaluate_leet_results.py start v2 12 months ago
evaluate_rs_leet_results.py start v2 12 months ago
generate_dataset.py start v2 12 months ago
humaneval_result_sort.py start v2 12 months ago
immediate_refinement.py added model class 11 months ago
immediate_reflexion.py added model class 11 months ago
main.py added model class 11 months ago
reflexion.py resume with success count 9 months ago
reflexion_ucs.py added model class 11 months ago
requirements.txt fix literal import 9 months ago
run_immediate_refinement.sh start v2 12 months ago
run_immediate_reflexion.sh start v2 12 months ago
run_reflexion.sh better messages 9 months ago
run_reflexion_codellama_multi.sh exec 9 months ago
run_reflexion_py_leet.sh start v2 12 months ago
run_reflexion_rs_leet.sh start v2 12 months ago
run_reflexion_starchat_multi.sh scripts 10 months ago
run_reflexion_startchat.sh change name 11 months ago
run_reflexion_ucs.sh add runscripts 12 months ago
run_seq.sh scripts 10 months ago
run_simple.sh better messages 9 months ago
run_simple_codellama.sh code llaam 9 months ago
run_simple_py_leet.sh start v2 12 months ago
run_simple_rs_leet.sh start v2 12 months ago
run_simple_starchat_multi.sh scripts 10 months ago
run_testacc.sh start v2 12 months ago
simple.py better messages 9 months ago
test_acc.py start v2 12 months ago
utils.py fix 9 months ago
validate_py_results.py start v2 12 months ago
validate_rs_results.py start v2 12 months ago