138 Commits (7e8b29a5bb93073b155a0011c3c4aa6da2f1bdbd)

Author SHA1 Message Date
Noah Shinn 7e8b29a5bb remove last of challenge set for alfworld 1 year ago
Noah Shinn be94bc418b fix code block instruction always python 1 year ago
Noah Shinn 510ed0008e fixes #19 1 year ago
Noah Shinn 59c84d9854 add code block instruction 1 year ago
Beck LaBash f2720b347a Implement AnyOpenAILLM for use across completion and chat endpoints (#20) 1 year ago
Noah Shinn d0b997e181 add parser 1 year ago
Noah Shinn 851b46779c parse code blocks 1 year ago
Noah Shinn e085b08de5 fix literal import 1 year ago
Noah Shinn f5ac5200e9 requirements for alfworld 1 year ago
Noah Shinn 8a2aa8afb8 alfworld chat 1 year ago
Noah Shinn ff7bbeb22b better messages 1 year ago
Noah Shinn e5c64d96f0 better messages 1 year ago
cassanof af69490a59 scripts 1 year ago
cassanof 8a03029c34 todo 1 year ago
cassanof a8e13b1b0f fix simple 1 year ago
Federico Cassano 0e45c6a115 Merge pull request #15 from noahshinn024/starchat 1 year ago
    Merge Starchat into main
Noah Shinn 807a06578c todo file 1 year ago
Noah Shinn 1c7367fb1c Start code parsing and instruction 1 year ago
    TODO:
    - remove func signature during evaluation
    - edit prompts for rust
    - add parse_rust_code
cassanof b9d2c54114 temp fix 1 year ago
cassanof a9d34708ad use right dtype 1 year ago
cassanof 42bcfe7c23 change name 1 year ago
cassanof 98bd65153a rem dangling import 1 year ago
cassanof 020e32f7bf fix ciruclar 1 year ago
cassanof e60072c524 move gen into class 1 year ago
cassanof 97d5190a7c reqs for starchat? 1 year ago
cassanof af90f4444d added model class 1 year ago
cassanof dbfc7c6a4f runner 1 year ago
Shunyu Yao f27481d8a3 Update README.md 1 year ago
Noah Shinn 9a9d7d8b2f demos 1 year ago
Noah Shinn 0f7a737015 add runscripts 1 year ago
Noah Shinn 6a8b75ccdd update citation 1 year ago
Beck LaBash d876c4cdb4 Put HotPotQA on top 1 year ago
Beck LaBash c2159d4b93 NBs and README 1 year ago
Beck LaBash e531a5c0d6 Organize notebooks 1 year ago
Noah Shinn 4924ce40f2 add logs 1 year ago
elleven11 245fd11901 move benchmarks to their place 1 year ago
Noah Shinn b6a324f78a start run instructions 1 year ago
Noah Shinn 34ab94a3b3 start run instructions 1 year ago
Beck LaBash 5942b44c41 HotPotQA runs 1 year ago
Noah Shinn 5269ef4ae0 start v2 1 year ago
elleven11 970c487d97 reinit submodules 1 year ago
elleven11 a98e92b20a reset submodule 1 year ago
Noah Shinn 4e42b24dab start v2 1 year ago
Noah Shinn 878a144a66 alfworld and webshop 1 year ago
Noah Shinn 3148695707 note about paper 1 year ago
Noah Shinn a0162a065d update leetcode hard gym link 1 year ago
Noah Shinn d2cdf66bc2 leetcode-hard gym repo 1 year ago
Noah Shinn 9a71c64882 leetcode-hard gym repo 1 year ago
Beck LaBash 5b6a1bd990 Merge branch 'py-prompts' 1 year ago
Beck LaBash 1eb65193d9 Lazy imports for leetcode 1 year ago