Commit Graph

872 Commits

Author SHA1 Message Date
Adam Treat
bd5e279621 Don't display the endoftext token. 2023-04-09 01:22:12 -04:00
Adam Treat
02e13737f3 Don't repeat the prompt in the response. 2023-04-09 01:11:52 -04:00
Adam Treat
0903da3afa Update README.md 2023-04-09 00:03:15 -04:00
Adam Treat
ed68a2cccb Update README.md 2023-04-09 00:01:42 -04:00
Adam Treat
dfe9d43c5e Update README.md 2023-04-08 23:54:25 -04:00
Adam Treat
04f7dbb395 Update README.md 2023-04-08 23:53:24 -04:00
Adam Treat
243972e3d8 Update README.md 2023-04-08 23:46:23 -04:00
adtreat
65837727a7 Update README.md 2023-04-08 23:41:49 -04:00
Adam Treat
ff2fdecce1 Initial commit. 2023-04-08 23:28:39 -04:00
Zach Nussbaum
7807a80bbb fix: bs try one more time? 2023-04-08 21:47:07 +00:00
Zach Nussbaum
2f0eba211d fix: smaller bs for 40gb 2023-04-08 21:36:20 +00:00
Zach Nussbaum
7f95ab3a06 fix: config for lora gptj 2023-04-08 21:17:12 +00:00
Zach Nussbaum
9efdf56e38 fix: saving name 2023-04-08 20:56:13 +00:00
Zach Nussbaum
633df8edb4 Merge remote-tracking branch 'origin/mosaic' into gptj 2023-04-08 20:47:01 +00:00
Zach Nussbaum
31195270cb fix: eos/pad token + wd 2023-04-08 20:38:10 +00:00
Zach Nussbaum
c82ee7d882 fix: add wd + min lr to config 2023-04-08 20:37:51 +00:00
Zach Nussbaum
be3f528810 fix: tokenization error 2023-04-08 20:33:51 +00:00
Zach Nussbaum
b66f127ade fix: config + ignore pkl 2023-04-08 20:33:02 +00:00
Zach Nussbaum
0606ab46b9 feat: build map script 2023-04-08 19:30:53 +00:00
Zach
1c6d2d9622 fix: embeddings instead of logits!!! 2023-04-08 17:05:40 +00:00
zanussbaum
147c2fd7eb feat: lora gptj 2023-04-07 17:53:07 -04:00
zanussbaum
2b001e8932 fix: batch size 2023-04-07 17:41:45 -04:00
zanussbaum
7cfda6a21f feat: update for mosaic 2023-04-07 16:54:29 -04:00
Zach Nussbaum
4b51e6ef37 fix: pyarrow filter 2023-04-07 19:04:19 +00:00
Andriy Mulyar
ed53fe1966 Updated roadmap and links. 2023-04-07 13:53:47 -04:00
Zach Nussbaum
7a9f6d1cdc fix: inference save shards 2023-04-07 16:23:34 +00:00
Andriy Mulyar
8e28a33731 Merge pull request #268 from MalikMAlna/dev: Slight cleanup 2023-04-07 10:50:56 -04:00
Andriy Mulyar
7d06b4cd23 Merge pull request #267 from dte/patch-1: Update README.md 2023-04-07 10:50:27 -04:00
Andriy Mulyar
c5d010f352 Correct MD5 Hash 2023-04-07 10:50:02 -04:00
Andriy Mulyar
d8cde6d272 Update README.md 2023-04-07 10:47:15 -04:00
Zach
0bd6acb4dd fix: drop uneven batch size 2023-04-07 12:09:31 +00:00
Zach
985da51fbc fix: concat 2023-04-07 04:33:34 +00:00
Zach
1b14b1f723 fix: data for inference 2023-04-07 01:45:07 +00:00
Zach
fb9ff9c40d feat: inference for embedding plots 2023-04-07 01:40:39 +00:00
MalikMAlna
43ddc3eefa Rephrasing comment for clarity 2023-04-06 20:20:18 -04:00
MalikMAlna
0689c2e974 Changing single to double quotes for quote consistency 2023-04-06 20:07:08 -04:00
MalikMAlna
604176ace8 Slight cleanup of superfluous comment and space after commas 2023-04-06 19:57:46 -04:00
MalikMAlna
b3be94a0ef Slight cleanup of superfluous comment and space after comma 2023-04-06 19:56:49 -04:00
Dillon Erb
416eaf1d28 Update README.md 2023-04-06 18:11:05 -04:00
Andriy Mulyar
dc08c43867 Merge pull request #129 from sagehawk/main: adds to README.md 2023-04-06 14:17:32 -04:00
Andriy Mulyar
50f7d09993 Merge pull request #175 from chrismessina/patch-1: Update README.md 2023-04-06 14:17:05 -04:00
Andriy Mulyar
283bfaad84 Merge pull request #208 from MalikMAlna/main: Fixing Small Punctuation and Capitalization Issues 2023-04-06 14:15:57 -04:00
Andriy Mulyar
1bbe9b6d6c Merge pull request #260 from nomic-ai/license: Add MIT license. 2023-04-06 11:29:56 -04:00
Ben Schmidt
9f69513d72 Add MIT license. 2023-04-06 11:28:59 -04:00
Zach
809680d621 fix: grad accum loss calc 2023-04-06 12:11:10 +00:00
Zach
7751f39432 fix: data processing 2023-04-06 03:03:34 +00:00
Zach
5baead45be fix: configs 2023-04-05 20:42:35 +00:00
Zach
a57adb0344 fix: try except push 2023-04-05 20:42:22 +00:00
Andriy Mulyar
2b2237adb2 Formatting Update 2023-04-05 14:10:00 -04:00
Andriy Mulyar
af1722760d Typescript and Langchain bindings 2023-04-05 13:24:47 -04:00