Commit Graph

759 Commits

Author SHA1 Message Date
Zach Nussbaum
633df8edb4 Merge remote-tracking branch 'origin/mosaic' into gptj 2023-04-08 20:47:01 +00:00
Zach Nussbaum
31195270cb fix: eos/pad token + wd 2023-04-08 20:38:10 +00:00
Zach Nussbaum
c82ee7d882 fix: add wd + min lr to config 2023-04-08 20:37:51 +00:00
Zach Nussbaum
be3f528810 fix: tokenization error 2023-04-08 20:33:51 +00:00
Zach Nussbaum
b66f127ade fix: config + ignore pkl 2023-04-08 20:33:02 +00:00
Zach Nussbaum
0606ab46b9 feat: build map script 2023-04-08 19:30:53 +00:00
Zach
1c6d2d9622 fix: embeddings instead of logits!!! 2023-04-08 17:05:40 +00:00
zanussbaum
147c2fd7eb feat: lora gptj 2023-04-07 17:53:07 -04:00
zanussbaum
2b001e8932 fix: batch size 2023-04-07 17:41:45 -04:00
zanussbaum
7cfda6a21f feat: update for mosaic 2023-04-07 16:54:29 -04:00
Zach Nussbaum
4b51e6ef37 fix: pyarrow filter 2023-04-07 19:04:19 +00:00
Andriy Mulyar
ed53fe1966
Updated roadmap and links. 2023-04-07 13:53:47 -04:00
Zach Nussbaum
7a9f6d1cdc fix: inference save shards 2023-04-07 16:23:34 +00:00
Andriy Mulyar
8e28a33731
Merge pull request #268 from MalikMAlna/dev
Slight cleanup
2023-04-07 10:50:56 -04:00
Andriy Mulyar
7d06b4cd23
Merge pull request #267 from dte/patch-1
Update README.md
2023-04-07 10:50:27 -04:00
Andriy Mulyar
c5d010f352
Correct MD5 Hash 2023-04-07 10:50:02 -04:00
Andriy Mulyar
d8cde6d272
Update README.md 2023-04-07 10:47:15 -04:00
Zach
0bd6acb4dd fix: drop uneven batch size 2023-04-07 12:09:31 +00:00
Zach
985da51fbc fix: concat 2023-04-07 04:33:34 +00:00
Zach
1b14b1f723 fix: data for inference 2023-04-07 01:45:07 +00:00
Zach
fb9ff9c40d feat: inference for embedding plots 2023-04-07 01:40:39 +00:00
MalikMAlna
43ddc3eefa Rephrasing comment for clarity 2023-04-06 20:20:18 -04:00
MalikMAlna
0689c2e974 Changing single to double quotes for quote consistency 2023-04-06 20:07:08 -04:00
MalikMAlna
604176ace8 Slight cleanup of superfluous comment and space after commas 2023-04-06 19:57:46 -04:00
MalikMAlna
b3be94a0ef Slight cleanup of superfluous comment and space after comma 2023-04-06 19:56:49 -04:00
Dillon Erb
416eaf1d28
Update README.md 2023-04-06 18:11:05 -04:00
Andriy Mulyar
dc08c43867
Merge pull request #129 from sagehawk/main
adds to README.md
2023-04-06 14:17:32 -04:00
Andriy Mulyar
50f7d09993
Merge pull request #175 from chrismessina/patch-1
Update README.md
2023-04-06 14:17:05 -04:00
Andriy Mulyar
283bfaad84
Merge pull request #208 from MalikMAlna/main
Fixing Small Punctuation and Capitalization Issues
2023-04-06 14:15:57 -04:00
Andriy Mulyar
1bbe9b6d6c
Merge pull request #260 from nomic-ai/license
Add MIT license.
2023-04-06 11:29:56 -04:00
Ben Schmidt
9f69513d72 Add MIT license. 2023-04-06 11:28:59 -04:00
Zach
809680d621 fix: grad accum loss calc 2023-04-06 12:11:10 +00:00
Zach
7751f39432 fix: data processing 2023-04-06 03:03:34 +00:00
Zach
5baead45be fix: configs 2023-04-05 20:42:35 +00:00
Zach
a57adb0344 fix: try except push 2023-04-05 20:42:22 +00:00
Andriy Mulyar
2b2237adb2
Formatting Update 2023-04-05 14:10:00 -04:00
Andriy Mulyar
af1722760d
Typescript and Langchain bindings 2023-04-05 13:24:47 -04:00
Andriy Mulyar
565cc1ece1
Added MD5 signatures to ecosystem links. 2023-04-05 13:15:23 -04:00
Andriy Mulyar
73b78f017b
Typescript bindings link 2023-04-05 13:03:17 -04:00
Andriy Mulyar
e16cbeb4b7
GPT4All Compatibility Ecosystem 2023-04-05 12:48:54 -04:00
Andriy Mulyar
1eeaa5c8ee
Discord Link 2023-04-04 23:23:34 -04:00
Zach Nussbaum
399a65e779 feat: multinode setup 2023-04-05 02:53:04 +00:00
Zach Nussbaum
0a3834d086 fix: gptj multinode 2023-04-05 02:52:44 +00:00
Zach Nussbaum
fde7d9506f fix: ignore env 2023-04-05 02:52:21 +00:00
Zach Nussbaum
97d4499d79 fix: only on first process, not once on every node 2023-04-05 02:36:22 +00:00
Zach Nussbaum
d0402288bd fix: eval func 2023-04-04 23:25:37 +00:00
Zach
65ec606f21 fix: prompt len for larger 2023-04-04 22:01:55 +00:00
Zach Nussbaum
df2d5f7e46 feat: gpt-j config 2023-04-04 20:58:08 +00:00
Zach Nussbaum
3efc19ebc5 feat: adamw, fix training, log gradients 2023-04-04 20:57:42 +00:00
Zach Nussbaum
5c5f41ba36 fix: clean up data, pad at end 2023-04-04 20:53:23 +00:00