Commit Graph

264 Commits

Author SHA1 Message Date
Zach
5caed073ee fix: topic model for embeddings 2023-04-09 15:12:49 +00:00
Zach Nussbaum
c11ea7dc31 fix: bs try one more time? 2023-04-08 21:47:07 +00:00
Zach Nussbaum
2827693a57 fix: smaller bs for 40gb 2023-04-08 21:36:20 +00:00
Zach Nussbaum
f5ca8bedcf fix: config for lora gptj 2023-04-08 21:17:12 +00:00
Zach Nussbaum
6de58dd1fc fix: saving name 2023-04-08 20:56:13 +00:00
Zach Nussbaum
305fe3d444 Merge remote-tracking branch 'origin/mosaic' into gptj 2023-04-08 20:47:01 +00:00
Zach Nussbaum
c1ca081cda fix: eos/pad token + wd 2023-04-08 20:38:10 +00:00
Zach Nussbaum
5de765410a fix: add wd + min lr to config 2023-04-08 20:37:51 +00:00
Zach Nussbaum
c0a9065032 fix: tokenization error 2023-04-08 20:33:51 +00:00
Zach Nussbaum
df29c62521 fix: config + ignore pkl 2023-04-08 20:33:02 +00:00
Zach Nussbaum
f479a37818 feat: build map script 2023-04-08 19:30:53 +00:00
Zach
2034f0c479 fix: embeddings instead of logits!!! 2023-04-08 17:05:40 +00:00
zanussbaum
fbb788e451 feat: lora gptj 2023-04-07 17:53:07 -04:00
zanussbaum
dcc5204352 fix: batch size 2023-04-07 17:41:45 -04:00
zanussbaum
aa8dd7a636 feat: update for mosaic 2023-04-07 16:54:29 -04:00
Zach Nussbaum
1b5e660476 fix: pyarrow filter 2023-04-07 19:04:19 +00:00
Andriy Mulyar
a5110c81d9 Updated roadmap and links. 2023-04-07 13:53:47 -04:00
Zach Nussbaum
f974ca651c fix: inference save shards 2023-04-07 16:23:34 +00:00
Andriy Mulyar
20b08568d7 Merge pull request #268 from MalikMAlna/dev
Slight cleanup
2023-04-07 10:50:56 -04:00
Andriy Mulyar
d06ba7fdd5 Merge pull request #267 from dte/patch-1
Update README.md
2023-04-07 10:50:27 -04:00
Andriy Mulyar
aa39e80833 Correct MD5 Hash 2023-04-07 10:50:02 -04:00
Andriy Mulyar
98da69119b Update README.md 2023-04-07 10:47:15 -04:00
Zach
573272ad69 fix: drop uneven batch size 2023-04-07 12:09:31 +00:00
Zach
d93f6c2ced fix: concat 2023-04-07 04:33:34 +00:00
Zach
57eb786756 fix: data for inference 2023-04-07 01:45:07 +00:00
Zach
3f9fea47d7 feat: inference for embedding plots 2023-04-07 01:40:39 +00:00
MalikMAlna
1195d09fba Rephrasing comment for clarity 2023-04-06 20:20:18 -04:00
MalikMAlna
17fb6f668a Changing single to double quotes for quote consistency 2023-04-06 20:07:08 -04:00
MalikMAlna
334f36d844 Slight cleanup of superfluous comment and space after commas 2023-04-06 19:57:46 -04:00
MalikMAlna
b8292dd7d0 Slight cleanup of superfluous comment and space after comma 2023-04-06 19:56:49 -04:00
Dillon Erb
2052f459ed Update README.md 2023-04-06 18:11:05 -04:00
Andriy Mulyar
f59ab5153f Merge pull request #129 from sagehawk/main
adds to README.md
2023-04-06 14:17:32 -04:00
Andriy Mulyar
7580f3204c Merge pull request #175 from chrismessina/patch-1
Update README.md
2023-04-06 14:17:05 -04:00
Andriy Mulyar
e1e27b4214 Merge pull request #208 from MalikMAlna/main
Fixing Small Punctuation and Capitalization Issues
2023-04-06 14:15:57 -04:00
Andriy Mulyar
e81d232b8a Merge pull request #260 from nomic-ai/license
Add MIT license.
2023-04-06 11:29:56 -04:00
Ben Schmidt
5fa260e18a Add MIT license. 2023-04-06 11:28:59 -04:00
Zach
8f2eb1a583 fix: grad accum loss calc 2023-04-06 12:11:10 +00:00
Zach
e4e88dff33 fix: data processing 2023-04-06 03:03:34 +00:00
Zach
c2fc164779 fix: configs 2023-04-05 20:42:35 +00:00
Zach
838b19bea5 fix: try except push 2023-04-05 20:42:22 +00:00
Andriy Mulyar
828a3a67fa Formatting Update 2023-04-05 14:10:00 -04:00
Andriy Mulyar
5977a650e6 Typescript and Langchain bindings 2023-04-05 13:24:47 -04:00
Andriy Mulyar
f90831180e Added MD5 signatures to ecosystem links. 2023-04-05 13:15:23 -04:00
Andriy Mulyar
873a558825 Typescript bindings link 2023-04-05 13:03:17 -04:00
Andriy Mulyar
d617c54a5e GPT4All Compatibility Ecosystem 2023-04-05 12:48:54 -04:00
Andriy Mulyar
467dc5bc7e Discord Link 2023-04-04 23:23:34 -04:00
Zach Nussbaum
885b7f1a3a feat: multinode setup 2023-04-05 02:53:04 +00:00
Zach Nussbaum
98ceac34e1 fix: gptj multinode 2023-04-05 02:52:44 +00:00
Zach Nussbaum
0c8666f635 fix: ignore env 2023-04-05 02:52:21 +00:00
Zach Nussbaum
9567f445b8 fix: only on first process, not once on every node 2023-04-05 02:36:22 +00:00