Commit Graph

13 Commits

Author SHA1 Message Date
Zach Nussbaum
d7395ee37a Merge: main into gptj 2023-04-13 15:16:31 +00:00
Zach Nussbaum
b1e361882d fix: multi-turn data breaks 2023-04-12 03:51:29 +00:00
Zach Nussbaum
c0a9065032 fix: tokenization error 2023-04-08 20:33:51 +00:00
Zach
573272ad69 fix: drop uneven batch size 2023-04-07 12:09:31 +00:00
Zach
57eb786756 fix: data for inference 2023-04-07 01:45:07 +00:00
Zach
e4e88dff33 fix: data processing 2023-04-06 03:03:34 +00:00
Zach
8dd99cc00a fix: prompt len for larger 2023-04-04 22:01:55 +00:00
Zach Nussbaum
c68311810a fix: clean up data, pad at end 2023-04-04 20:53:23 +00:00
Zach Nussbaum
668c71dc90 Update data.py 2023-03-28 21:13:05 -07:00
Zach Nussbaum
1a95f68494 fix: just read from watermark file 2023-03-27 17:30:44 +00:00
Zach Nussbaum
bb28929305 fix: eos conditional, watermark 2023-03-27 16:29:43 +00:00
Zach Nussbaum
eac7734cbf fix: add eos 2023-03-26 17:45:31 +00:00
Zach Nussbaum
723a50bdf1 feat: train and clean data 2023-03-25 16:17:48 +00:00