Merge pull request #174 from waybarrios/fixing_data_bug

DatasetDict to dataset object.
Zach Nussbaum 2023-04-03 17:34:23 -04:00 committed by GitHub
commit e6cd5fd04d

@@ -68,7 +68,7 @@ def load_data(config, tokenizer):
         dataset = load_dataset("json", data_files=files, split="train")
     else:
-        dataset = load_dataset(dataset_path)
+        dataset = load_dataset(dataset_path,split='train')
     dataset = dataset.train_test_split(test_size=.05, seed=config["seed"])
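
For context on why the one-line change matters: without a `split` argument, `load_dataset` returns a `DatasetDict` (a dict keyed by split name), which has no `train_test_split` method, while passing `split='train'` returns a single `Dataset` that does. The sketch below illustrates that difference with a defensive unwrapping step. It is not part of this commit: `as_single_split` and `StubDataset` are hypothetical names introduced here, and the stub only stands in for `datasets.Dataset` so the example runs without the library installed.

```python
from collections.abc import Mapping


def as_single_split(loaded, split="train"):
    """Hypothetical helper: return one split whether `loaded` is a
    mapping of splits (like a DatasetDict) or already a single dataset."""
    # A DatasetDict is a dict subclass keyed by split name.
    if isinstance(loaded, Mapping):
        if split not in loaded:
            raise KeyError(f"split {split!r} not found; available: {sorted(loaded)}")
        return loaded[split]
    return loaded


class StubDataset:
    """Stand-in for datasets.Dataset (assumption: used only for this sketch)."""

    def __init__(self, rows):
        self.rows = list(rows)

    def train_test_split(self, test_size, seed=None):
        # Simplified tail split; the real method shuffles using `seed`.
        n_test = max(1, round(len(self.rows) * test_size))
        return {
            "train": StubDataset(self.rows[:-n_test]),
            "test": StubDataset(self.rows[-n_test:]),
        }


# Shape returned when no split is requested: a mapping of split name to dataset.
loaded = {"train": StubDataset(range(100))}

# Calling loaded.train_test_split(...) here would raise AttributeError,
# which is the failure the commit avoids by requesting split='train'.
dataset = as_single_split(loaded)
splits = dataset.train_test_split(test_size=0.05, seed=42)
```

Requesting the split directly in `load_dataset`, as the commit does, is the simpler fix when the split name is known in advance; a helper like this is only worth considering if a loader may return either shape.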