# llama-dl
[HN discussion](https://news.ycombinator.com/item?id=35026902) | [Twitter announcement](https://twitter.com/theshawwn/status/1632238214529400832)
**UPDATE (2:43 AM CST)**: Facebook has closed off this download vector. I'm currently mirroring the model to Cloudflare R2, and I'll update the script to use it right now; I'll keep you updated as I go. Check back in like... an hour?
**UPDATE (3:58 AM CST)**: I've mirrored everything to R2 and updated the script to point to it. Note that the download command has changed (it uses a new version of the bash script), so you'll need to re-copy it from this README. The safety guarantees are the same in the end, and the bandwidth is still around 36MB/s, which isn't too bad. I'm honestly too tired to update the rest of the README to reflect this slowdown; I'll just leave it the way it was for tonight. Please tweet on the [announcement thread](https://twitter.com/theshawwn/status/1632238214529400832) if anything breaks again, and I'll fix it again. </passes out>
This repository contains a high-speed download of LLaMA, Facebook's 65B parameter model that was recently made available via torrent. (Discussion: [Facebook LLAMA is being openly distributed via torrents](https://news.ycombinator.com/item?id=35007978))
It downloads all model weights (7B, 13B, 30B, 65B) at around 200 MB/s:
![image](https://user-images.githubusercontent.com/59632/222940196-e763d8a0-2282-4f78-8bbe-14c559eea90f.png)
```
real 19m21.173s
user 3m30.473s
sys 2m30.847s
```
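As a sanity check (my arithmetic, not part of the original measurement), that wall-clock time is consistent with the ~200 MB/s figure:
```sh
# 235,164,838,073 bytes over 19m21s (1161 s):
echo "235164838073 / (19*60 + 21) / 1000000" | bc -l   # ≈ 202.5 MB/s
```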
## Download
To download all model weights, `cd` into the directory where you want them, then run this:
Linux:
```sh
curl -o- https://raw.githubusercontent.com/shawwn/llama-dl/56f50b96072f42fb2520b1ad5a1d6ef30351f23c/llama.sh | bash
```
Mac:
```sh
brew install bash
curl -o- https://raw.githubusercontent.com/shawwn/llama-dl/56f50b96072f42fb2520b1ad5a1d6ef30351f23c/llama.sh | $(brew --prefix)/bin/bash
```
(Sorry, Mac users: the script uses array syntax that isn't supported by the version of bash that ships with macOS.)
Running random bash scripts generally isn't a good idea, but I'll stake my personal reputation on this link being safe. (It pins a specific commit SHA-1 rather than pointing at https://raw.githubusercontent.com/shawwn/llama-dl/main/llama.sh, so it stays safe even in the event that my repo or account gets compromised.)
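If piping curl into bash still makes you uneasy, one alternative (my suggestion, not part of the official instructions) is to save the script, read it, and then run it:
```sh
curl -fsSL -o llama.sh https://raw.githubusercontent.com/shawwn/llama-dl/56f50b96072f42fb2520b1ad5a1d6ef30351f23c/llama.sh
less llama.sh   # inspect it yourself before running
bash llama.sh   # on Mac: $(brew --prefix)/bin/bash llama.sh
```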
## How much space do I need?
219G (235164838073 bytes) total. Here are the sizes of the individual files for reference:
```sh
./tokenizer_checklist.chk 50
./tokenizer.model 499723
./7B/checklist.chk 100
./7B/consolidated.00.pth 13476939516
./7B/params.json 101
./13B/checklist.chk 154
./13B/consolidated.00.pth 13016334699
./13B/consolidated.01.pth 13016334699
./13B/params.json 101
./30B/checklist.chk 262
./30B/consolidated.00.pth 16265763099
./30B/consolidated.01.pth 16265763099
./30B/consolidated.02.pth 16265763099
./30B/consolidated.03.pth 16265763099
./30B/params.json 101
./65B/checklist.chk 478
./65B/consolidated.00.pth 16323959449
./65B/consolidated.01.pth 16323959449
./65B/consolidated.02.pth 16323959449
./65B/consolidated.03.pth 16323959449
./65B/consolidated.04.pth 16323959449
./65B/consolidated.05.pth 16323959449
./65B/consolidated.06.pth 16323959449
./65B/consolidated.07.pth 16323959449
./65B/params.json 101
---
total 235164838073
```
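Before starting, it's worth confirming the target filesystem actually has the room, and afterwards that the total matches. A quick check (assuming GNU coreutils; BSD/macOS `du` lacks `-b`):
```sh
df -h .    # you need roughly 220G free here
# after the download completes:
du -sb .   # should be close to 235164838073 (slightly more, due to directory overhead)
```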
## How do I know this is safe?
I ran this:
```sh
mkdir LLaMA
cd LLaMA
time curl -o- https://raw.githubusercontent.com/shawwn/llama-dl/56f50b96072f42fb2520b1ad5a1d6ef30351f23c/llama.sh | bash
cd ..
webtorrent 'magnet:?xt=urn:btih:b8287ebfa04f879b048d4d4404108cf3e8014352&dn=LLaMA&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce'
```
Webtorrent verified the files and began seeding immediately, which means every piece hash matched the torrent's metadata, i.e. every file is identical to what you would've gotten via the torrent. So this is just a faster version of the torrent.
<img width="310" alt="image" src="https://user-images.githubusercontent.com/59632/222940942-0051a645-b561-4f0b-878c-3d195354d526.png">
<img width="310" alt="image" src="https://user-images.githubusercontent.com/59632/222941107-b4ef0b21-3fa7-40d1-ae56-cbe385e6ac00.png">
## How much faster?
Roughly 18x. As of March 4, 2023, the torrent seems to download at around 11MB/s, whereas this download script averages around 120MB/s, occasionally bursting up to 220MB/s.
<img width="300" alt="image" src="https://user-images.githubusercontent.com/59632/222940992-f037b12c-c077-4136-8960-b2b1667ddc79.png">
## Will I get in trouble for using this download link?
I doubt it. This is the download link that was leaked in the original torrent. (i.e. the leaker accidentally included their own unique download link that Facebook sent them.)
Technically, it may be illegal to knowingly use a private download link that was intended for someone else. Realistically, Facebook would risk their ML reputation by going after people who are merely trying to use what they themselves advertise as "open source."
## Final thoughts
I was shocked that this script was distributed with the original torrent, and that no one seemed to notice (a) that it still works, and (b) that it's almost 20x faster than the torrent method. I was impatient and curious to try running 65B on an 8xA100 cluster, so I didn't want to wait till tomorrow and started poking around, which is when I found this. I decided to just tweet it out and let you, fellow scientists and hackers, enjoy it before Facebook notices and shuts it off.
"Power to the people" is an overused trope, but as a research scientist, I feel it's important to let individual hackers be able to experiment with the same tools, techniques, and systems that professional ML researchers are fortunate to have access to. This is a tricky situation, because at some point between now and 10 years from now, this might become dangerous -- AI alarmists often ask "Would you want random people experimenting with nuclear weapons in their basement?" My answer is "No, but we're not there yet."
Word on Twitter is that LLaMA's samples seem worse than GPT-3 by a large margin, but then I realized no one has really been able to try the full 65B model yet, for a combination of reasons. (Mostly lack of access to 8xA100 hardware.) So I decided to try it out for myself and see.
Even if it's GPT-3 level, the fact is, LLaMA is already openly available. The torrent isn't going anywhere. So my own thoughts on this are mostly irrelevant; determined hackers can get it themselves anyway.
But for what it's worth, my personal opinion is that LLaMA probably isn't OpenAI-grade -- there's a big difference between training a model in an academic setting vs. training one when your entire company depends on it for wide-scale commercial success. I wasn't impressed that 30B didn't seem to know who Captain Picard was.
People have already started decrying this leak as dangerous. But everyone used to say the same thing about GPT-2's 1.5B model. (In fact, the allure of 1.5B's grandiose claims was what drove me to take ML seriously in 2019.) Turns out, four years later, no one really cares about 1.5B anymore, and it certainly didn't cause wide-scale societal harm. I doubt LLaMA will either.
2023 will be interesting. I can't wait for 2024.
Signed with love,
Shawn Presser
twitter: [@theshawwn](https://twitter.com/theshawwn)
HN: [sillysaurusx](https://news.ycombinator.com/user?id=sillysaurusx)