add GPT2-GPTQ-4bit

pull/17/head
Maxime Labonne 12 months ago committed by GitHub
parent f10636d564
commit 3473651eeb
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

@ -9,6 +9,10 @@ A list of notebooks and articles related to large language models.
| Decoding Strategies in Large Language Models | A guide to text generation from beam search to nucleus sampling | [Article](https://mlabonne.github.io/blog/posts/2022-06-07-Decoding_strategies.html) | <a href="https://colab.research.google.com/drive/19CJlOS5lI29g-B3dziNn93Enez1yiHk2?usp=sharing"><img src="images/colab.svg" alt="Open In Colab"></a> |
| Visualizing GPT-2's Loss Landscape | 3D plot of the loss landscape based on weight pertubations. | [Tweet](https://twitter.com/maximelabonne/status/1667618081844219904) | <a href="https://colab.research.google.com/drive/1Fu1jikJzFxnSPzR_V2JJyDVWWJNXssaL?usp=sharing"><img src="images/colab.svg" alt="Open In Colab"></a> |
| Improve ChatGPT with Knowledge Graphs | Augment ChatGPT's answers with knowledge graphs. | [Article](https://mlabonne.github.io/blog/posts/Article_Improve_ChatGPT_with_Knowledge_Graphs.html) | <a href="https://colab.research.google.com/drive/1mwhOSw9Y9bgEaIFKT4CLi0n18pXRM4cj?usp=sharing"><img src="images/colab.svg" alt="Open In Colab"></a> |
| Create a 4-bit GPT-2 model using AutoGPTQ | Quick notebook to quantize LLMs and upload them to the HF Hub. | [Tweet](https://twitter.com/maximelabonne/status/1672205530717122561) | <a href="https://colab.research.google.com/drive/1P6JEAdwfMtGP92aQCLUuMdsSnuxBUxvQ?usp=sharing"><img src="images/colab.svg" alt="Open In Colab"></a> |
## Roadmap

Loading…
Cancel
Save