From 3473651eeb194702399792a398915cffbc3b46c9 Mon Sep 17 00:00:00 2001
From: Maxime Labonne <81252890+mlabonne@users.noreply.github.com>
Date: Fri, 23 Jun 2023 12:36:12 +0100
Subject: [PATCH] add GPT2-GPTQ-4bit

---
 README.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/README.md b/README.md
index b9a3210..25e689b 100644
--- a/README.md
+++ b/README.md
@@ -9,6 +9,10 @@ A list of notebooks and articles related to large language models.
 | Decoding Strategies in Large Language Models | A guide to text generation from beam search to nucleus sampling | [Article](https://mlabonne.github.io/blog/posts/2022-06-07-Decoding_strategies.html) | Open In Colab |
 | Visualizing GPT-2's Loss Landscape | 3D plot of the loss landscape based on weight pertubations. | [Tweet](https://twitter.com/maximelabonne/status/1667618081844219904) | Open In Colab |
 | Improve ChatGPT with Knowledge Graphs | Augment ChatGPT's answers with knowledge graphs. | [Article](https://mlabonne.github.io/blog/posts/Article_Improve_ChatGPT_with_Knowledge_Graphs.html) | Open In Colab |
+| Create a 4-bit GPT-2 model using AutoGPTQ | Quick notebook to quantize LLMs and upload them to the HF Hub. | [Tweet](https://twitter.com/maximelabonne/status/1672205530717122561) | Open In Colab |
+
+
+
 ## Roadmap