langchain/modal.md at c0cca393a79cb25dcac9b919af2fa7f4f129e4e9

Enrico Shippole 9becdeaadf

Add Writer, Banana, Modal, StochasticAI (#1270 )

Add LLM wrappers and examples for Banana, Writer, Modal, Stochastic AI

Added rigid json format for Banana and Modal

2023-02-24 06:58:58 -08:00

1.6 KiB

Raw Blame History

This page covers how to use the Modal ecosystem within LangChain. It is broken into two parts: installation and setup, and then references to specific Modal wrappers.

Installation and Setup

Install with pip install modal-client
Run modal token new

You must include a prompt. There is a rigid response structure.

class Item(BaseModel):
    prompt: str

@stub.webhook(method="POST")
def my_webhook(item: Item):
    return {"prompt": my_function.call(item.prompt)}

An example with GPT2:

from pydantic import BaseModel

import modal

stub = modal.Stub("example-get-started")

volume = modal.SharedVolume().persist("gpt2_model_vol")
CACHE_PATH = "/root/model_cache"

@stub.function(
    gpu="any",
    image=modal.Image.debian_slim().pip_install(
        "tokenizers", "transformers", "torch", "accelerate"
    ),
    shared_volumes={CACHE_PATH: volume},
    retries=3,
)
def run_gpt2(text: str):
    from transformers import GPT2Tokenizer, GPT2LMHeadModel
    tokenizer = GPT2Tokenizer.from_pretrained('gpt2')
    model = GPT2LMHeadModel.from_pretrained('gpt2')
    encoded_input = tokenizer(text, return_tensors='pt').input_ids
    output = model.generate(encoded_input, max_length=50, do_sample=True)
    return tokenizer.decode(output[0], skip_special_tokens=True)

class Item(BaseModel):
    prompt: str

@stub.webhook(method="POST")
def get_text(item: Item):
    return {"prompt": run_gpt2.call(item.prompt)}

Wrappers

LLM

There exists an Modal LLM wrapper, which you can access with

from langchain.llms import Modal

1.6 KiB Raw Blame History

Modal

Installation and Setup

Define your Modal Functions and Webhooks

Wrappers

LLM

1.6 KiB

Raw Blame History