gpt4free/g4f/Provider/DeepInfra.py

from __future__ import annotations
import json
import requests
from ..typing import AsyncResult, Messages
from .base_provider import AsyncGeneratorProvider, ProviderModelMixin
from ..requests import StreamSession
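# Async streaming provider for DeepInfra's OpenAI-compatible chat completion API.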
class DeepInfra(AsyncGeneratorProvider, ProviderModelMixin):
    url = "https://deepinfra.com"
    working = True
    supports_stream = True
    supports_message_history = True
    default_model = 'meta-llama/Llama-2-70b-chat-hf'
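    # Fetch the featured model list from the public API once and cache it on the class.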
    @classmethod
    def get_models(cls):
        if not cls.models:
            url = 'https://api.deepinfra.com/models/featured'
            models = requests.get(url).json()
            cls.models = [model['model_name'] for model in models]
        return cls.models
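    # Yield completion tokens from DeepInfra's OpenAI-compatible chat endpoint.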
    @classmethod
    async def create_async_generator(
        cls,
        model: str,
        messages: Messages,
        stream: bool,
        proxy: str = None,
        timeout: int = 120,
        auth: str = None,
        **kwargs
    ) -> AsyncResult:
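        # Browser-like headers mirroring the DeepInfra web embed client.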
        headers = {
            'Accept-Encoding': 'gzip, deflate, br',
            'Accept-Language': 'en-US',
            'Connection': 'keep-alive',
            'Content-Type': 'application/json',
            'Origin': 'https://deepinfra.com',
            'Referer': 'https://deepinfra.com/',
            'Sec-Fetch-Dest': 'empty',
            'Sec-Fetch-Mode': 'cors',
            'Sec-Fetch-Site': 'same-site',
            'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/119.0.0.0 Safari/537.36',
            'X-Deepinfra-Source': 'web-embed',
            'accept': 'text/event-stream',
            'sec-ch-ua': '"Google Chrome";v="119", "Chromium";v="119", "Not?A_Brand";v="24"',
            'sec-ch-ua-mobile': '?0',
            'sec-ch-ua-platform': '"macOS"',
        }
        if auth:
            headers['Authorization'] = f"bearer {auth}"
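        # The streaming session forwards the optional proxy and impersonates a Chrome client.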
        async with StreamSession(
            headers=headers,
            timeout=timeout,
            proxies={"https": proxy},
            impersonate="chrome110"
        ) as session:
            json_data = {
                'model': cls.get_model(model),
                'messages': messages,
                'temperature': kwargs.get("temperature", 0.7),
                'max_tokens': kwargs.get("max_tokens", 512),
                'stop': kwargs.get("stop", []),
                'stream': True
            }
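            # POST the payload and consume the response as a server-sent event stream.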
            async with session.post(
                'https://api.deepinfra.com/v1/openai/chat/completions',
                json=json_data
            ) as response:
                response.raise_for_status()
                first = True
                async for line in response.iter_lines():
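                    # SSE payload lines start with b"data: "; anything else is skipped.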
                    if not line.startswith(b"data: "):
                        continue
                    try:
                        json_line = json.loads(line[6:])
                        choices = json_line.get("choices", [{}])
                        finish_reason = choices[0].get("finish_reason")
                        if finish_reason:
                            break
                        token = choices[0].get("delta", {}).get("content")
                        if token:
                            # Strip leading whitespace from the very first non-empty token.
                            if first:
                                token = token.lstrip()
                            if token:
                                first = False
                                yield token
                    except Exception:
                        raise RuntimeError(f"Response: {line}")
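# Example usage (a minimal sketch; assumes this module is imported as part of the
# g4f package and an asyncio event loop is available):
#
#     import asyncio
#
#     async def demo():
#         async for token in DeepInfra.create_async_generator(
#             model=DeepInfra.default_model,
#             messages=[{"role": "user", "content": "Say hello"}],
#             stream=True,
#         ):
#             print(token, end="", flush=True)
#
#     asyncio.run(demo())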