Commit Graph

27 Commits (12872b3d2956a26fb7d1cd464b3672ebe40d94c1)

Author SHA1 Message Date
sigoden 12872b3d29
refactor: update models.yaml (#602) 2 months ago
sigoden 746b087111
refactor: add/modify rag-related config (#599) 2 months ago
sigoden 1f33b3a07a
refactor: rag default_chunk_size (#588) 2 months ago
sigoden a732291f33
refactor: rename `pass_max_tokens` to `require_max_tokens` (#562) 2 months ago
sigoden 1ec6abfaee
feat: support RAG (#560)
* feat: support RAG

* support more embeddings models and implement concurrent embedding api

* show the progress of addings paths

* ignore embedding context when saving message

* embedding model max_chunk_size => default_chunk_size

* support pdf and pandoc formats (docx, epub, ipynb)
2 months ago
rolfwilms 569317728c
fix: bedrock issues (#544)
* Removed extraneous key [stream] for AWS Bedrock Claude models.

* Reduceddefault  AWS Bedrock llama-3 max_output_tokens to 2048 to align with API requirements.

---------

Co-authored-by: Rolf Wilms <rwilms@csc.com>
3 months ago
sigoden 2ccbb0f06a
refactor: qiawen client add qwen-long (#537) 3 months ago
sigoden b4a40e3fed
feat: support function calling (#514)
* feat: support function calling

* fix on Windows OS

* implement multi-steps function calling

* fix on Windows OS

* add error for client not support function calling

* refactor message data structure and make claude client supporting function calling

* support reuse previous call results

* improve error handling for function calling

* use prefix `may_`  as indicator for `execute` type fucntions
3 months ago
sigoden 5378033b34
refactor: add gemini-1.5-flash to models.yaml (#510) 3 months ago
sigoden 369cf9a36a refactor: minor refinement 3 months ago
sigoden 20a507375e
refactor: update models.yaml (#501) 3 months ago
sigoden 7762cd6bed
refactor: model pass_max_tokens (#493) 3 months ago
sigoden d5fd624eb8 refactor: update models.yaml 3 months ago
sigoden 956a960390
feat: support zhipuai client (#491) 3 months ago
sigoden 0071d84aa5
feat: support deepseek client (#490) 3 months ago
sigoden 9b283024b4
feat: extract vertexai-claude client (#485) 3 months ago
sigoden 5eae392dbd
refactore: add models for openai-compatible platforms (#471) 4 months ago
sigoden 8dba46becf
feat: openai-compatible platforms share the same client (#469) 4 months ago
sigoden 50eac8b594
feat: support replicate client (#466) 4 months ago
sigoden ffb0af8236
refactor: add some openai-compatiable platforms to config.example.yaml (#464) 4 months ago
sigoden 4ddccc361c refactor: update groq models at models.yaml 4 months ago
sigoden 602494b650
refactor: merge config models, update client models (#460) 4 months ago
sigoden 34041a976c
feat: support cloudflare client (#459) 4 months ago
sigoden 865be2bf75
feat: non-streaming returns completion stats (#456) 4 months ago
sigoden 1f2b626703
feat: support bedrock client (#450) 4 months ago
sigoden 615bab215b
feat: support vertexai claude (#439) 4 months ago
sigoden d6df1e84a7
refactor: extract prelude models to models.yaml (#451) 4 months ago